Skip to main content

UltraVox :

UltraVox is a voice orchestration platform that allows you to connect and manage multiple text-to-speech providers through a unified interface. Using Custom Voices, UltraVox can call external TTS APIs such as FASEEH and use those voices seamlessly inside UltraVox workflows.

Prerequisites

Before starting, make sure you have:
  • An active FASEEH account
  • An active UltraVox account

Step-by-step: Add a FASEEH Custom Voice in UltraVox

Step 1: Create an API Key in FASEEH

Log in to your FASEEH account and navigate to the API or Developer section. Create a new API key and copy it. Keep this key secure, as it will be used inside UltraVox.

Step 2: Copy the Voice ID from FASEEH

In FASEEH, go to the Voices section. Select the voice you want to use. Under the voice options, you will find buttons to copy details. Use the button that copies the Voice ID and save it for later use.

Step 3: Log in to UltraVox

Log in to your UltraVox dashboard.

Step 4: Go to Voices and select Custom Voice

From the UltraVox navigation, open Voices and click on Custom Voice.

Step 5: Add a new Custom Voice

Click on Add Custom Voice to create a new voice integration.

Step 6: Fill in Voice Name and Description

Voice Name: Best practice is to use the same name as the voice in FASEEH. This makes it easier to identify and manage later.
Description: Add a short description such as language, dialect, gender, or intended use case.

Step 7: Configure the Custom Voice request

Copy and paste the configuration below into UltraVox. Replace the masked API key with your own FASEEH API key, and replace the Voice ID with the one you copied from FASEEH. Example configuration:
{
"url": "https://api.faseeh.ai/api/v1/text-to-speech/faseeh-mini-v1-preview",
"headers": {
"x-api-key": "REDACTED",
"Content-Type": "application/json"
},
"body": {
"text": "{text}",
"stability": 0.5,
"speed": 1.0,
"streaming": true,
"voice_id": "YOUR_VOICE_ID"
},
"responseSampleRate": 24000
}

Request details

Authentication
FASEEH requires API key authentication using the x-api-key header.
Path parameters
model_id (string, required): The model identifier used for text-to-speech generation. In this example, faseeh-v1-preview is used.
Request body fields
  • voice_id (string, required): The FASEEH voice ID used for synthesis
  • text (string, required): The Arabic text to convert into speech
  • stability (number, required): A value between 0.0 and 1.0. Higher values produce more consistent output
  • speed (number, optional): A value between 0.7 and 1.2. Default is 1.0
    streaming (boolean, required): Must be false to receive a complete WAV file response