Generate speech from Arabic text using a specific model. Supports both streaming and non-streaming responses.
Authentication: API key in the x-api-key header.
| Parameter | Type | Required | Description |
|---|---|---|---|
| model_id | string | Yes | The model identifier to use for generation |
| Field | Type | Required | Description |
|---|---|---|---|
| voice_id | string | Yes | The voice ID to use for synthesis |
| text | string | Yes | The Arabic text to convert to speech |
| stability | number | Yes | Voice stability (0.0 to 1.0). Higher values produce more consistent output |
| speed | number | No | Speech speed (0.7 to 1.2, default 1.0). Values below 1.0 slow down speech, values above 1.0 speed it up |
| streaming | boolean | Yes | Must be true for streaming response |
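
The fields above can be assembled and range-checked on the client before anything is sent. The following is a minimal sketch, not the official client: the endpoint URL, the POST method, the JSON content type, and the helper name `build_tts_request` are all assumptions, since this reference does not state them. Only the field names, the ranges, and the x-api-key header come from the tables above.

```python
import json
import urllib.request

# Hypothetical endpoint template: the real base URL and path are not given in this reference.
ENDPOINT_TEMPLATE = "https://api.example.com/v1/text-to-speech/{model_id}"


def build_tts_request(api_key, model_id, voice_id, text, stability, speed=1.0, streaming=True):
    """Validate the documented ranges and return an unsent urllib Request."""
    if not 0.0 <= stability <= 1.0:
        raise ValueError("stability must be between 0.0 and 1.0")
    if not 0.7 <= speed <= 1.2:
        raise ValueError("speed must be between 0.7 and 1.2")
    payload = {
        "voice_id": voice_id,
        "text": text,
        "stability": stability,
        "speed": speed,
        "streaming": streaming,
    }
    return urllib.request.Request(
        ENDPOINT_TEMPLATE.format(model_id=model_id),
        # ensure_ascii=False keeps the Arabic text readable in the UTF-8 body.
        data=json.dumps(payload, ensure_ascii=False).encode("utf-8"),
        # POST and a JSON body are assumptions; x-api-key is the documented header.
        headers={"x-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )
```

The returned object can be passed to `urllib.request.urlopen` once the real URL is substituted. Validating locally avoids a round trip for values the server would presumably reject.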
200 OK
Headers:
Content-Type: audio/raw;codec=pcm16;rate=24000;channels=1
Cache-Control: no-cache
Connection: keep-alive
Body: audio response, either a PCM16 audio stream (when streaming is true) or a complete WAV file (when streaming is false).
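
A raw PCM16 stream has no container, so most players cannot open it until the bytes are wrapped with the sample rate and channel count from the Content-Type header. The sketch below shows one way to do that with Python's standard `wave` module. The helper names `parse_audio_params` and `pcm16_to_wav` are illustrative, and the samples are assumed to be little-endian, which this reference does not state.

```python
import io
import wave


def parse_audio_params(content_type):
    """Split 'audio/raw;codec=pcm16;rate=24000;channels=1' into a media type and a parameter dict."""
    media_type, *params = [part.strip() for part in content_type.split(";")]
    return media_type, dict(p.split("=", 1) for p in params if "=" in p)


def pcm16_to_wav(pcm_chunks, content_type):
    """Join raw PCM16 chunks and wrap them in a WAV container using the header's rate and channels."""
    _, params = parse_audio_params(content_type)
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(int(params["channels"]))
        wav.setsampwidth(2)  # PCM16 means 2 bytes per sample
        wav.setframerate(int(params["rate"]))
        # Assumes little-endian samples, the byte order WAV expects.
        wav.writeframes(b"".join(pcm_chunks))
    return buf.getvalue()
```

At 24000 samples per second, one channel, and 2 bytes per sample, each second of audio is 48000 bytes, which is useful for sizing playback buffers.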