Retrieve a list of all available voices for text-to-speech synthesis
x-api-key header.
| Field | Type | Description |
|---|---|---|
voice_id | string | Unique identifier for the voice (used in text-to-speech requests) |
name | string | Human-readable name of the voice |
description | string | null | Detailed description of the voice characteristics |
gender | string | null | Gender of the voice (male, female, or null) |
age | string | null | Age category of the voice (middle, elderly, or null) |
languages | array[string] | List of language codes supported by the voice (e.g., ["ar", "en"]) |
dialect | array[string] | List of dialects supported by the voice (e.g., ["fusha", "emirati", "najdi"]) |
type | string | null | Voice type (neural or null) |
sample_url | string | URL to an audio sample of the voice |
voice_id from the response in text-to-speech generation endpoints:
POST /text-to-speech/:model_id - Include voice_id in the request bodyWS /text-to-speech - Include voice_id in the WebSocket messagefusha (Modern Standard Arabic), emirati, najdi, hijazi, kuwaiti, egyptian, british, etc.male or femalemiddle or elderlyar for Arabic, en for English)null values for certain fields. These are typically custom user-created voices. The voice_id can still be used in text-to-speech requests regardless of these field values.API key for authentication
List of available voices
Unique identifier for the voice (used in text-to-speech requests)
Human-readable name of the voice
List of language codes supported by the voice (e.g., ["ar", "en"])
List of dialects supported by the voice (e.g., ["fusha", "emirati", "najdi"])
URL to an audio sample of the voice
Detailed description of the voice characteristics
Gender of the voice
male, female Age category of the voice
middle, elderly Voice type
neural