Complete endpoint documentation with request/response examples.
Generate speech from text
text *required
Text to synthesize. Max 5000 characters.
voice *required
Voice ID (af_heart, am_michael, bf_emma, am_adam)
format optional
Audio format: mp3, wav, ogg. Default: mp3
speed optional
Playback speed (0.5 - 2.0). Default: 1.0
Generate video from text with voice
text *required
Text to convert to video
voice *required
Voice ID for audio track
resolution optional
1080p or 720p. Default: 1080p
aspect_ratio optional
16:9, 9:16, or 1:1. Default: 16:9