Text-to-Speech
Convert text to natural-sounding speech
6 models available
ElevenLabs Multilingual v2
ElevenLabs' most capable TTS model. Natural-sounding speech in 29 languages with emotion control and voice cloning.
ElevenLabs Turbo v2.5
Low-latency TTS model from ElevenLabs. Optimized for real-time applications with natural-sounding output.
OpenAI TTS-1
OpenAI's standard TTS model. Fast and affordable text-to-speech synthesis with good quality for most applications.
OpenAI TTS-1 HD
OpenAI's high-definition text-to-speech model. Natural, human-like voice synthesis with 6 preset voices.
Tortoise TTS
High-quality multi-speaker TTS. Generates natural speech with voice cloning capabilities from short reference clips.
XTTS-v2
Coqui's cross-lingual TTS model. Generate speech in 17 languages using voice cloning from a short reference clip.
Start Building with AI
Access all models through a single API. Get free credits when you sign up — no credit card required.