































































curl -X POST https://api.inworld.ai/tts/v1/voice:stream \
  -H "Authorization: Basic $INWORLD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Hi! What can I help you with today?",
    "voice_id": "Clive",
    "model_id": "inworld-tts-1.5-max",
    "audio_config": {
      "audio_encoding": "OGG_OPUS",
      "sample_rate_hertz": 16000
    }
  }'

Three of the top five models on Artificial Analysis are Inworld models. The rankings come from blind tests by thousands of real users, not internal evals. TTS-1.5 Max delivers over 30% more expressiveness than previous models, with optimized stability to eliminate hallucinations and artifacts.
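For readers calling the API from code rather than the shell, the curl request at the top of this page can be sketched in Python with only the standard library. The endpoint, headers, and JSON fields are copied from that curl example. The function name `build_tts_request` and the placeholder key are illustrative assumptions, not part of any Inworld SDK. The sketch only builds the request. It does not send it, because the response format is not described here.

```python
import json
import urllib.request

# Endpoint taken from the curl example on this page.
TTS_STREAM_URL = "https://api.inworld.ai/tts/v1/voice:stream"


def build_tts_request(text, api_key, voice_id="Clive", model_id="inworld-tts-1.5-max"):
    """Build (but do not send) a POST request mirroring the curl example.

    The function name and defaults are illustrative; only the URL, headers,
    and JSON field names come from the curl example.
    """
    payload = {
        "text": text,
        "voice_id": voice_id,
        "model_id": model_id,
        "audio_config": {
            "audio_encoding": "OGG_OPUS",
            "sample_rate_hertz": 16000,
        },
    }
    return urllib.request.Request(
        TTS_STREAM_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Basic {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# "YOUR_API_KEY" is a placeholder; in practice read the key from the environment.
request = build_tts_request("Hi! What can I help you with today?", "YOUR_API_KEY")
print(request.get_method(), request.full_url)
```

Passing the returned object to `urllib.request.urlopen` would send it. How the streamed audio should be read and decoded is left out, since that depends on response details not covered on this page.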
Test quality in Playground

Create custom voices instantly from 15 seconds of audio or a text description. Fine-tune with professional voice cloning for maximum fidelity. All methods produce production-ready voices you can use in the Playground or via API.


Built for realtime from the ground up: audio streams over WebSocket the instant it's synthesized, with no buffering delay. Latency is comparable to competitors at a fraction of the cost.
English, Spanish, French, Korean, Chinese, Hindi, Japanese, German, and more. Native-speaker quality in every language with cross-lingual cloning. Deploy globally without separate pipelines.
Explore voices

TTS-1.5 Mini starts at $15 per million characters, and TTS-1.5 Max at $30 per million. The next best option costs over $150. Scale to millions of users without worrying about cost.
View pricing
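
To make the per-character pricing concrete, here is a rough cost estimate using the starting prices quoted above ($15 and $30 per million characters). The dictionary keys and the function name `estimate_cost_usd` are hypothetical labels for illustration, not API model IDs, and actual billing may differ from these starting prices.

```python
# Starting prices in USD per million characters, as quoted on this page.
# The keys are illustrative labels, not confirmed API model IDs.
PRICE_PER_MILLION_USD = {
    "tts-1.5-mini": 15.0,
    "tts-1.5-max": 30.0,
}


def estimate_cost_usd(characters, model):
    """Estimate the synthesis cost in USD for a given number of characters."""
    return characters / 1_000_000 * PRICE_PER_MILLION_USD[model]


# Example: 2.5 million characters of synthesized text per month.
for model in PRICE_PER_MILLION_USD:
    print(f"{model}: ${estimate_cost_usd(2_500_000, model):.2f}")
# tts-1.5-mini: $37.50
# tts-1.5-max: $75.00
```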