Introducing Inworld TTS

State-of-the-art Voice AI at a radically accessible price point. Join thousands of developers building with 50+ voices and simple cloning in 20+ languages at just $5/million characters.

Version: Inworld-TTS-1 | Inworld-TTS-1-max
Radically accessible pricing: $5/1M characters (≈ $0.25 per audio-hour) | $10/1M characters (≈ $0.50 per audio-hour)
Power
State-of-the-art quality (WER & similarity)
Real-time latency: Soon
Multilingual (12 languages)
Professional voice cloning (custom fine-tuning)
Embedded safeguards
SOC2 Type II
On-Premise deployments
Open-source training & modeling code
Free zero-shot voice cloning: Experimental | Experimental
Audio markups (emotion/style/non-verbals): Experimental | Experimental
Cross-lingual (same voice, language switch): Experimental | Experimental
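
The pricing figures above can be turned into a rough cost estimate. Dividing $0.25 per audio-hour by $5 per million characters implies about 50,000 characters per hour of audio. That rate is inferred from the listed numbers and is not stated on the page, so treat it as an assumption. The function and constant names below are illustrative only and are not part of any Inworld SDK.

```python
# Prices per one million characters, taken from the pricing figures above.
PRICE_PER_MILLION_USD = {
    "Inworld-TTS-1": 5.0,
    "Inworld-TTS-1-max": 10.0,
}

# Assumption: inferred from "$5/1M characters ≈ $0.25 per audio-hour".
# Actual speaking rate varies with voice, language, and content.
CHARS_PER_AUDIO_HOUR = 50_000


def cost_for_characters(model: str, n_chars: int) -> float:
    """Return the estimated cost in USD for synthesizing n_chars characters."""
    return PRICE_PER_MILLION_USD[model] * n_chars / 1_000_000


def cost_per_audio_hour(model: str) -> float:
    """Return the estimated cost in USD for one hour of generated audio."""
    return cost_for_characters(model, CHARS_PER_AUDIO_HOUR)


if __name__ == "__main__":
    for name in PRICE_PER_MILLION_USD:
        print(f"{name}: ${cost_per_audio_hour(name):.2f} per audio-hour")
```

Under the same assumption, a 300,000-character audiobook chapter set would come to roughly $1.50 on Inworld-TTS-1 and $3.00 on Inworld-TTS-1-max.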