Inworld TTS

Making state-of-the-art Voice AI radically accessible. Read our blog post here.

Real-time ready inworld-tts-1

$5 per million characters

13/1000

Note: This demo shows only a few of our most popular English voices. Get started to explore 50 voices in 11 languages—or to clone your own.

Available today in preview

VersionInworld-TTS-1Inworld-TTS-1-max
Radically accessible pricing$5/1M characters (roughly $0.25 per audio-hour)$10/1M characters (roughly $0.50 per audio-hour)
Power
State-of-the-art quality (Word Error Rate and Similarity Scores)
Real-time latencySoon
Multilingual (support for 11 languages)
Professional voice cloning (custom fine-tuning)
Embedded safeguards
SOC2 Type II
On-Premise Deployments
Open-Source training & modelling code
Free zero-shot voice cloningExperimentalExperimental
Audio markups (prompt tags for emotion, style and non-verbals)ExperimentalExperimental
Cross-lingual (language switching with same voice)ExperimentalExperimental
Swipe left to see more columns

Multilingual voices

Alain

French storyteller

Jing

Chinese support agent

Rafael

Spanish teacher

Minji

Korean podcaster

Emotions and non-verbal

Julia

[whispering]

Edward

[angry][sigh]

Sarah

[happy][breathe]

Wendy

[disappointed][laugh]

Education

Unlock engaging learning with expressive voices for e-learning platforms, language apps, and educational creators needing clear pronunciation and motivating narration.

Entertainment

Create immersive characters with emotionally dynamic voices for game developers, streaming platforms, and content creators bringing fictional worlds to life.

Content & Media

Deliver professional-grade narration with natural pacing for publishers, news organizations, and podcasters needing versatile, broadcast-quality human-like voices.

Voice assistant

Build trusted conversational experiences with warm, helpful voices for app developers, customer service platforms, and smart devices requiring empathetic interactions.

Supported Audio Formats

FormatSample rateBitrate
MP316kHz - 48kHz32kbps - 320kbps
PCM (PCL16)8kHz - 48kHz
μ-law /A-law8kHz
Opus8kHz - 48kHz6kbps - 256kbps
Swipe left to see more columns