Inworld TTS

Making state-of-the-art Voice AI radically accessible.

10/1000

Pricing: $5/million characters

Get started

Available today in preview

Radically accessible pricing$5 / 1M chars
State-of-the-art quality (Word Error Rate and Similarity Scores)
Real-time latency
Multilingual
Professional voice cloning (custom fine-tuning)
Embedded safeguards
SOC2 Type II
On-Premise Deployments
Open-Source training & modeling code
Larger (Max) model for use cases requiring ultra-realismExperimental
Free zero shot voice cloningExperimental
Audio markups (prompt tags for emotion, style and non-verbals)Experimental
Cross-lingual (language switching with same voice)Experimental

Multilingual voices

Now also available for Simplified Chinese (Mandarin), Korean, and Japanese

Jing

Chinese support agent

Yoona

Korean podcaster

Minji

Korean support agent

Asuka

Japanese newscaster

Emotions and non-verbal

Alex

[angry][cough]

Mark

[happy][clear-throat]

Julia

[sad][breathe]

Edward

[whispering][cough]

Education

Unlock engaging learning with expressive voices for e-learning platforms, language apps, and educational creators needing clear pronunciation and motivating narration.

Alex

Julia

Edward

Entertainment

Create immersive characters with emotionally dynamic voices for game developers, streaming platforms, and content creators bringing fictional worlds to life.

Hades

Sarah

Theodore

Content & Media

Deliver professional-grade narration with natural pacing for publishers, news organizations, and podcasters needing versatile, broadcast-quality human-like voices.

Ashley

Deborah

Mark

Voice assistant

Build trusted conversational experiences with warm, helpful voices for app developers, customer service platforms, and smart devices requiring empathetic interactions.

Shaun

Timothy

Wendy

Format

FormatSample rateBit rate
MP316kHz - 48kHz32kbps - 320kbps
PCM (PCL16)8kHz - 48kHz
μ-law /A-law8kHz
Opus8kHz - 48kHz6kbps - 256kbps
Swipe left to see more columns