Claude 3 Haiku
Input Cost ($/1M tokens)
$0.25
Output Cost ($/1M tokens)
$1.25
Claude Haiku 4.5
Input Cost ($/1M tokens)
$1
Output Cost ($/1M tokens)
$5
Claude Opus 4.1
Input Cost ($/1M tokens)
$15
Output Cost ($/1M tokens)
$75
Claude Opus 4.2
Input Cost ($/1M tokens)
$15
Output Cost ($/1M tokens)
$75
Claude Opus 4.5
Input Cost ($/1M tokens)
$5
Output Cost ($/1M tokens)
$25
Claude Opus 4.6
Input Cost ($/1M tokens)
$5
Output Cost ($/1M tokens)
$25
Claude Sonnet 4
Input Cost ($/1M tokens)
$3
Output Cost ($/1M tokens)
$15
Claude Sonnet 4.5
Input Cost ($/1M tokens)
$3
Output Cost ($/1M tokens)
$15
Claude Sonnet 4.6
Input Cost ($/1M tokens)
$3
Output Cost ($/1M tokens)
$15
GPT OSS 120B
Input Cost ($/1M tokens)
$0.35
Output Cost ($/1M tokens)
$0.75
Llama 3.1 Instruct 8B Cerebras
Input Cost ($/1M tokens)
$0.3
Output Cost ($/1M tokens)
$0.9
LLama-3.3-70b
Input Cost ($/1M tokens)
$0.85
Output Cost ($/1M tokens)
$1.2
ByteDance/Seed-1.8
Input Cost ($/1M tokens)
$0.25
Output Cost ($/1M tokens)
$2
ByteDance/Seed-2.0-mini
Input Cost ($/1M tokens)
$0.1
Output Cost ($/1M tokens)
$0.4
DeepSeek R1 Distill Llama 70B
Input Cost ($/1M tokens)
$0.7
Output Cost ($/1M tokens)
$0.8
DeepSeek V3.1 Terminus
Input Cost ($/1M tokens)
$0.21
Output Cost ($/1M tokens)
$0.79
DeepSeek V3.2
Input Cost ($/1M tokens)
$0.26
Output Cost ($/1M tokens)
$0.38
deepseek-ai/DeepSeek-R1-0528
Input Cost ($/1M tokens)
$0.5
Output Cost ($/1M tokens)
$2.15
deepseek-ai/DeepSeek-R1-0528-Turbo
Input Cost ($/1M tokens)
$1
Output Cost ($/1M tokens)
$3
deepseek-v3
Input Cost ($/1M tokens)
$0.32
Output Cost ($/1M tokens)
$0.89
deepseek-v3-0324
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.77
deepseek-v3-1
Input Cost ($/1M tokens)
$0.21
Output Cost ($/1M tokens)
$0.79
GLM 4.6
Input Cost ($/1M tokens)
$0.43
Output Cost ($/1M tokens)
$1.75
GLM 4.7
Input Cost ($/1M tokens)
$0.4
Output Cost ($/1M tokens)
$1.75
glm-4-7-flash
Input Cost ($/1M tokens)
$0.06
Output Cost ($/1M tokens)
$0.4
glm-5
Input Cost ($/1M tokens)
$0.8
Output Cost ($/1M tokens)
$2.56
Google Gemma 3 12b it
Input Cost ($/1M tokens)
$0.04
Output Cost ($/1M tokens)
$0.13
Google Gemma 3 27b it
Input Cost ($/1M tokens)
$0.08
Output Cost ($/1M tokens)
$0.16
Google Gemma 3 4b-it
Input Cost ($/1M tokens)
$0.04
Output Cost ($/1M tokens)
$0.08
GPT OSS 120B
Input Cost ($/1M tokens)
$0.039
Output Cost ($/1M tokens)
$0.19
GPT OSS 120B turbo
Input Cost ($/1M tokens)
$0.15
Output Cost ($/1M tokens)
$0.6
GPT OSS 20B
Input Cost ($/1M tokens)
$0.03
Output Cost ($/1M tokens)
$0.14
Gryphe/MythoMax-L2-13b
Input Cost ($/1M tokens)
$0.4
Output Cost ($/1M tokens)
$0.4
Hermes 3 Llama-3.1 405B
Input Cost ($/1M tokens)
$1
Output Cost ($/1M tokens)
$1
hermes-3-llama-3-1-70b
Input Cost ($/1M tokens)
$0.3
Output Cost ($/1M tokens)
$0.3
Kimi K2 Instruct 0905
Input Cost ($/1M tokens)
$0.4
Output Cost ($/1M tokens)
$2
Kimi K2 Thinking
Input Cost ($/1M tokens)
$0.47
Output Cost ($/1M tokens)
$2
kimi-k2-5
Input Cost ($/1M tokens)
$0.6
Output Cost ($/1M tokens)
$3
Llama 3.1 Nemotron 70B Instruct
Input Cost ($/1M tokens)
$1.2
Output Cost ($/1M tokens)
$1.2
Llama 3.2 11B Vision Instruct
Input Cost ($/1M tokens)
$0.049
Output Cost ($/1M tokens)
$0.049
Llama 3.2 3B Instruct
Input Cost ($/1M tokens)
$0.02
Output Cost ($/1M tokens)
$0.02
Llama 3.3 Nemotron Super 49B v1.5
Input Cost ($/1M tokens)
$0.1
Output Cost ($/1M tokens)
$0.4
llama-3-1-instruct-8b
Input Cost ($/1M tokens)
$0.03
Output Cost ($/1M tokens)
$0.04
llama-3-1-instruct-8b-turbo
Input Cost ($/1M tokens)
$0.02
Output Cost ($/1M tokens)
$0.03
llama-3-3-instruct-70b-turbo
Input Cost ($/1M tokens)
$0.1
Output Cost ($/1M tokens)
$0.32
llama-4-maverick
Input Cost ($/1M tokens)
$0.15
Output Cost ($/1M tokens)
$0.6
llama-4-scout
Input Cost ($/1M tokens)
$0.08
Output Cost ($/1M tokens)
$0.3
Llama-Guard-4-12B
Input Cost ($/1M tokens)
$0.18
Output Cost ($/1M tokens)
$0.18
meta-llama/Llama-Guard-4-12B
Input Cost ($/1M tokens)
$0.18
Output Cost ($/1M tokens)
$0.18
Microsoft Phi 4
Input Cost ($/1M tokens)
$0.07
Output Cost ($/1M tokens)
$0.14
MiniMax M2.1
Input Cost ($/1M tokens)
$0.27
Output Cost ($/1M tokens)
$0.95
MiniMax M2.5
Input Cost ($/1M tokens)
$0.27
Output Cost ($/1M tokens)
$0.95
Mistral Nemo Instruct 2407
Input Cost ($/1M tokens)
$0.02
Output Cost ($/1M tokens)
$0.04
Mistral Small 24B Instruct 2501
Input Cost ($/1M tokens)
$0.05
Output Cost ($/1M tokens)
$0.08
Mistral Small 3.2 24B Instruct 2506
Input Cost ($/1M tokens)
$0.075
Output Cost ($/1M tokens)
$0.2
Mixtral-8x7B-Instruct-v0.1
Input Cost ($/1M tokens)
$0.54
Output Cost ($/1M tokens)
$0.54
Nemotron 3 Nano 30B A3B
Input Cost ($/1M tokens)
$0.06
Output Cost ($/1M tokens)
$0.24
NousResearch/Hermes-3-Llama-3.1-405B
Input Cost ($/1M tokens)
$1
Output Cost ($/1M tokens)
$1
nvidia-nemotron-3-nano-30b-a3b
Input Cost ($/1M tokens)
$0.05
Output Cost ($/1M tokens)
$0.2
nvidia-nemotron-nano-12b-v2-vl
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.6
nvidia-nemotron-nano-9b-v2
Input Cost ($/1M tokens)
$0.04
Output Cost ($/1M tokens)
$0.16
nvidia/Llama-3.3-Nemotron-Super-49B-v1.5
Input Cost ($/1M tokens)
$0.1
Output Cost ($/1M tokens)
$0.4
olmo-3-1-32b-instruct
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.6
phi-4
Input Cost ($/1M tokens)
$0.07
Output Cost ($/1M tokens)
$0.14
Qwen 2.5 72B Instruct
Input Cost ($/1M tokens)
$0.12
Output Cost ($/1M tokens)
$0.39
Qwen/Qwen3-Max
Input Cost ($/1M tokens)
$1.2
Output Cost ($/1M tokens)
$6
Qwen/Qwen3-Max-Thinking
Input Cost ($/1M tokens)
$1.2
Output Cost ($/1M tokens)
$6
Qwen2.5 VL 32B Instruct
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.6
Qwen3 235B A22B Thinking 2507
Input Cost ($/1M tokens)
$0.23
Output Cost ($/1M tokens)
$2.3
Qwen3 30B A3B
Input Cost ($/1M tokens)
$0.08
Output Cost ($/1M tokens)
$0.29
Qwen3 Next 80B A3B Instruct
Input Cost ($/1M tokens)
$0.09
Output Cost ($/1M tokens)
$1.1
Qwen3 VL 235B A22B Instruct
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.88
Qwen3 VL 30B A3B Instruct
Input Cost ($/1M tokens)
$0.15
Output Cost ($/1M tokens)
$0.6
qwen3-14b-instruct
Input Cost ($/1M tokens)
$0.12
Output Cost ($/1M tokens)
$0.24
qwen3-235b-a22b-instruct-2507
Input Cost ($/1M tokens)
$0.071
Output Cost ($/1M tokens)
$0.1
qwen3-30b-a3b-instruct
Input Cost ($/1M tokens)
$0.08
Output Cost ($/1M tokens)
$0.28
qwen3-32b-instruct
Input Cost ($/1M tokens)
$0.08
Output Cost ($/1M tokens)
$0.28
qwen3-coder-480b-a35b-instruct
Input Cost ($/1M tokens)
$0.4
Output Cost ($/1M tokens)
$1.6
qwen3-coder-480b-a35b-instruct-turbo
Input Cost ($/1M tokens)
$0.22
Output Cost ($/1M tokens)
$1
Sao10K/L3-8B-Lunaris-v1-Turbo
Input Cost ($/1M tokens)
$0.04
Output Cost ($/1M tokens)
$0.05
Sao10K/L3.1-70B-Euryale-v2.2
Input Cost ($/1M tokens)
$0.85
Output Cost ($/1M tokens)
$0.85
Sao10K/L3.1-70B-Euryale-v2.3
Input Cost ($/1M tokens)
$0.85
Output Cost ($/1M tokens)
$0.85
zai-org/GLM-4.6
Input Cost ($/1M tokens)
$0.43
Output Cost ($/1M tokens)
$1.74
zai-org/GLM-4.6V
Input Cost ($/1M tokens)
$0.3
Output Cost ($/1M tokens)
$0.9
DeepSeek V3.2 FW
Input Cost ($/1M tokens)
$0.56
Output Cost ($/1M tokens)
$1.68
glm-4-7 FW
Input Cost ($/1M tokens)
$0.6
Output Cost ($/1M tokens)
$2.2
glm-5 FW
Input Cost ($/1M tokens)
$1
Output Cost ($/1M tokens)
$3.2
GPT OSS 120B FW
Input Cost ($/1M tokens)
$0.15
Output Cost ($/1M tokens)
$0.6
GPT OSS 20B
Input Cost ($/1M tokens)
$0.07
Output Cost ($/1M tokens)
$0.3
Kimi K2 Instruct
Input Cost ($/1M tokens)
$0.6
Output Cost ($/1M tokens)
$2.5
Kimi K2 Thinking
Input Cost ($/1M tokens)
$0.6
Output Cost ($/1M tokens)
$2.5
Kimi-K2-5 FW
Input Cost ($/1M tokens)
$0.6
Output Cost ($/1M tokens)
$3
Meta Llama 3.1 405B
Input Cost ($/1M tokens)
$3
Output Cost ($/1M tokens)
$3
Meta Llama 3.1 8B Instruct
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.2
Meta Llama 3.2 3B Instruct
Input Cost ($/1M tokens)
$0.1
Output Cost ($/1M tokens)
$0.1
Meta Llama 3.3 70B Instruct
Input Cost ($/1M tokens)
$0.9
Output Cost ($/1M tokens)
$0.9
MiniMax-M2-1-FW
Input Cost ($/1M tokens)
$0.3
Output Cost ($/1M tokens)
$1.2
Mixtral 8x22b instruct
Input Cost ($/1M tokens)
$1.2
Output Cost ($/1M tokens)
$1.2
Gemini 2.0 Flash
Input Cost ($/1M tokens)
$0.15
Output Cost ($/1M tokens)
$0.6
Gemini 2.0 Flash Lite
Input Cost ($/1M tokens)
$0.075
Output Cost ($/1M tokens)
$0.3
Gemini 2.5 Flash
Input Cost ($/1M tokens)
$0.3
Output Cost ($/1M tokens)
$2.5
Gemini 2.5 Flash Lite
Input Cost ($/1M tokens)
$0.1
Output Cost ($/1M tokens)
$0.4
Gemini 2.5 Pro
Input Cost ($/1M tokens)
$1.25
Output Cost ($/1M tokens)
$10
Gemini 3 Flash
Input Cost ($/1M tokens)
$0.5
Output Cost ($/1M tokens)
$3
Gemini 3 Pro
Input Cost ($/1M tokens)
$2
Output Cost ($/1M tokens)
$12
Gemini 3.1 Pro
Input Cost ($/1M tokens)
$2
Output Cost ($/1M tokens)
$12
Gemini Pro Latest
Input Cost ($/1M tokens)
$1.25
Output Cost ($/1M tokens)
$10
DeepSeek R1 Distill Llama 70B 128k
Input Cost ($/1M tokens)
$0.75
Output Cost ($/1M tokens)
$0.99
Gemma 2 9B 8k
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.2
GPT OSS 120B Groq
Input Cost ($/1M tokens)
$0.15
Output Cost ($/1M tokens)
$0.6
GPT OSS 20B Groq
Input Cost ($/1M tokens)
$0.075
Output Cost ($/1M tokens)
$0.3
GPT OSS 20B Safeguard
Input Cost ($/1M tokens)
$0.075
Output Cost ($/1M tokens)
$0.3
Kimi K2 Groq
Input Cost ($/1M tokens)
$1
Output Cost ($/1M tokens)
$3
Llama 3.1 Instruct 8B Groq
Input Cost ($/1M tokens)
$0.05
Output Cost ($/1M tokens)
$0.08
Llama 3.3 Instruct 70B
Input Cost ($/1M tokens)
$0.59
Output Cost ($/1M tokens)
$0.79
Llama 4 Maverick Groq
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.6
Llama 4 Scout
Input Cost ($/1M tokens)
$0.11
Output Cost ($/1M tokens)
$0.34
Llama Guard 4 12B Groq
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.2
Qwen3 32B Instruct Groq
Input Cost ($/1M tokens)
$0.29
Output Cost ($/1M tokens)
$0.59
gemma3 12B
Contact for pricing
Contact for pricing
gemma3 27B
Contact for pricing
Contact for pricing
gpt-oss 20B
Contact for pricing
Contact for pricing
llama3.1 8B
Contact for pricing
Contact for pricing
Codestral
Input Cost ($/1M tokens)
$0.3
Output Cost ($/1M tokens)
$0.9
Codestral Latest
Input Cost ($/1M tokens)
$0.3
Output Cost ($/1M tokens)
$0.9
Devstral 2
Input Cost ($/1M tokens)
$0.4
Output Cost ($/1M tokens)
$2
Devstral Medium Latest
Input Cost ($/1M tokens)
$0.4
Output Cost ($/1M tokens)
$2
Devstral Small
Input Cost ($/1M tokens)
$0.1
Output Cost ($/1M tokens)
$0.3
Devstral Small 2
Input Cost ($/1M tokens)
$0.1
Output Cost ($/1M tokens)
$0.3
labs-mistral-small-creative
Input Cost ($/1M tokens)
$0.1
Output Cost ($/1M tokens)
$0.3
Magistral Medium Latest
Input Cost ($/1M tokens)
$2
Output Cost ($/1M tokens)
$5
Magistral Small Latest
Input Cost ($/1M tokens)
$0.5
Output Cost ($/1M tokens)
$1.5
Ministral-3-14B
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.2
Ministral-3-3B
Input Cost ($/1M tokens)
$0.1
Output Cost ($/1M tokens)
$0.1
Ministral-3-8b
Input Cost ($/1M tokens)
$0.15
Output Cost ($/1M tokens)
$0.15
Mistral Small 3.2
Input Cost ($/1M tokens)
$0.1
Output Cost ($/1M tokens)
$0.3
Mistral-Large-2
Input Cost ($/1M tokens)
$2
Output Cost ($/1M tokens)
$6
Mistral-Large-3
Input Cost ($/1M tokens)
$0.5
Output Cost ($/1M tokens)
$1.5
Mistral-Large-Latest
Input Cost ($/1M tokens)
$0.5
Output Cost ($/1M tokens)
$1.5
Mistral-Medium-Latest
Input Cost ($/1M tokens)
$0.4
Output Cost ($/1M tokens)
$2
Mistral-Tiny-Latest
Input Cost ($/1M tokens)
$0.25
Output Cost ($/1M tokens)
$0.25
Open Mistral Nemo
Input Cost ($/1M tokens)
$0.15
Output Cost ($/1M tokens)
$0.15
Pixtral Large
Input Cost ($/1M tokens)
$2
Output Cost ($/1M tokens)
$6
GPT 3.5 Turbo
Input Cost ($/1M tokens)
$0.5
Output Cost ($/1M tokens)
$1.5
GPT 3.5 Turbo 16k
Input Cost ($/1M tokens)
$3
Output Cost ($/1M tokens)
$4
GPT 4
Input Cost ($/1M tokens)
$30
Output Cost ($/1M tokens)
$60
GPT 4 Turbo
Input Cost ($/1M tokens)
$10
Output Cost ($/1M tokens)
$30
GPT 4.1
Input Cost ($/1M tokens)
$2
Output Cost ($/1M tokens)
$8
GPT 4.1 Mini
Input Cost ($/1M tokens)
$0.4
Output Cost ($/1M tokens)
$1.6
GPT 4.1 Nano
Input Cost ($/1M tokens)
$0.1
Output Cost ($/1M tokens)
$0.4
GPT 4o
Input Cost ($/1M tokens)
$2.5
Output Cost ($/1M tokens)
$10
GPT 4o mini
Input Cost ($/1M tokens)
$0.15
Output Cost ($/1M tokens)
$0.6
GPT 4o Search Preview
Input Cost ($/1M tokens)
$2.5
Output Cost ($/1M tokens)
$10
GPT 5
Input Cost ($/1M tokens)
$1.25
Output Cost ($/1M tokens)
$10
GPT 5 Chat Latest
Input Cost ($/1M tokens)
$1.25
Output Cost ($/1M tokens)
$10
GPT 5 Mini
Input Cost ($/1M tokens)
$0.25
Output Cost ($/1M tokens)
$2
GPT 5 Nano
Input Cost ($/1M tokens)
$0.05
Output Cost ($/1M tokens)
$0.4
GPT 5.1
Input Cost ($/1M tokens)
$1.25
Output Cost ($/1M tokens)
$10
GPT 5.2
Input Cost ($/1M tokens)
$1.75
Output Cost ($/1M tokens)
$14
gpt-3.5-turbo-1106
Input Cost ($/1M tokens)
$1
Output Cost ($/1M tokens)
$2
gpt-4-0125-preview
Input Cost ($/1M tokens)
$10
Output Cost ($/1M tokens)
$30
gpt-4-0613
Input Cost ($/1M tokens)
$30
Output Cost ($/1M tokens)
$60
gpt-4-1106-preview
Input Cost ($/1M tokens)
$10
Output Cost ($/1M tokens)
$30
gpt-4o-2024-05-13
Input Cost ($/1M tokens)
$5
Output Cost ($/1M tokens)
$15
gpt-4o-mini-search-preview
Input Cost ($/1M tokens)
$0.15
Output Cost ($/1M tokens)
$0.6
gpt-5-search-api
Input Cost ($/1M tokens)
$1.25
Output Cost ($/1M tokens)
$10
gpt-5.1-chat-latest
Input Cost ($/1M tokens)
$1.25
Output Cost ($/1M tokens)
$10
gpt-5.2-chat-latest
Input Cost ($/1M tokens)
$1.75
Output Cost ($/1M tokens)
$14
gpt-realtime-mini-2025-10-06
Input Cost ($/1M tokens)
$0.6
Output Cost ($/1M tokens)
$2.4
o3
Input Cost ($/1M tokens)
$2
Output Cost ($/1M tokens)
$8
o3-mini
Input Cost ($/1M tokens)
$1.1
Output Cost ($/1M tokens)
$4.4
o4-mini
Input Cost ($/1M tokens)
$1.1
Output Cost ($/1M tokens)
$4.4
grok-2-vision
Input Cost ($/1M tokens)
$2
Output Cost ($/1M tokens)
$10
grok-2-vision-latest
Input Cost ($/1M tokens)
$2
Output Cost ($/1M tokens)
$10
grok-3
Input Cost ($/1M tokens)
$3
Output Cost ($/1M tokens)
$15
grok-3-fast
Input Cost ($/1M tokens)
$3
Output Cost ($/1M tokens)
$15
grok-3-fast-latest
Input Cost ($/1M tokens)
$3
Output Cost ($/1M tokens)
$15
grok-3-latest
Input Cost ($/1M tokens)
$3
Output Cost ($/1M tokens)
$15
grok-3-mini
Input Cost ($/1M tokens)
$0.3
Output Cost ($/1M tokens)
$0.5
grok-3-mini-fast
Input Cost ($/1M tokens)
$0.3
Output Cost ($/1M tokens)
$0.5
grok-3-mini-fast-latest
Input Cost ($/1M tokens)
$0.3
Output Cost ($/1M tokens)
$0.5
grok-3-mini-latest
Input Cost ($/1M tokens)
$0.3
Output Cost ($/1M tokens)
$0.5
grok-4
Input Cost ($/1M tokens)
$3
Output Cost ($/1M tokens)
$15
grok-4-1-fast
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.5
grok-4-1-fast-non-reasoning
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.5
grok-4-1-fast-non-reasoning-latest
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.5
grok-4-1-fast-reasoning
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.5
grok-4-1-fast-reasoning-latest
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.5
grok-4-fast
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.5
grok-4-fast-non-reasoning
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.5
grok-4-fast-non-reasoning-latest
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.5
grok-4-fast-reasoning
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.5
grok-4-fast-reasoning-latest
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$0.5
grok-4-latest
Input Cost ($/1M tokens)
$3
Output Cost ($/1M tokens)
$15
grok-code-fast
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$1.5
grok-code-fast-1
Input Cost ($/1M tokens)
$0.2
Output Cost ($/1M tokens)
$1.5