Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8

Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8 is a large language model from Meta that supports function calling. It has a 1M-token context window and can generate up to 1M tokens of output. It is priced at $0.15 per million input tokens and $0.60 per million output tokens. Access Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8 through Inworld Router, which provides OpenAI SDK compatibility, built-in failover, and intelligent routing across providers.
Function Calling
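
Since the page advertises both function calling and OpenAI SDK compatibility, a request that offers the model a tool might look like the sketch below. It assumes the endpoint accepts the OpenAI-style `tools` field, which this page does not confirm. The `get_weather` tool and its schema are hypothetical, and the model name is the router name used in the steps on this page.

```python
import json


def build_tool_call_request(model: str, user_message: str) -> dict:
    """Build a chat completion body that offers the model one callable tool.

    Assumption: the endpoint accepts the OpenAI-style "tools" field.
    """
    # Hypothetical tool definition, for illustration only.
    get_weather_tool = {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Look up the current weather for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
    return {
        "model": model,
        "messages": [{"role": "user", "content": user_message}],
        "tools": [get_weather_tool],
    }


body = build_tool_call_request(
    "inworld/compare-frontier-models", "What is the weather in Paris?"
)
print(json.dumps(body, indent=2))
```

The body is only built and printed here. Sending it requires the same headers as the chat completion request in Step 2.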

Step 1 — Create Router

curl --location 'https://api.inworld.ai/router/v1/routers' \
  --header 'Authorization: Basic <your-api-key>' \
  --header 'Content-Type: application/json' \
  --data '{
    "name": "compare-frontier-models",
    "defaultRoute": {
      "route_id": "default",
      "variants": [
        {
          "variant": {
            "variant_id": "deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8-a",
            "model_id": "deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8"
          },
          "weight": 33
        },
        {
          "variant": {
            "variant_id": "groq/llama-3.1-8b-instant-b",
            "model_id": "groq/llama-3.1-8b-instant"
          },
          "weight": 33
        },
        {
          "variant": {
            "variant_id": "groq/llama-3.3-70b-versatile-c",
            "model_id": "groq/llama-3.3-70b-versatile"
          },
          "weight": 34
        }
      ]
    }
  }'
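
The router body above can also be generated rather than written by hand. The sketch below rebuilds the same JSON from a list of (model ID, weight) pairs. Two things in it are assumptions taken from the example rather than documented rules: that weights should sum to 100, and that each `variant_id` is the model ID plus a letter suffix. The helper name `build_router_payload` is made up for illustration.

```python
import json
import string


def build_router_payload(name, weighted_models):
    """Build the JSON body for the router creation request.

    weighted_models is a list of (model_id, weight) pairs.
    Assumptions (inferred from the example, not documented rules):
    weights sum to 100, and variant_id is model_id plus "-a", "-b", ...
    """
    if len(weighted_models) > len(string.ascii_lowercase):
        raise ValueError("this sketch only supports up to 26 variants")
    total = sum(weight for _, weight in weighted_models)
    if total != 100:
        raise ValueError(f"weights sum to {total}, expected 100")

    variants = []
    for letter, (model_id, weight) in zip(string.ascii_lowercase, weighted_models):
        variants.append(
            {
                "variant": {
                    "variant_id": f"{model_id}-{letter}",
                    "model_id": model_id,
                },
                "weight": weight,
            }
        )
    return {
        "name": name,
        "defaultRoute": {"route_id": "default", "variants": variants},
    }


payload = build_router_payload(
    "compare-frontier-models",
    [
        ("deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8", 33),
        ("groq/llama-3.1-8b-instant", 33),
        ("groq/llama-3.3-70b-versatile", 34),
    ],
)
print(json.dumps(payload, indent=2))
```

The printed JSON can be passed as the `--data` argument of the curl command.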

Step 2 — Chat Completion

curl --location 'https://api.inworld.ai/v1/chat/completions' \
  --header 'Authorization: Basic <your-api-key>' \
  --header 'Content-Type: application/json' \
  --data '{
    "model": "inworld/compare-frontier-models",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
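
For readers who prefer Python to curl, the same request can be sketched with the standard library alone. The URL, headers, and body mirror the curl command. The `extract_reply` helper assumes the response follows the OpenAI chat completion shape (`choices[0].message.content`), which the page implies through its OpenAI SDK compatibility claim but does not show. All function names here are illustrative.

```python
import json
import urllib.request

CHAT_URL = "https://api.inworld.ai/v1/chat/completions"


def build_chat_request(api_key, model, user_message):
    """Build (but do not send) the chat completion request from Step 2."""
    body = {
        "model": model,
        "messages": [{"role": "user", "content": user_message}],
    }
    return urllib.request.Request(
        CHAT_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Basic {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request, timeout=30):
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


def extract_reply(response_body):
    """Return the assistant text.

    Assumption: the response uses the OpenAI chat completion shape.
    """
    return response_body["choices"][0]["message"]["content"]


# Usage (needs a real API key and network access):
# request = build_chat_request("<your-api-key>", "inworld/compare-frontier-models", "Hello!")
# print(extract_reply(send(request)))
```

Building the request separately from sending it makes the headers and body easy to inspect before any network call is made.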

Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8 pricing and providers

Access Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8 through DeepInfra via Inworld Router or the Realtime API. By using the AI provider(s) below, you acknowledge that you have reviewed and agree to their terms, which are listed in the Legal section under each AI provider's name.
Provider    Context window    Max output    Input price (per 1M tokens)    Output price (per 1M tokens)
DeepInfra   1M                1M            $0.15                          $0.60
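
To make the per-million pricing concrete, here is a small cost estimate using the DeepInfra prices quoted on this page ($0.15 input, $0.60 output). The function name and the sample token counts are illustrative, and the sketch ignores any fees or discounts the page does not mention.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, as listed on this page for DeepInfra.
INPUT_PRICE_PER_MILLION = Decimal("0.15")
OUTPUT_PRICE_PER_MILLION = Decimal("0.60")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens):
    """Estimate the USD cost of a request from its token counts."""
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_MILLION / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_MILLION / TOKENS_PER_MILLION
    return input_cost + output_cost


# Hypothetical workload: 200,000 input tokens and 50,000 output tokens.
print(estimate_cost(200_000, 50_000))
```

With these sample numbers the input and output sides each cost $0.03, for a total of $0.06. `Decimal` is used instead of floats so that currency arithmetic stays exact.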

Other Meta models available through Inworld

Compare Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8 with other Meta models available through Inworld Router or Realtime API.
Model                                        Context window  Max output  Input price (per 1M tokens)  Output price (per 1M tokens)
Llama 3.1 8b Instant                         128K            8.2K        $0.05                        $0.08
Llama 3.3 70b Versatile                      128K            32.8K       $0.59                        $0.79
Meta Llama/Llama 3.2 11B Vision Instruct     131.1K          131.1K      $0.245                       $0.245
Meta Llama/Llama 3.3 70B Instruct Turbo      131.1K          131.1K      $0.10                        $0.32
Meta Llama/llama 4 Scout 17b 16e Instruct    131.1K          8.2K        $0.11                        $0.34
Meta Llama/Llama 4 Scout 17B 16E Instruct    327.7K          327.7K      $0.08                        $0.30
Meta Llama/Llama Guard 4 12B                 163.8K          163.8K      $0.18                        $0.18
Meta Llama/Meta Llama 3.1 8B Instruct Turbo  131.1K          131.1K      $0.02                        $0.03
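
One way to use the listing above is to rank the models by what a given workload would cost. The sketch below copies the prices from the listing and assumes the two dollar figures on each row are the input and output prices per 1M tokens, in that order. The function name and the sample workload are illustrative.

```python
from decimal import Decimal

# (model name, input USD per 1M tokens, output USD per 1M tokens),
# copied from the listing on this page. Column meaning is an assumption.
MODEL_PRICES = [
    ("Llama 3.1 8b Instant", "0.05", "0.08"),
    ("Llama 3.3 70b Versatile", "0.59", "0.79"),
    ("Meta Llama/Llama 3.2 11B Vision Instruct", "0.245", "0.245"),
    ("Meta Llama/Llama 3.3 70B Instruct Turbo", "0.10", "0.32"),
    ("Meta Llama/llama 4 Scout 17b 16e Instruct", "0.11", "0.34"),
    ("Meta Llama/Llama 4 Scout 17B 16E Instruct", "0.08", "0.30"),
    ("Meta Llama/Llama Guard 4 12B", "0.18", "0.18"),
    ("Meta Llama/Meta Llama 3.1 8B Instruct Turbo", "0.02", "0.03"),
]


def rank_by_cost(input_tokens, output_tokens):
    """Return (model name, USD cost) pairs sorted from cheapest to priciest."""
    million = Decimal(1_000_000)
    costs = []
    for name, input_price, output_price in MODEL_PRICES:
        cost = (
            Decimal(input_tokens) * Decimal(input_price)
            + Decimal(output_tokens) * Decimal(output_price)
        ) / million
        costs.append((name, cost))
    return sorted(costs, key=lambda pair: pair[1])


# Hypothetical workload: 1M input tokens and 1M output tokens.
for name, cost in rank_by_cost(1_000_000, 1_000_000):
    print(f"{name}: ${cost}")
```

Price is only one axis. The context window and maximum output columns differ widely between rows, so a cheaper model may not fit a long-context workload.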

Start building with Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8

Access Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8 and every other model through Inworld Router or Realtime API. Create your free account and start routing in minutes.
Copyright © 2021-2026 Inworld AI