Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8

Name: Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8
Brand: Meta
Price: 0.15 USD
Availability: InStock

Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8 is a large language model by Meta with support for vision and structured output. It supports a 1M token context window with up to 1M tokens of output. Priced at $0.15 per million input tokens and $0.6 per million output tokens, it is one of the most cost-efficient options in its class. Access Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8 through Inworld Router with OpenAI SDK compatibility, built-in failover, and intelligent routing across providers.

→Response SchemaVisionresponseFormatCapability

Use This Model

Step 1 — Create Router

curl --location 'https://api.inworld.ai/router/v1/routers' \
--header 'Authorization: Basic <your-api-key>' \
--header 'Content-Type: application/json' \
--data '{
  "name": "compare-frontier-models",
  "defaultRoute": {
    "route_id": "default",
    "variants": [
      {
        "variant": {
          "variant_id": "deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8-a",
          "model_id": "deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8"
        },
        "weight": 33
      },
      {
        "variant": {
          "variant_id": "groq/llama-3.1-8b-instant-b",
          "model_id": "groq/llama-3.1-8b-instant"
        },
        "weight": 33
      },
      {
        "variant": {
          "variant_id": "groq/llama-3.3-70b-versatile-c",
          "model_id": "groq/llama-3.3-70b-versatile"
        },
        "weight": 34
      }
    ]
  }
}'

import requests

response = requests.post(
    "https://api.inworld.ai/router/v1/routers",
    headers={
        "Authorization": "Basic <your-api-key>",
        "Content-Type": "application/json",
    },
    json={
        "name": "compare-frontier-models",
        "defaultRoute": {
            "route_id": "default",
            "variants": [
                {
                    "variant": {
                        "variant_id": "deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8-a",
                        "model_id": "deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8"
                    },
                    "weight": 33
                },
                {
                    "variant": {
                        "variant_id": "groq/llama-3.1-8b-instant-b",
                        "model_id": "groq/llama-3.1-8b-instant"
                    },
                    "weight": 33
                },
                {
                    "variant": {
                        "variant_id": "groq/llama-3.3-70b-versatile-c",
                        "model_id": "groq/llama-3.3-70b-versatile"
                    },
                    "weight": 34
                }
            ]
        }
    },
)
print(response.json())

const response = await fetch(
  "https://api.inworld.ai/router/v1/routers",
  {
    method: "POST",
    headers: {
      Authorization: "Basic <your-api-key>",
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      name: "compare-frontier-models",
      defaultRoute: {
        route_id: "default",
        variants: [
          {
            variant: {
              variant_id: "deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8-a",
              model_id: "deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8",
            },
            weight: 33,
          },
          {
            variant: {
              variant_id: "groq/llama-3.1-8b-instant-b",
              model_id: "groq/llama-3.1-8b-instant",
            },
            weight: 33,
          },
          {
            variant: {
              variant_id: "groq/llama-3.3-70b-versatile-c",
              model_id: "groq/llama-3.3-70b-versatile",
            },
            weight: 34,
          },
        ],
      },
    }),
  }
);
const data = await response.json();
console.log(data);

Step 2 — Chat Completion

curl --location 'https://api.inworld.ai/v1/chat/completions' \
--header 'Authorization: Basic <your-api-key>' \
--header 'Content-Type: application/json' \
--data '{
  "model": "inworld/compare-frontier-models",
  "messages": [{"role": "user", "content": "Hello!"}]
}'

Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8 pricing and providers

Access Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8 through DeepInfra via Inworld Router or Realtime API. By using the AI provider(s) below you acknowledge you reviewed and agree to their terms listed in the Legal section under the AI provider's name.

					Capabilities	Input modalities	Output modalities
DeepInfra	1M	1M	$0.15	$0.60

Other Meta models available through Inworld

Compare Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8 with other Meta models available through Inworld Router or Realtime API.

					Capabilities
Llama 3.1 8b Instant	128K	8.2K	$0.05	$0.08
Llama 3.3 70b Versatile	128K	32.8K	$0.59	$0.79
Meta Llama/Llama 3.2 11B Vision Instruct	131.1K	131.1K	$0.345	$0.345
Meta Llama/Llama 3.3 70B Instruct Turbo	131.1K	131.1K	$0.10	$0.32
Meta Llama/llama 4 Scout 17b 16e Instruct	131.1K	8.2K	$0.11	$0.34	+1
Meta Llama/Llama 4 Scout 17B 16E Instruct	327.7K	327.7K	$0.10	$0.30	+1
Meta Llama/Llama Guard 4 12B	163.8K	163.8K	$0.18	$0.18	—
Meta Llama/Meta Llama 3.1 8B Instruct Turbo	131.1K	131.1K	$0.02	$0.03

Start building with Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8

Access Meta Llama/Llama 4 Maverick 17B 128E Instruct FP8 and every other model through Inworld Router or Realtime API. Create your free account and start routing in minutes.

Get Started Free Contact Sales