Llama3.1 8b

Llama3.1 8b is a large language model developed by Meta and served by Cerebras. At $0.10 per million input tokens and $0.10 per million output tokens, it is one of the most cost-efficient options in its class. It is available through Inworld Router, which provides OpenAI SDK compatibility, built-in failover, and intelligent routing across providers.

Step 1 — Create Router

curl --location 'https://api.inworld.ai/router/v1/routers' \
  --header 'Authorization: Basic <your-api-key>' \
  --header 'Content-Type: application/json' \
  --data '{
    "name": "compare-frontier-models",
    "defaultRoute": {
      "route_id": "default",
      "variants": [
        {
          "variant": {
            "variant_id": "cerebras/llama3.1-8b-a",
            "model_id": "cerebras/llama3.1-8b"
          },
          "weight": 33
        },
        {
          "variant": {
            "variant_id": "cerebras/llama3.1-8b-b",
            "model_id": "cerebras/llama3.1-8b"
          },
          "weight": 33
        },
        {
          "variant": {
            "variant_id": "cerebras/llama3.1-8b-c",
            "model_id": "cerebras/llama3.1-8b"
          },
          "weight": 34
        }
      ]
    }
  }'
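The same request body can be assembled programmatically, which may help when the number of variants changes. The sketch below uses only the Python standard library and mirrors the URL, headers, and field names from the curl command above. The helper names (build_router_payload, build_router_request) are hypothetical, and the idea that weights should total 100 is an assumption drawn from the example values (33 + 33 + 34), not a documented rule.

```python
import json
import urllib.request

# URL taken from the curl example above.
ROUTER_URL = "https://api.inworld.ai/router/v1/routers"


def build_router_payload(name, model_id, weights):
    """Build the router JSON body with one variant per weight (hypothetical helper)."""
    variants = []
    for index, weight in enumerate(weights):
        suffix = chr(ord("a") + index)  # a, b, c, ... as in the example variant IDs
        variants.append({
            "variant": {
                "variant_id": f"{model_id}-{suffix}",
                "model_id": model_id,
            },
            "weight": weight,
        })
    return {
        "name": name,
        "defaultRoute": {"route_id": "default", "variants": variants},
    }


def build_router_request(api_key, payload):
    """Prepare the POST request without sending it (hypothetical helper)."""
    return urllib.request.Request(
        ROUTER_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Basic {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


payload = build_router_payload(
    "compare-frontier-models", "cerebras/llama3.1-8b", [33, 33, 34]
)
# Assumption: the example weights behave like percentages, so check the total.
total_weight = sum(v["weight"] for v in payload["defaultRoute"]["variants"])
```

Passing the prepared request to urllib.request.urlopen would send it; that step is left out here so the sketch can be run without an API key.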

Step 2 — Chat Completion

curl --location 'https://api.inworld.ai/v1/chat/completions' \
  --header 'Authorization: Basic <your-api-key>' \
  --header 'Content-Type: application/json' \
  --data '{
    "model": "inworld/compare-frontier-models",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
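For callers who prefer Python over curl, the request above can be sketched with the standard library. The URL, headers, and body fields come from the curl command; the function names are hypothetical. The response parsing assumes an OpenAI-style response shape (choices[0].message.content), which the page implies through its OpenAI SDK compatibility claim but does not show.

```python
import json
import urllib.request

# URL taken from the curl example above.
CHAT_URL = "https://api.inworld.ai/v1/chat/completions"


def build_chat_request(api_key, router_name, user_message):
    """Prepare a chat completion request addressed to a named router (hypothetical helper)."""
    body = {
        "model": f"inworld/{router_name}",  # router name prefixed as in the example
        "messages": [{"role": "user", "content": user_message}],
    }
    return urllib.request.Request(
        CHAT_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Basic {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def extract_reply(response_json):
    """Return the assistant text, assuming an OpenAI-style response body."""
    return response_json["choices"][0]["message"]["content"]


def send(request):
    """Send a prepared request and decode the JSON response."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

A call would look like extract_reply(send(build_chat_request("<your-api-key>", "compare-frontier-models", "Hello!"))), provided the router from Step 1 already exists.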

Llama3.1 8b pricing and providers

Access Llama3.1 8b through Cerebras via Inworld Router or Realtime API. By using the AI provider(s) below, you acknowledge that you have reviewed and agree to their terms, listed in the Legal section under the AI provider's name.
Provider    Input price (per 1M tokens)    Output price (per 1M tokens)
Cerebras    $0.10                          $0.10
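As a rough illustration of what these rates mean for a workload, the sketch below estimates cost from token counts. The two prices are the ones listed on this page; the function name and the example token counts are hypothetical, and the calculation ignores any fees or discounts not stated here.

```python
from decimal import Decimal

# Prices listed on this page, in USD per million tokens.
INPUT_PRICE_PER_MILLION = Decimal("0.10")
OUTPUT_PRICE_PER_MILLION = Decimal("0.10")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens, output_tokens):
    """Estimate cost in USD for the given token counts (hypothetical helper)."""
    input_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * INPUT_PRICE_PER_MILLION
    output_cost = Decimal(output_tokens) / TOKENS_PER_MILLION * OUTPUT_PRICE_PER_MILLION
    return input_cost + output_cost


# Hypothetical workload: 2M input tokens and 500K output tokens.
example_cost = estimate_cost_usd(2_000_000, 500_000)
print(f"Estimated cost: ${example_cost:.2f}")  # prints "Estimated cost: $0.25"
```

Decimal is used instead of float so that small per-token amounts add up without binary rounding error.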

Start building with Llama3.1 8b

Access Llama3.1 8b and every other model through Inworld Router or Realtime API. Create your free account and start routing in minutes.