
Llama 3.1 Nemotron Instruct 70b

Llama 3.1 Nemotron Instruct 70b is a large language model by NVIDIA with support for function calling. It has a 131K-token context window and can generate up to 131K tokens of output. It is priced at $1.20 per million input tokens and $1.20 per million output tokens. You can access Llama 3.1 Nemotron Instruct 70b through Inworld Router, which offers OpenAI SDK compatibility, built-in failover, and intelligent routing across providers.
Function Calling

Step 1 — Create Router

curl --location 'https://api.inworld.ai/router/v1/routers' \
  --header 'Authorization: Basic <your-api-key>' \
  --header 'Content-Type: application/json' \
  --data '{
    "name": "compare-frontier-models",
    "defaultRoute": {
      "route_id": "default",
      "variants": [
        {
          "variant": {
            "variant_id": "deepinfra/llama-3-1-nemotron-instruct-70b-a",
            "model_id": "deepinfra/llama-3-1-nemotron-instruct-70b"
          },
          "weight": 33
        },
        {
          "variant": {
            "variant_id": "deepinfra/nvidia-nemotron-3-nano-30b-a3b-b",
            "model_id": "deepinfra/nvidia-nemotron-3-nano-30b-a3b"
          },
          "weight": 33
        },
        {
          "variant": {
            "variant_id": "deepinfra/nvidia-nemotron-nano-12b-v2-vl-c",
            "model_id": "deepinfra/nvidia-nemotron-nano-12b-v2-vl"
          },
          "weight": 34
        }
      ]
    }
  }'
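The same request body can also be assembled programmatically. The following is a minimal Python sketch using only the standard library. It assumes the endpoint, headers, and JSON fields shown in the curl command above. The helper names `build_router_payload` and `build_create_router_request` are hypothetical, and the check that weights sum to 100 only mirrors the 33/33/34 split in the example; it is an assumption, not a documented requirement.

```python
import json
import urllib.request

ROUTER_URL = "https://api.inworld.ai/router/v1/routers"  # from the curl example


def build_router_payload(name, weighted_models):
    """Build the router JSON body from (model_id, weight) pairs."""
    total = sum(weight for _, weight in weighted_models)
    if total != 100:
        # Assumption: the example's weights sum to 100; the API may accept other totals.
        raise ValueError(f"weights sum to {total}, expected 100")
    variants = []
    for index, (model_id, weight) in enumerate(weighted_models):
        # The example names variants as the model ID plus a letter suffix (-a, -b, -c).
        suffix = chr(ord("a") + index)
        variants.append({
            "variant": {"variant_id": f"{model_id}-{suffix}", "model_id": model_id},
            "weight": weight,
        })
    return {"name": name, "defaultRoute": {"route_id": "default", "variants": variants}}


def build_create_router_request(api_key, payload):
    """Return an unsent POST request that mirrors the curl headers."""
    return urllib.request.Request(
        ROUTER_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Basic {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


payload = build_router_payload("compare-frontier-models", [
    ("deepinfra/llama-3-1-nemotron-instruct-70b", 33),
    ("deepinfra/nvidia-nemotron-3-nano-30b-a3b", 33),
    ("deepinfra/nvidia-nemotron-nano-12b-v2-vl", 34),
])
router_request = build_create_router_request("<your-api-key>", payload)
# To send it with a real key: urllib.request.urlopen(router_request)
```

Building the request separately from sending it makes it possible to inspect the payload before any network call is made.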

Step 2 — Chat Completion

curl --location 'https://api.inworld.ai/v1/chat/completions' \
  --header 'Authorization: Basic <your-api-key>' \
  --header 'Content-Type: application/json' \
  --data '{
    "model": "inworld/compare-frontier-models",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
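For callers working from Python, the chat completion request can be sketched with the standard library as follows. This assumes the endpoint and headers from the curl command above, and that a router is addressed as `inworld/<router name>`, as the example suggests. The helper names `build_chat_request` and `extract_reply` are hypothetical, and the response shape is assumed to follow the OpenAI chat completion format, which the page implies but does not show.

```python
import json
import urllib.request

CHAT_URL = "https://api.inworld.ai/v1/chat/completions"  # from the curl example


def build_chat_request(api_key, router_name, user_message):
    """Return an unsent POST request that targets a router by name."""
    body = {
        # Assumption: routers are addressed as "inworld/<router name>", as in the example.
        "model": f"inworld/{router_name}",
        "messages": [{"role": "user", "content": user_message}],
    }
    return urllib.request.Request(
        CHAT_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Basic {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def extract_reply(response_body):
    """Return the assistant text from a parsed response.

    Assumption: the response follows the OpenAI chat completion shape.
    """
    return response_body["choices"][0]["message"]["content"]


chat_request = build_chat_request("<your-api-key>", "compare-frontier-models", "Hello!")
# To send it with a real key:
# with urllib.request.urlopen(chat_request) as response:
#     print(extract_reply(json.load(response)))
```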

Llama 3.1 Nemotron Instruct 70b pricing and providers

Access Llama 3.1 Nemotron Instruct 70b through DeepInfra via Inworld Router or Realtime API.
Provider | Context window | Max output | Input price (per 1M tokens) | Output price (per 1M tokens)
DeepInfra | 131.1K | 131.1K | $1.20 | $1.20
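To turn the per-million-token prices into a per-request figure, multiply each token count by its price and divide by one million. The sketch below is illustrative only: it assumes the $1.20 input and $1.20 output prices listed above, the function name `estimate_cost_usd` is hypothetical, and it ignores any fees or discounts the page does not mention.

```python
from decimal import Decimal

# USD per million tokens for Llama 3.1 Nemotron Instruct 70b on DeepInfra, as listed above.
INPUT_USD_PER_MILLION = Decimal("1.20")
OUTPUT_USD_PER_MILLION = Decimal("1.20")


def estimate_cost_usd(input_tokens, output_tokens):
    """Estimate the cost of one request from its token counts."""
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


# Example: a request with 10,000 input tokens and 2,000 output tokens.
cost = estimate_cost_usd(10_000, 2_000)
print(f"Estimated cost: ${cost}")
```

With both prices equal, the example works out to 12,000 tokens × $1.20 / 1,000,000 = $0.0144. `Decimal` is used so that small amounts are not distorted by binary floating-point rounding.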

Other NVIDIA models available through Inworld

Compare Llama 3.1 Nemotron Instruct 70b with other NVIDIA models available through Inworld Router or Realtime API.
Model | Context window | Max output | Input price (per 1M tokens) | Output price (per 1M tokens)
Nvidia Nemotron 3 Nano 30b A3b | - | - | $0.05 | $0.20
Nvidia Nemotron Nano 12b V2 Vl | - | - | $0.20 | $0.60
Nvidia Nemotron Nano 9b V2 | 131.1K | 131.1K | $0.04 | $0.16
Nvidia/Llama 3.3 Nemotron Super 49B V1.5 | 131.1K | 131.1K | $0.10 | $0.40
Nvidia/NVIDIA Nemotron 3 Super 120B A12B | - | - | $0.10 | $0.50
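To put the price differences in concrete terms, the sketch below ranks the listed models by cost for the same workload. It assumes that the two dollar figures in each listing are the input and output prices per million tokens, in that order, as they are for Llama 3.1 Nemotron Instruct 70b. The names `PRICES`, `workload_cost_usd`, and `rank_by_cost` are hypothetical.

```python
from decimal import Decimal

# (input, output) USD per million tokens, copied from the listings above.
PRICES = {
    "Llama 3.1 Nemotron Instruct 70b": (Decimal("1.20"), Decimal("1.20")),
    "Nvidia Nemotron 3 Nano 30b A3b": (Decimal("0.05"), Decimal("0.20")),
    "Nvidia Nemotron Nano 12b V2 Vl": (Decimal("0.20"), Decimal("0.60")),
    "Nvidia Nemotron Nano 9b V2": (Decimal("0.04"), Decimal("0.16")),
    "Nvidia/Llama 3.3 Nemotron Super 49B V1.5": (Decimal("0.10"), Decimal("0.40")),
    "Nvidia/NVIDIA Nemotron 3 Super 120B A12B": (Decimal("0.10"), Decimal("0.50")),
}


def workload_cost_usd(prices, input_tokens, output_tokens):
    """Cost of a workload given an (input, output) price pair."""
    input_price, output_price = prices
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def rank_by_cost(input_tokens, output_tokens):
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = {
        name: workload_cost_usd(prices, input_tokens, output_tokens)
        for name, prices in PRICES.items()
    }
    return sorted(costs.items(), key=lambda item: item[1])


# Example workload: one million input tokens and one million output tokens.
for name, cost in rank_by_cost(1_000_000, 1_000_000):
    print(f"{name}: ${cost:.2f}")
```

Price is only one axis of comparison; the ranking says nothing about quality, latency, or context length.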

Start building with Llama 3.1 Nemotron Instruct 70b

Access Llama 3.1 Nemotron Instruct 70b and every other model through Inworld Router or Realtime API. Create your free account and start routing in minutes.
Copyright © 2021-2026 Inworld AI