Nvidia/Llama 3.3 Nemotron Super 49B V1.5

Nvidia/Llama 3.3 Nemotron Super 49B V1.5 is a large language model from NVIDIA that supports function calling. It has a 131K-token context window and can generate up to 131K tokens of output. At $0.10 per million input tokens and $0.40 per million output tokens, it is one of the more cost-efficient options in its class. You can access Nvidia/Llama 3.3 Nemotron Super 49B V1.5 through Inworld Router, which provides OpenAI SDK compatibility, built-in failover, and intelligent routing across providers.

Capabilities: Function Calling

Step 1 — Create Router

curl --location 'https://api.inworld.ai/router/v1/routers' \
  --header 'Authorization: Basic <your-api-key>' \
  --header 'Content-Type: application/json' \
  --data '{
    "name": "compare-frontier-models",
    "defaultRoute": {
      "route_id": "default",
      "variants": [
        {
          "variant": {
            "variant_id": "deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5-a",
            "model_id": "deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5"
          },
          "weight": 33
        },
        {
          "variant": {
            "variant_id": "deepinfra/llama-3-1-nemotron-instruct-70b-b",
            "model_id": "deepinfra/llama-3-1-nemotron-instruct-70b"
          },
          "weight": 33
        },
        {
          "variant": {
            "variant_id": "deepinfra/nvidia-nemotron-3-nano-30b-a3b-c",
            "model_id": "deepinfra/nvidia-nemotron-3-nano-30b-a3b"
          },
          "weight": 34
        }
      ]
    }
  }'
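The same router definition can be assembled programmatically, which helps when the model list or weights change often. The following is a minimal Python sketch using only the standard library. The helper names (`build_router_payload`, `traffic_shares`, `build_create_request`) are hypothetical, and the idea that weights act as relative traffic shares is an assumption based on the example weights summing to 100; the page does not state how weights are interpreted.

```python
import json
import string
import urllib.request

# Endpoint taken from the curl command in Step 1.
ROUTERS_URL = "https://api.inworld.ai/router/v1/routers"


def build_router_payload(name, weighted_models):
    """Build the JSON body for a router with a single default route.

    weighted_models is a list of (model_id, weight) pairs.
    """
    variants = []
    for letter, (model_id, weight) in zip(string.ascii_lowercase, weighted_models):
        variants.append({
            "variant": {
                # The -a/-b/-c suffix mirrors the variant_id values in Step 1.
                "variant_id": f"{model_id}-{letter}",
                "model_id": model_id,
            },
            "weight": weight,
        })
    return {
        "name": name,
        "defaultRoute": {"route_id": "default", "variants": variants},
    }


def traffic_shares(payload):
    """Return each model's fraction of traffic.

    ASSUMPTION: weights are relative shares of the total weight.
    """
    variants = payload["defaultRoute"]["variants"]
    total = sum(v["weight"] for v in variants)
    return {v["variant"]["model_id"]: v["weight"] / total for v in variants}


def build_create_request(api_key, payload):
    """Prepare (but do not send) the POST request from Step 1."""
    return urllib.request.Request(
        ROUTERS_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Basic {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


payload = build_router_payload(
    "compare-frontier-models",
    [
        ("deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5", 33),
        ("deepinfra/llama-3-1-nemotron-instruct-70b", 33),
        ("deepinfra/nvidia-nemotron-3-nano-30b-a3b", 34),
    ],
)
request = build_create_request("<your-api-key>", payload)
```

Passing `request` to `urllib.request.urlopen` would submit it. Building the request separately makes it possible to inspect the payload and the expected traffic split before anything is sent.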

Step 2 — Chat Completion

curl --location 'https://api.inworld.ai/v1/chat/completions' \
  --header 'Authorization: Basic <your-api-key>' \
  --header 'Content-Type: application/json' \
  --data '{
    "model": "inworld/compare-frontier-models",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
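Because the model supports function calling, a chat completion request can also carry tool definitions. The sketch below prepares the Step 2 request in Python with the standard library and optionally attaches a `tools` list. It assumes the endpoint accepts the OpenAI Chat Completions `tools` schema, which this page implies through its OpenAI SDK compatibility claim but does not document. The `get_weather` tool and the helper names are hypothetical.

```python
import json
import urllib.request

# Endpoint taken from the curl command in Step 2.
CHAT_URL = "https://api.inworld.ai/v1/chat/completions"


def build_chat_request(api_key, model, messages, tools=None):
    """Prepare (but do not send) a chat completion request."""
    body = {"model": model, "messages": messages}
    if tools:
        # ASSUMPTION: tools use the OpenAI Chat Completions "tools" format.
        body["tools"] = tools
    return urllib.request.Request(
        CHAT_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Basic {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request, timeout=30):
    """Send a prepared request and decode the JSON response."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Hypothetical tool definition, for illustration only.
GET_WEATHER_TOOL = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}

chat_request = build_chat_request(
    "<your-api-key>",
    "inworld/compare-frontier-models",  # router name created in Step 1
    [{"role": "user", "content": "Hello!"}],
    tools=[GET_WEATHER_TOOL],
)
```

Calling `send(chat_request)` with a real API key would perform the request. Omitting `tools` yields the same body as the curl command in Step 2.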

Nvidia/Llama 3.3 Nemotron Super 49B V1.5 pricing and providers

Access Nvidia/Llama 3.3 Nemotron Super 49B V1.5 through DeepInfra via Inworld Router or Realtime API.
Provider     Context window   Max output   Input price (per 1M tokens)   Output price (per 1M tokens)
DeepInfra    131.1K           131.1K       $0.10                         $0.40
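Per-million-token prices translate into per-request cost with simple arithmetic: multiply each token count by its price and divide by one million. The sketch below works this out using the DeepInfra prices listed above. The function name and the sample token counts are illustrative, and the calculation assumes billing is strictly linear in tokens with no minimums or other fees.

```python
from decimal import Decimal

# Prices from this page, in US dollars per one million tokens.
INPUT_PRICE_PER_MILLION = Decimal("0.10")
OUTPUT_PRICE_PER_MILLION = Decimal("0.40")
ONE_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens):
    """Estimate the dollar cost of one request.

    ASSUMPTION: cost scales linearly with token counts.
    """
    input_cost = INPUT_PRICE_PER_MILLION * input_tokens / ONE_MILLION
    output_cost = OUTPUT_PRICE_PER_MILLION * output_tokens / ONE_MILLION
    return input_cost + output_cost


# Example: a 10,000-token prompt that produces a 2,000-token reply.
cost = estimate_cost(10_000, 2_000)
print(f"Estimated cost: ${cost}")
```

For that example, the input side costs $0.001 and the output side $0.0008, for a total of $0.0018. `Decimal` is used instead of floats so that small currency amounts stay exact.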

Other NVIDIA models available through Inworld

Compare Nvidia/Llama 3.3 Nemotron Super 49B V1.5 with other NVIDIA models available through Inworld Router or Realtime API.
A dash means the value is not listed.

Developer   Model                                      Context window   Max output   Input price (per 1M tokens)   Output price (per 1M tokens)
NVIDIA      Llama 3.1 Nemotron Instruct 70b            131.1K           131.1K       $1.20                         $1.20
NVIDIA      Nvidia Nemotron 3 Nano 30b A3b             -                -            $0.05                         $0.20
NVIDIA      Nvidia Nemotron Nano 12b V2 Vl             -                -            $0.20                         $0.60
NVIDIA      Nvidia Nemotron Nano 9b V2                 131.1K           131.1K       $0.04                         $0.16
NVIDIA      Nvidia/NVIDIA Nemotron 3 Super 120B A12B   -                -            $0.10                         $0.50
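To compare these models on price for a specific workload, the listed figures can be ranked for a given mix of input and output tokens. The sketch below assumes the two dollar amounts in each row are the input and output prices per million tokens, in that order, matching the DeepInfra row for Nvidia/Llama 3.3 Nemotron Super 49B V1.5. The function name and workload sizes are illustrative, and the ranking says nothing about model quality or latency.

```python
from decimal import Decimal

# (input price, output price) in US dollars per one million tokens.
# ASSUMPTION: each row on this page lists the input price first.
PRICES = {
    "Llama 3.3 Nemotron Super 49B V1.5": (Decimal("0.10"), Decimal("0.40")),
    "Llama 3.1 Nemotron Instruct 70b": (Decimal("1.20"), Decimal("1.20")),
    "Nvidia Nemotron 3 Nano 30b A3b": (Decimal("0.05"), Decimal("0.20")),
    "Nvidia Nemotron Nano 12b V2 Vl": (Decimal("0.20"), Decimal("0.60")),
    "Nvidia Nemotron Nano 9b V2": (Decimal("0.04"), Decimal("0.16")),
    "NVIDIA Nemotron 3 Super 120B A12B": (Decimal("0.10"), Decimal("0.50")),
}
ONE_MILLION = Decimal(1_000_000)


def rank_by_cost(input_tokens, output_tokens):
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = []
    for model, (input_price, output_price) in PRICES.items():
        cost = (input_price * input_tokens + output_price * output_tokens) / ONE_MILLION
        costs.append((model, cost))
    return sorted(costs, key=lambda pair: pair[1])


# Example workload: one million input tokens and one million output tokens.
for model, cost in rank_by_cost(1_000_000, 1_000_000):
    print(f"{model}: ${cost}")
```

For this workload, Nvidia/Llama 3.3 Nemotron Super 49B V1.5 comes to $0.50, which is above the two smaller Nano models and below the 120B, 12b VL, and 70b models.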

Start building with Nvidia/Llama 3.3 Nemotron Super 49B V1.5

Access Nvidia/Llama 3.3 Nemotron Super 49B V1.5 and every other model through Inworld Router or Realtime API. Create your free account and start routing in minutes.
Copyright © 2021-2026 Inworld AI