Get started
NVIDIA
NVIDIA

Nvidia/Llama 3.3 Nemotron Super 49B V1.5

Nvidia/Llama 3.3 Nemotron Super 49B V1.5 is a large language model by NVIDIA with support for function calling. It supports a 131K token context window with up to 131K tokens of output. Priced at $0.1 per million input tokens and $0.4 per million output tokens, it is one of the most cost-efficient options in its class. Access Nvidia/Llama 3.3 Nemotron Super 49B V1.5 through Inworld Router with OpenAI SDK compatibility, built-in failover, and intelligent routing across providers.
Function Calling

Step 1 — Create Router

curl --location 'https://api.inworld.ai/router/v1/routers' \ --header 'Authorization: Basic <your-api-key>' \ --header 'Content-Type: application/json' \ --data '{ "name": "compare-frontier-models", "defaultRoute": { "route_id": "default", "variants": [ { "variant": { "variant_id": "deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5-a", "model_id": "deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5" }, "weight": 33 }, { "variant": { "variant_id": "deepinfra/llama-3-1-nemotron-instruct-70b-b", "model_id": "deepinfra/llama-3-1-nemotron-instruct-70b" }, "weight": 33 }, { "variant": { "variant_id": "deepinfra/nvidia-nemotron-3-nano-30b-a3b-c", "model_id": "deepinfra/nvidia-nemotron-3-nano-30b-a3b" }, "weight": 34 } ] } }'

Step 2 — Chat Completion

curl --location 'https://api.inworld.ai/v1/chat/completions' \ --header 'Authorization: Basic <your-api-key>' \ --header 'Content-Type: application/json' \ --data '{ "model": "inworld/compare-frontier-models", "messages": [{"role": "user", "content": "Hello!"}] }'

Nvidia/Llama 3.3 Nemotron Super 49B V1.5 pricing and providers

Access Nvidia/Llama 3.3 Nemotron Super 49B V1.5 through DeepInfra via Inworld Router or Realtime API. By using the AI provider(s) below you acknowledge you reviewed and agree to their terms listed in the Legal section under the AI provider's name.
CapabilitiesInput modalitiesOutput modalities
DeepInfraDeepInfra131.1K131.1K$0.10$0.40

Other NVIDIA models available through Inworld

Compare Nvidia/Llama 3.3 Nemotron Super 49B V1.5 with other NVIDIA models available through Inworld Router or Realtime API.
CapabilitiesInput modalitiesOutput modalitiesInference provider
NVIDIALlama 3.1 Nemotron Instruct 70b131.1K131.1K$1.20$1.20
NVIDIANvidia Nemotron 3 Nano 30b A3b$0.05$0.20
NVIDIANvidia Nemotron Nano 12b V2 Vl$0.20$0.60
NVIDIANvidia Nemotron Nano 9b V2131.1K131.1K$0.04$0.16
NVIDIANvidia/NVIDIA Nemotron 3 Super 120B A12B$0.10$0.50

Start building with Nvidia/Llama 3.3 Nemotron Super 49B V1.5

Access Nvidia/Llama 3.3 Nemotron Super 49B V1.5 and every other model through Inworld Router or Realtime API. Create your free account and start routing in minutes.
Copyright © 2021-2026 Inworld AI
Nvidia/Llama 3.3 Nemotron Super 49B V1.5 by NVIDIA — Pricing, Specs & API Access | Inworld