NVIDIA

Nvidia/Llama 3.3 Nemotron Super 49B V1.5

Name: Nvidia/Llama 3.3 Nemotron Super 49B V1.5
Brand: NVIDIA
Price: 0.1 USD
Availability: InStock

Nvidia/Llama 3.3 Nemotron Super 49B V1.5 is a large language model by NVIDIA with support for function calling. It supports a 131K token context window with up to 131K tokens of output. Priced at $0.1 per million input tokens and $0.4 per million output tokens, it is one of the most cost-efficient options in its class. Access Nvidia/Llama 3.3 Nemotron Super 49B V1.5 through Inworld Router with OpenAI SDK compatibility, built-in failover, and intelligent routing across providers.

→Function Calling

Use This Model

Step 1 — Create Router

curl --location 'https://api.inworld.ai/router/v1/routers' \
--header 'Authorization: Basic <your-api-key>' \
--header 'Content-Type: application/json' \
--data '{
  "name": "compare-frontier-models",
  "defaultRoute": {
    "route_id": "default",
    "variants": [
      {
        "variant": {
          "variant_id": "deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5-a",
          "model_id": "deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5"
        },
        "weight": 33
      },
      {
        "variant": {
          "variant_id": "deepinfra/nvidia-nemotron-3-nano-30b-a3b-b",
          "model_id": "deepinfra/nvidia-nemotron-3-nano-30b-a3b"
        },
        "weight": 33
      },
      {
        "variant": {
          "variant_id": "deepinfra/nvidia-nemotron-nano-9b-v2-c",
          "model_id": "deepinfra/nvidia-nemotron-nano-9b-v2"
        },
        "weight": 34
      }
    ]
  }
}'

import requests

response = requests.post(
    "https://api.inworld.ai/router/v1/routers",
    headers={
        "Authorization": "Basic <your-api-key>",
        "Content-Type": "application/json",
    },
    json={
        "name": "compare-frontier-models",
        "defaultRoute": {
            "route_id": "default",
            "variants": [
                {
                    "variant": {
                        "variant_id": "deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5-a",
                        "model_id": "deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5"
                    },
                    "weight": 33
                },
                {
                    "variant": {
                        "variant_id": "deepinfra/nvidia-nemotron-3-nano-30b-a3b-b",
                        "model_id": "deepinfra/nvidia-nemotron-3-nano-30b-a3b"
                    },
                    "weight": 33
                },
                {
                    "variant": {
                        "variant_id": "deepinfra/nvidia-nemotron-nano-9b-v2-c",
                        "model_id": "deepinfra/nvidia-nemotron-nano-9b-v2"
                    },
                    "weight": 34
                }
            ]
        }
    },
)
print(response.json())

const response = await fetch(
  "https://api.inworld.ai/router/v1/routers",
  {
    method: "POST",
    headers: {
      Authorization: "Basic <your-api-key>",
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      name: "compare-frontier-models",
      defaultRoute: {
        route_id: "default",
        variants: [
          {
            variant: {
              variant_id: "deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5-a",
              model_id: "deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5",
            },
            weight: 33,
          },
          {
            variant: {
              variant_id: "deepinfra/nvidia-nemotron-3-nano-30b-a3b-b",
              model_id: "deepinfra/nvidia-nemotron-3-nano-30b-a3b",
            },
            weight: 33,
          },
          {
            variant: {
              variant_id: "deepinfra/nvidia-nemotron-nano-9b-v2-c",
              model_id: "deepinfra/nvidia-nemotron-nano-9b-v2",
            },
            weight: 34,
          },
        ],
      },
    }),
  }
);
const data = await response.json();
console.log(data);

Step 2 — Chat Completion

curl --location 'https://api.inworld.ai/v1/chat/completions' \
--header 'Authorization: Basic <your-api-key>' \
--header 'Content-Type: application/json' \
--data '{
  "model": "inworld/compare-frontier-models",
  "messages": [{"role": "user", "content": "Hello!"}]
}'

Nvidia/Llama 3.3 Nemotron Super 49B V1.5 pricing and providers

Access Nvidia/Llama 3.3 Nemotron Super 49B V1.5 through DeepInfra via Inworld Router or Realtime API. By using the AI provider(s) below you acknowledge you reviewed and agree to their terms listed in the Legal section under the AI provider's name.

					Capabilities	Input modalities	Output modalities
DeepInfra	131.1K	131.1K	$0.10	$0.40

Other NVIDIA models available through Inworld

Compare Nvidia/Llama 3.3 Nemotron Super 49B V1.5 with other NVIDIA models available through Inworld Router or Realtime API.

					Capabilities
Nvidia Nemotron 3 Nano 30b A3b	—	—	$0.05	$0.20	—
Nvidia Nemotron Nano 9b V2	131.1K	131.1K	$0.04	$0.16
Nvidia/Nemotron 3 Nano Omni 30B A3B Reasoning	—	—	$0.20	$0.80	—
Nvidia/NVIDIA Nemotron 3 Super 120B A12B	—	—	$0.10	$0.50	—

Start building with Nvidia/Llama 3.3 Nemotron Super 49B V1.5

Access Nvidia/Llama 3.3 Nemotron Super 49B V1.5 and every other model through Inworld Router or Realtime API. Create your free account and start routing in minutes.

Get Started Free Contact Sales

Products

Developers