NVIDIA

Llama 3.1 Nemotron Instruct 70b

Name: Llama 3.1 Nemotron Instruct 70b
Brand: NVIDIA
Price: 1.2 USD
Availability: InStock

Llama 3.1 Nemotron Instruct 70b is a large language model by NVIDIA with support for function calling. It supports a 131K token context window with up to 131K tokens of output. Priced at $1.2 per million input tokens and $1.2 per million output tokens. Access Llama 3.1 Nemotron Instruct 70b through Inworld Router with OpenAI SDK compatibility, built-in failover, and intelligent routing across providers.

→Function Calling

Use This Model

Step 1 — Create Router

curl --location 'https://api.inworld.ai/router/v1/routers' \
--header 'Authorization: Basic <your-api-key>' \
--header 'Content-Type: application/json' \
--data '{
  "name": "compare-frontier-models",
  "defaultRoute": {
    "route_id": "default",
    "variants": [
      {
        "variant": {
          "variant_id": "deepinfra/llama-3-1-nemotron-instruct-70b-a",
          "model_id": "deepinfra/llama-3-1-nemotron-instruct-70b"
        },
        "weight": 33
      },
      {
        "variant": {
          "variant_id": "deepinfra/nvidia-nemotron-3-nano-30b-a3b-b",
          "model_id": "deepinfra/nvidia-nemotron-3-nano-30b-a3b"
        },
        "weight": 33
      },
      {
        "variant": {
          "variant_id": "deepinfra/nvidia-nemotron-nano-12b-v2-vl-c",
          "model_id": "deepinfra/nvidia-nemotron-nano-12b-v2-vl"
        },
        "weight": 34
      }
    ]
  }
}'

import requests

response = requests.post(
    "https://api.inworld.ai/router/v1/routers",
    headers={
        "Authorization": "Basic <your-api-key>",
        "Content-Type": "application/json",
    },
    json={
        "name": "compare-frontier-models",
        "defaultRoute": {
            "route_id": "default",
            "variants": [
                {
                    "variant": {
                        "variant_id": "deepinfra/llama-3-1-nemotron-instruct-70b-a",
                        "model_id": "deepinfra/llama-3-1-nemotron-instruct-70b"
                    },
                    "weight": 33
                },
                {
                    "variant": {
                        "variant_id": "deepinfra/nvidia-nemotron-3-nano-30b-a3b-b",
                        "model_id": "deepinfra/nvidia-nemotron-3-nano-30b-a3b"
                    },
                    "weight": 33
                },
                {
                    "variant": {
                        "variant_id": "deepinfra/nvidia-nemotron-nano-12b-v2-vl-c",
                        "model_id": "deepinfra/nvidia-nemotron-nano-12b-v2-vl"
                    },
                    "weight": 34
                }
            ]
        }
    },
)
print(response.json())

const response = await fetch(
  "https://api.inworld.ai/router/v1/routers",
  {
    method: "POST",
    headers: {
      Authorization: "Basic <your-api-key>",
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      name: "compare-frontier-models",
      defaultRoute: {
        route_id: "default",
        variants: [
          {
            variant: {
              variant_id: "deepinfra/llama-3-1-nemotron-instruct-70b-a",
              model_id: "deepinfra/llama-3-1-nemotron-instruct-70b",
            },
            weight: 33,
          },
          {
            variant: {
              variant_id: "deepinfra/nvidia-nemotron-3-nano-30b-a3b-b",
              model_id: "deepinfra/nvidia-nemotron-3-nano-30b-a3b",
            },
            weight: 33,
          },
          {
            variant: {
              variant_id: "deepinfra/nvidia-nemotron-nano-12b-v2-vl-c",
              model_id: "deepinfra/nvidia-nemotron-nano-12b-v2-vl",
            },
            weight: 34,
          },
        ],
      },
    }),
  }
);
const data = await response.json();
console.log(data);

Step 2 — Chat Completion

curl --location 'https://api.inworld.ai/v1/chat/completions' \
--header 'Authorization: Basic <your-api-key>' \
--header 'Content-Type: application/json' \
--data '{
  "model": "inworld/compare-frontier-models",
  "messages": [{"role": "user", "content": "Hello!"}]
}'

Llama 3.1 Nemotron Instruct 70b pricing and providers

Access Llama 3.1 Nemotron Instruct 70b through DeepInfra via Inworld Router or Realtime API. By using the AI provider(s) below you acknowledge you reviewed and agree to their terms listed in the Legal section under the AI provider's name.

					Capabilities	Input modalities	Output modalities
DeepInfra	131.1K	131.1K	$1.20	$1.20

Other NVIDIA models available through Inworld

Compare Llama 3.1 Nemotron Instruct 70b with other NVIDIA models available through Inworld Router or Realtime API.

					Capabilities
Nvidia Nemotron 3 Nano 30b A3b	—	—	$0.05	$0.20	—
Nvidia Nemotron Nano 12b V2 Vl	—	—	$0.20	$0.60	—
Nvidia Nemotron Nano 9b V2	131.1K	131.1K	$0.04	$0.16
Nvidia/Llama 3.3 Nemotron Super 49B V1.5	131.1K	131.1K	$0.10	$0.40
Nvidia/NVIDIA Nemotron 3 Super 120B A12B	—	—	$0.10	$0.50	—

Start building with Llama 3.1 Nemotron Instruct 70b

Access Llama 3.1 Nemotron Instruct 70b and every other model through Inworld Router or Realtime API. Create your free account and start routing in minutes.

Get Started Free Contact Sales

Products

Developers