Google

Gemini 2.5 Flash Lite

Name: Gemini 2.5 Flash Lite
Brand: Google
Price: 0.1 USD
Availability: InStock

Gemini 2.5 Flash Lite is a reasoning model by Google with vision, function calling, web search, prompt caching, structured output — designed for complex, multi-step problem solving where accuracy matters more than speed. It supports a 1M token context window with up to 66K tokens of output. Priced at $0.1 per million input tokens and $0.4 per million output tokens, it is one of the most cost-efficient options in its class. Access Gemini 2.5 Flash Lite through Inworld Router with OpenAI SDK compatibility, built-in failover, and intelligent routing across providers.

→Function CallingWeb SearchReasoningPrompt CachingResponse SchemaVisionreasoningCapability

Use This Model

Step 1 — Create Router

curl --location 'https://api.inworld.ai/router/v1/routers' \
--header 'Authorization: Basic <your-api-key>' \
--header 'Content-Type: application/json' \
--data '{
  "name": "compare-frontier-models",
  "defaultRoute": {
    "route_id": "default",
    "variants": [
      {
        "variant": {
          "variant_id": "google-vertex/gemini-2.5-flash-lite-a",
          "model_id": "google-vertex/gemini-2.5-flash-lite"
        },
        "weight": 33
      },
      {
        "variant": {
          "variant_id": "google/gemini-2.5-flash-lite-b",
          "model_id": "google/gemini-2.5-flash-lite"
        },
        "weight": 33
      },
      {
        "variant": {
          "variant_id": "google-ai-studio/gemini-2.5-flash-lite-c",
          "model_id": "google-ai-studio/gemini-2.5-flash-lite"
        },
        "weight": 34
      }
    ]
  }
}'

import requests

response = requests.post(
    "https://api.inworld.ai/router/v1/routers",
    headers={
        "Authorization": "Basic <your-api-key>",
        "Content-Type": "application/json",
    },
    json={
        "name": "compare-frontier-models",
        "defaultRoute": {
            "route_id": "default",
            "variants": [
                {
                    "variant": {
                        "variant_id": "google-vertex/gemini-2.5-flash-lite-a",
                        "model_id": "google-vertex/gemini-2.5-flash-lite"
                    },
                    "weight": 33
                },
                {
                    "variant": {
                        "variant_id": "google/gemini-2.5-flash-lite-b",
                        "model_id": "google/gemini-2.5-flash-lite"
                    },
                    "weight": 33
                },
                {
                    "variant": {
                        "variant_id": "google-ai-studio/gemini-2.5-flash-lite-c",
                        "model_id": "google-ai-studio/gemini-2.5-flash-lite"
                    },
                    "weight": 34
                }
            ]
        }
    },
)
print(response.json())

const response = await fetch(
  "https://api.inworld.ai/router/v1/routers",
  {
    method: "POST",
    headers: {
      Authorization: "Basic <your-api-key>",
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      name: "compare-frontier-models",
      defaultRoute: {
        route_id: "default",
        variants: [
          {
            variant: {
              variant_id: "google-vertex/gemini-2.5-flash-lite-a",
              model_id: "google-vertex/gemini-2.5-flash-lite",
            },
            weight: 33,
          },
          {
            variant: {
              variant_id: "google/gemini-2.5-flash-lite-b",
              model_id: "google/gemini-2.5-flash-lite",
            },
            weight: 33,
          },
          {
            variant: {
              variant_id: "google-ai-studio/gemini-2.5-flash-lite-c",
              model_id: "google-ai-studio/gemini-2.5-flash-lite",
            },
            weight: 34,
          },
        ],
      },
    }),
  }
);
const data = await response.json();
console.log(data);

Step 2 — Chat Completion

curl --location 'https://api.inworld.ai/v1/chat/completions' \
--header 'Authorization: Basic <your-api-key>' \
--header 'Content-Type: application/json' \
--data '{
  "model": "inworld/compare-frontier-models",
  "messages": [{"role": "user", "content": "Hello!"}]
}'

Gemini 2.5 Flash Lite pricing and providers

Access Gemini 2.5 Flash Lite through Inworld Router or Realtime API via 3 inference providers. Compare pricing and specs across providers below. By using the AI provider(s) below you acknowledge you reviewed and agree to their terms listed in the Legal section under the AI provider's name.

					Capabilities
Google Vertex	1M	65.5K	$0.10	$0.40	+4
Google	—	—	$0.10	$0.40	—
Google AI Studio	1M	65.5K	$0.10	$0.40	+4

Other Google models available through Inworld

Compare Gemini 2.5 Flash Lite with other Google models available through Inworld Router or Realtime API.

					Capabilities
Gemini 2.0 Flash	1M	8.2K	$0.10	$0.40	+2
Gemini 2.0 Flash 001	—	—	$0.15	$0.60	+2
Gemini 2.0 Flash Lite	—	—	$0.075	$0.30	+2
Gemini 2.0 Flash Lite 001	1M	8.2K	$0.075	$0.30	+2
Gemini 2.5 Flash	—	—	$0.30	$2.50	+4
Gemini 2.5 Flash Image	32.8K	32.8K	$0.30	$2.50	+1
Gemini 2.5 Flash Lite Preview 09 2025	1M	65.5K	$0.10	$0.40	+4
Gemini 2.5 PRO	—	—	$1.25	$10.00	+4
Gemini 3 Flash Preview	—	—	$0.50	$3.00	+4
Gemini 3.1 Flash Image Preview	—	—	$0.50	$3.00	—
Gemini 3.1 Flash Lite	1M	65.5K	$0.25	$1.50	+4
Gemini 3.1 PRO Preview	1M	65.5K	$2.00	$12.00	+4
Gemini 3.1 PRO Preview Customtools	1M	65.5K	$2.00	$12.00	+4
Gemini 3.5 Flash	1M	65.5K	$1.50	$9.00	+4
Gemini Flash Latest	1M	65.5K	$0.30	$2.50	+4
Gemini Flash Lite Latest	1M	65.5K	$0.10	$0.40	+4
Gemma 4 26b A4b It Maas	—	—	$0.15	$0.60	—
Google/gemma 3 12b It	131.1K	131.1K	$0.04	$0.13
Google/gemma 3 27b It	131.1K	131.1K	$0.08	$0.16
Google/gemma 3 4b It	131.1K	131.1K	$0.04	$0.08
Google/gemma 4 26B A4B It	—	—	$0.07	$0.34	—
Google/gemma 4 31B It	—	—	$0.13	$0.38	—