Get started
Google
Google

Gemini 2.5 Flash Lite

Gemini 2.5 Flash Lite is a reasoning model by Google with vision, function calling, web search, prompt caching, structured output — designed for complex, multi-step problem solving where accuracy matters more than speed. It supports a 1M token context window with up to 66K tokens of output. Priced at $0.1 per million input tokens and $0.4 per million output tokens, it is one of the most cost-efficient options in its class. Access Gemini 2.5 Flash Lite through Inworld Router with OpenAI SDK compatibility, built-in failover, and intelligent routing across providers.
Function CallingWeb SearchReasoningPrompt CachingResponse SchemaVision

Step 1 — Create Router

curl --location 'https://api.inworld.ai/router/v1/routers' \ --header 'Authorization: Basic <your-api-key>' \ --header 'Content-Type: application/json' \ --data '{ "name": "compare-frontier-models", "defaultRoute": { "route_id": "default", "variants": [ { "variant": { "variant_id": "google-vertex/gemini-2.5-flash-lite-a", "model_id": "google-vertex/gemini-2.5-flash-lite" }, "weight": 33 }, { "variant": { "variant_id": "google-ai-studio/gemini-2.5-flash-lite-b", "model_id": "google-ai-studio/gemini-2.5-flash-lite" }, "weight": 33 }, { "variant": { "variant_id": "google-ai-studio/gemini-2.0-flash-c", "model_id": "google-ai-studio/gemini-2.0-flash" }, "weight": 34 } ] } }'

Step 2 — Chat Completion

curl --location 'https://api.inworld.ai/v1/chat/completions' \ --header 'Authorization: Basic <your-api-key>' \ --header 'Content-Type: application/json' \ --data '{ "model": "inworld/compare-frontier-models", "messages": [{"role": "user", "content": "Hello!"}] }'

Gemini 2.5 Flash Lite pricing and providers

Access Gemini 2.5 Flash Lite through Inworld Router or Realtime API via 2 inference providers. Compare pricing and specs across providers below.
CapabilitiesInput modalitiesOutput modalities
Google VertexGoogle Vertex1M65.5K$0.10$0.40+3
Google AI StudioGoogle AI Studio1M65.5K$0.10$0.40+3

Other Google models available through Inworld

Compare Gemini 2.5 Flash Lite with other Google models available through Inworld Router or Realtime API.
CapabilitiesInput modalitiesOutput modalitiesInference provider
GoogleGemini 2.0 Flash1M8.2K$0.10$0.40+2
GoogleGemini 2.0 Flash 0011M8.2K$0.15$0.60+2
GoogleGemini 2.0 Flash Lite1M8.2K$0.075$0.30+2
GoogleGemini 2.0 Flash Lite 0011M8.2K$0.075$0.30+2
GoogleGemini 2.5 Flash1M65.5K$0.30$2.50+3
GoogleGemini 2.5 Flash Image32.8K32.8K$0.30$2.50+1
GoogleGemini 2.5 Flash Lite Preview 09 20251M65.5K$0.10$0.40+3
GoogleGemini 2.5 PRO1M65.5K$1.25$10.00+3
GoogleGemini 3 Flash Preview1M65.5K$0.50$3.00+3
GoogleGemini 3 PRO Image Preview65.5K32.8K$2.00$12.00+1
GoogleGemini 3.1 Flash Image Preview65.5K32.8K$0.50$3.00+1
GoogleGemini 3.1 Flash Lite Preview1M65.5K$0.25$1.50+3
GoogleGemini 3.1 PRO Preview1M65.5K$2.00$12.00+3
GoogleGemini 3.1 PRO Preview Customtools1M65.5K$2.00$12.00+3
GoogleGemini Flash Latest1M65.5K$0.30$2.50+3
GoogleGemini Flash Lite Latest1M65.5K$0.10$0.40+3
GoogleGemini PRO Latest1M65.5K$1.25$10.00+3
GoogleGemma 3 12b131.1K131.1K$0.04$0.13
GoogleGemma 3 27b131.1K131.1K$0.08$0.16
GoogleGemma 3 4b131.1K131.1K$0.04$0.08

Start building with Gemini 2.5 Flash Lite

Access Gemini 2.5 Flash Lite and every other model through Inworld Router or Realtime API. Create your free account and start routing in minutes.
Copyright © 2021-2026 Inworld AI