Get started
Google
Google

Gemini 2.0 Flash Lite 001

Going away: Jun 1, 2026
Gemini 2.0 Flash Lite 001 is a large language model by Google with support for vision, function calling, web search, prompt caching, and structured output. It supports a 1M token context window with up to 8K tokens of output. Priced at $0.075 per million input tokens and $0.3 per million output tokens, it is one of the most cost-efficient options in its class. Access Gemini 2.0 Flash Lite 001 through Inworld Router with OpenAI SDK compatibility, built-in failover, and intelligent routing across providers.
Function CallingWeb SearchPrompt CachingResponse SchemaVision

Step 1 — Create Router

curl --location 'https://api.inworld.ai/router/v1/routers' \ --header 'Authorization: Basic <your-api-key>' \ --header 'Content-Type: application/json' \ --data '{ "name": "compare-frontier-models", "defaultRoute": { "route_id": "default", "variants": [ { "variant": { "variant_id": "google-vertex/gemini-2.0-flash-lite-001-a", "model_id": "google-vertex/gemini-2.0-flash-lite-001" }, "weight": 33 }, { "variant": { "variant_id": "google-ai-studio/gemini-2.0-flash-b", "model_id": "google-ai-studio/gemini-2.0-flash" }, "weight": 33 }, { "variant": { "variant_id": "google/gemini-2.0-flash-001-c", "model_id": "google/gemini-2.0-flash-001" }, "weight": 34 } ] } }'

Step 2 — Chat Completion

curl --location 'https://api.inworld.ai/v1/chat/completions' \ --header 'Authorization: Basic <your-api-key>' \ --header 'Content-Type: application/json' \ --data '{ "model": "inworld/compare-frontier-models", "messages": [{"role": "user", "content": "Hello!"}] }'

Gemini 2.0 Flash Lite 001 pricing and providers

Access Gemini 2.0 Flash Lite 001 through Google Vertex via Inworld Router or Realtime API. By using the AI provider(s) below you acknowledge you reviewed and agree to their terms listed in the Legal section under the AI provider's name.
CapabilitiesInput modalitiesOutput modalities
Google VertexGoogle Vertex1M8.2K$0.075$0.30+2
Going away: Jun 1, 2026

Other Google models available through Inworld

Compare Gemini 2.0 Flash Lite 001 with other Google models available through Inworld Router or Realtime API.
CapabilitiesInput modalitiesOutput modalitiesInference provider
GoogleGemini 2.0 Flash1M8.2K$0.10$0.40+2
GoogleGemini 2.0 Flash 001$0.15$0.60+2
GoogleGemini 2.0 Flash Lite$0.075$0.30+2
GoogleGemini 2.5 Flash$0.30$2.50+4
GoogleGemini 2.5 Flash Image32.8K32.8K$0.30$2.50+1
GoogleGemini 2.5 Flash Lite$0.10$0.40+4
GoogleGemini 2.5 Flash Lite Preview 09 20251M65.5K$0.10$0.40+4
GoogleGemini 2.5 PRO$1.25$10.00+4
GoogleGemini 3 Flash Preview$0.50$3.00+4
GoogleGemini 3.1 Flash Image Preview65.5K32.8K$0.50$3.00+1
GoogleGemini 3.1 Flash Lite1M65.5K$0.25$1.50+4
GoogleGemini 3.1 PRO Preview1M65.5K$2.00$12.00+4
GoogleGemini 3.1 PRO Preview Customtools1M65.5K$2.00$12.00+4
GoogleGemini 3.5 Flash1M65.5K$1.50$9.00+4
GoogleGemini Flash Latest1M65.5K$0.30$2.50+4
GoogleGemini Flash Lite Latest1M65.5K$0.10$0.40+4
GoogleGemma 4 26b A4b It Maas$0.15$0.60
GoogleGoogle/gemma 3 12b It131.1K131.1K$0.04$0.13
GoogleGoogle/gemma 3 27b It131.1K131.1K$0.08$0.16
GoogleGoogle/gemma 3 4b It131.1K131.1K$0.04$0.08
GoogleGoogle/gemma 4 26B A4B It$0.07$0.34
GoogleGoogle/gemma 4 31B It$0.13$0.38

Start building with Gemini 2.0 Flash Lite 001

Access Gemini 2.0 Flash Lite 001 and every other model through Inworld Router or Realtime API. Create your free account and start routing in minutes.
Copyright © 2021-2026 Inworld AI
Gemini 2.0 Flash Lite 001 by Google — Pricing, Specs & API Access | Inworld