Gemini 3.1 Flash Lite

Gemini 3.1 Flash Lite is a reasoning model by Google with vision, function calling, web search, prompt caching, and structured output. It is designed for complex, multi-step problem solving where accuracy matters more than speed. It supports a 1M-token context window with up to 65.5K tokens of output. Priced at $0.25 per million input tokens and $1.50 per million output tokens, it is one of the most cost-efficient options in its class. Access Gemini 3.1 Flash Lite through Inworld Router with OpenAI SDK compatibility, built-in failover, and intelligent routing across providers.
Capabilities: Function Calling, Web Search, Reasoning, Prompt Caching, Response Schema, Vision

Step 1 — Create Router

curl --location 'https://api.inworld.ai/router/v1/routers' \
  --header 'Authorization: Basic <your-api-key>' \
  --header 'Content-Type: application/json' \
  --data '{
    "name": "compare-frontier-models",
    "defaultRoute": {
      "route_id": "default",
      "variants": [
        {
          "variant": {
            "variant_id": "google-ai-studio/gemini-3.1-flash-lite-a",
            "model_id": "google-ai-studio/gemini-3.1-flash-lite"
          },
          "weight": 33
        },
        {
          "variant": {
            "variant_id": "google-ai-studio/gemini-2.0-flash-b",
            "model_id": "google-ai-studio/gemini-2.0-flash"
          },
          "weight": 33
        },
        {
          "variant": {
            "variant_id": "google/gemini-2.0-flash-001-c",
            "model_id": "google/gemini-2.0-flash-001"
          },
          "weight": 34
        }
      ]
    }
  }'
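The three variant weights in the request above (33, 33, 34) sum to 100. The page does not spell out how the router interprets them, so the following is only a sketch that assumes each weight is a relative share of traffic. The helper name `expected_shares` is hypothetical and not part of any Inworld API.

```python
# Variant weights copied from the router definition above.
VARIANT_WEIGHTS = {
    "google-ai-studio/gemini-3.1-flash-lite": 33,
    "google-ai-studio/gemini-2.0-flash": 33,
    "google/gemini-2.0-flash-001": 34,
}


def expected_shares(weights: dict[str, int]) -> dict[str, float]:
    """Return each model's expected share of traffic.

    Assumption: weights are relative, so a model's share is its weight
    divided by the sum of all weights. This is not a documented guarantee.
    """
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("weights must sum to a positive number")
    return {model: weight / total for model, weight in weights.items()}


if __name__ == "__main__":
    for model, share in expected_shares(VARIANT_WEIGHTS).items():
        print(f"{model}: {share:.0%}")
```

Under that assumption, roughly a third of requests would reach each model, which suits a side-by-side comparison like the `compare-frontier-models` router.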

Step 2 — Chat Completion

curl --location 'https://api.inworld.ai/v1/chat/completions' \
  --header 'Authorization: Basic <your-api-key>' \
  --header 'Content-Type: application/json' \
  --data '{
    "model": "inworld/compare-frontier-models",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
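The same request can be assembled in Python with only the standard library. This is a minimal sketch that mirrors the curl command above: the URL, headers, and payload are copied from it, while the function name `build_chat_request` and the `INWORLD_API_KEY` environment variable are hypothetical names chosen for illustration. The request is only sent when that variable is set.

```python
import json
import os
import urllib.request

# Endpoint and router name copied from the curl example above.
CHAT_URL = "https://api.inworld.ai/v1/chat/completions"
ROUTER_MODEL = "inworld/compare-frontier-models"


def build_chat_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) the chat completion request."""
    payload = {
        "model": ROUTER_MODEL,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        CHAT_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Basic {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    # INWORLD_API_KEY is a hypothetical environment variable name.
    key = os.environ.get("INWORLD_API_KEY")
    if key:
        with urllib.request.urlopen(build_chat_request(key, "Hello!")) as resp:
            print(json.load(resp))
```

Keeping request construction separate from sending makes it easy to inspect the payload before any tokens are billed.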

Gemini 3.1 Flash Lite pricing and providers

Access Gemini 3.1 Flash Lite through Google AI Studio via Inworld Router or Realtime API. By using the AI provider(s) below, you acknowledge that you have reviewed and agree to their terms, listed in the Legal section under the AI provider's name.
Provider         | Context | Max output | Input ($/1M tokens) | Output ($/1M tokens)
Google AI Studio | 1M      | 65.5K      | $0.25               | $1.50
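The per-token prices quoted on this page ($0.25 per million input tokens, $1.50 per million output tokens) translate into a per-request cost with simple arithmetic. The sketch below assumes those list prices apply as-is and ignores any prompt-caching discounts or other fees not listed here. The function name `estimate_cost` is hypothetical.

```python
# Prices quoted on this page for Gemini 3.1 Flash Lite, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.25
OUTPUT_PRICE_PER_M = 1.50


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD from its token counts.

    Assumes list prices only: no caching discounts, no extra fees.
    """
    return (
        input_tokens / 1_000_000 * INPUT_PRICE_PER_M
        + output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    )


if __name__ == "__main__":
    # Example: 200,000 input tokens and 10,000 output tokens.
    print(f"${estimate_cost(200_000, 10_000):.4f}")
```

For that example the input side costs $0.05 and the output side $0.015, about $0.065 in total.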

Other Google models available through Inworld

Compare Gemini 3.1 Flash Lite with other Google models available through Inworld Router or Realtime API.
Model                                 | Context | Max output | Input ($/1M tokens) | Output ($/1M tokens)
Gemini 2.0 Flash                      | 1M      | 8.2K       | $0.10               | $0.40
Gemini 2.0 Flash 001                  | -       | -          | $0.15               | $0.60
Gemini 2.0 Flash Lite                 | -       | -          | $0.075              | $0.30
Gemini 2.0 Flash Lite 001             | 1M      | 8.2K       | $0.075              | $0.30
Gemini 2.5 Flash                      | -       | -          | $0.30               | $2.50
Gemini 2.5 Flash Image                | 32.8K   | 32.8K      | $0.30               | $2.50
Gemini 2.5 Flash Lite                 | -       | -          | $0.10               | $0.40
Gemini 2.5 Flash Lite Preview 09 2025 | 1M      | 65.5K      | $0.10               | $0.40
Gemini 2.5 PRO                        | -       | -          | $1.25               | $10.00
Gemini 3 Flash Preview                | -       | -          | $0.50               | $3.00
Gemini 3.1 Flash Image Preview        | -       | -          | $0.50               | $3.00
Gemini 3.1 PRO Preview                | 1M      | 65.5K      | $2.00               | $12.00
Gemini 3.1 PRO Preview Customtools    | 1M      | 65.5K      | $2.00               | $12.00
Gemini 3.5 Flash                      | 1M      | 65.5K      | $1.50               | $9.00
Gemini Flash Latest                   | 1M      | 65.5K      | $0.30               | $2.50
Gemini Flash Lite Latest              | 1M      | 65.5K      | $0.10               | $0.40
Gemma 4 26b A4b It Maas               | -       | -          | $0.15               | $0.60
Google/gemma 3 12b It                 | 131.1K  | 131.1K     | $0.04               | $0.13
Google/gemma 3 27b It                 | 131.1K  | 131.1K     | $0.08               | $0.16
Google/gemma 3 4b It                  | 131.1K  | 131.1K     | $0.04               | $0.08
Google/gemma 4 26B A4B It             | -       | -          | $0.07               | $0.34
Google/gemma 4 31B It                 | -       | -          | $0.13               | $0.38

Start building with Gemini 3.1 Flash Lite

Access Gemini 3.1 Flash Lite and every other model through Inworld Router or Realtime API. Create your free account and start routing in minutes.
Copyright © 2021-2026 Inworld AI