
Gemini 3.1 Flash Lite

Gemini 3.1 Flash Lite is a reasoning model by Google with vision, function calling, web search, prompt caching, and structured output. It is designed for complex, multi-step problem solving where accuracy matters more than speed. It supports a 1M-token context window with up to 65.5K tokens of output. Priced at $0.25 per million input tokens and $1.50 per million output tokens, it is one of the most cost-efficient options in its class. Access Gemini 3.1 Flash Lite through Inworld Router with OpenAI SDK compatibility, built-in failover, and intelligent routing across providers.
Capabilities: Function Calling, Web Search, Reasoning, Prompt Caching, Response Schema, Vision
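The per-token prices quoted above translate into a per-request cost with simple arithmetic. The following is a minimal sketch: the helper name `estimate_cost` is hypothetical, and it assumes flat per-token pricing with no prompt-caching discounts or other adjustments, which the page does not specify.

```python
from decimal import Decimal

# Prices quoted on this page for Gemini 3.1 Flash Lite, in USD per one million tokens.
INPUT_PRICE_PER_M = Decimal("0.25")
OUTPUT_PRICE_PER_M = Decimal("1.50")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: flat per-token pricing; caching discounts are ignored.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) / TOKENS_PER_M * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) / TOKENS_PER_M * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # One request with 200,000 input tokens and 4,000 output tokens.
    print(estimate_cost(200_000, 4_000))
```

Under these assumptions, a request with 200,000 input tokens and 4,000 output tokens costs about $0.056, almost all of it from the input side.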

Step 1 — Create Router

curl --location 'https://api.inworld.ai/router/v1/routers' \
  --header 'Authorization: Basic <your-api-key>' \
  --header 'Content-Type: application/json' \
  --data '{
    "name": "compare-frontier-models",
    "defaultRoute": {
      "route_id": "default",
      "variants": [
        {
          "variant": {
            "variant_id": "google-ai-studio/gemini-3.1-flash-lite-a",
            "model_id": "google-ai-studio/gemini-3.1-flash-lite"
          },
          "weight": 33
        },
        {
          "variant": {
            "variant_id": "google/gemini-2.5-flash-b",
            "model_id": "google/gemini-2.5-flash"
          },
          "weight": 33
        },
        {
          "variant": {
            "variant_id": "google-ai-studio/gemini-2.5-flash-image-c",
            "model_id": "google-ai-studio/gemini-2.5-flash-image"
          },
          "weight": 34
        }
      ]
    }
  }'
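The Step 1 request assigns the three variants weights of 33, 33, and 34. The sketch below shows how those weights would map to traffic shares, assuming the router treats each weight as proportional to the total. That interpretation is an assumption, not something this page confirms, and `traffic_shares` is a hypothetical helper.

```python
from fractions import Fraction

# Variant IDs and weights copied from the Step 1 router definition.
ROUTER_VARIANTS = [
    {"variant_id": "google-ai-studio/gemini-3.1-flash-lite-a", "weight": 33},
    {"variant_id": "google/gemini-2.5-flash-b", "weight": 33},
    {"variant_id": "google-ai-studio/gemini-2.5-flash-image-c", "weight": 34},
]


def traffic_shares(variants):
    """Map each variant_id to weight / total weight (hypothetical helper).

    Assumption: weights are relative, so each share is proportional to its weight.
    """
    total = sum(v["weight"] for v in variants)
    if total <= 0:
        raise ValueError("total weight must be positive")
    return {v["variant_id"]: Fraction(v["weight"], total) for v in variants}


if __name__ == "__main__":
    for variant_id, share in traffic_shares(ROUTER_VARIANTS).items():
        print(f"{variant_id}: {float(share):.0%}")
```

With the weights above the total is 100, so under this assumption each weight reads directly as a percentage of requests.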

Step 2 — Chat Completion

curl --location 'https://api.inworld.ai/v1/chat/completions' \
  --header 'Authorization: Basic <your-api-key>' \
  --header 'Content-Type: application/json' \
  --data '{
    "model": "inworld/compare-frontier-models",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
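The same Step 2 request can be assembled from Python. This is a minimal sketch using only the standard library; it builds the request without sending it. The helper `build_chat_request` is hypothetical, and the `inworld/<router-name>` model string is inferred from the curl example rather than from a documented rule.

```python
import json
import urllib.request

# Endpoint taken from the Step 2 curl example.
CHAT_URL = "https://api.inworld.ai/v1/chat/completions"


def build_chat_request(api_key: str, router_name: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a chat completion request (hypothetical helper).

    Assumption: a router is addressed as "inworld/<router-name>", as in the example.
    """
    payload = {
        "model": f"inworld/{router_name}",
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        CHAT_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Basic {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    request = build_chat_request("<your-api-key>", "compare-frontier-models", "Hello!")
    print(request.full_url)
    print(request.data.decode("utf-8"))
    # To send it (requires a real API key and network access):
    # with urllib.request.urlopen(request) as response:
    #     print(json.load(response))
```

Keeping construction separate from sending makes it possible to inspect the payload and headers before spending any tokens.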

Gemini 3.1 Flash Lite pricing and providers

Access Gemini 3.1 Flash Lite through Google AI Studio via Inworld Router or Realtime API. By using the AI provider(s) below, you acknowledge that you have reviewed and agree to their terms, listed in the Legal section under the AI provider's name.
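| Provider | Context window | Max output | Input price (per 1M tokens) | Output price (per 1M tokens) | More capabilities |
|---|---|---|---|---|---|
| Google AI Studio | 1M | 65.5K | $0.25 | $1.50 | +5 |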

Other Google models available through Inworld

Compare Gemini 3.1 Flash Lite with other Google models available through Inworld Router or Realtime API.
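| Vendor | Model | Context window | Max output | Input price (per 1M tokens) | Output price (per 1M tokens) | More capabilities |
|---|---|---|---|---|---|---|
| Google | Gemini 2.5 Flash | 1M | 65.5K | $0.30 | $2.50 | +5 |
| Google | Gemini 2.5 Flash Image | 32.8K | 32.8K | $0.30 | $2.50 | - |
| Google | Gemini 2.5 Flash Lite | 1M | 65.5K | $0.10 | $0.40 | +5 |
| Google | Gemini 2.5 Flash Lite Preview 09 2025 | 1M | 65.5K | $0.10 | $0.40 | +5 |
| Google | Gemini 2.5 PRO | 1M | 65.5K | $1.25 | $10.00 | +5 |
| Google | Gemini 3 Flash Preview | 1M | 65.5K | $0.50 | $3.00 | +5 |
| Google | Gemini 3.1 Flash Image Preview | 65.5K | 32.8K | $0.50 | $3.00 | +2 |
| Google | Gemini 3.1 PRO Preview | 1M | 65.5K | $2.00 | $12.00 | +5 |
| Google | Gemini 3.1 PRO Preview Customtools | 1M | 65.5K | $2.00 | $12.00 | +5 |
| Google | Gemini 3.5 Flash | 1M | 65.5K | $1.50 | $9.00 | +5 |
| Google | Gemma 4 26b A4b It Maas | 128K | - | $0.15 | $0.60 | +2 |
| Google | Google/gemma 3 12b It | 131.1K | 131.1K | $0.05 | $0.15 | +1 |
| Google | Google/gemma 3 27b It | 131.1K | 131.1K | $0.08 | $0.16 | +1 |
| Google | Google/gemma 3 4b It | 131.1K | 131.1K | $0.05 | $0.10 | - |
| Google | Google/gemma 4 26B A4B It | 262.1K | - | $0.07 | $0.34 | +3 |
| Google | Google/gemma 4 31B It | 262.1K | - | $0.13 | $0.38 | +3 |
| Google | Models/gemma 4 26b A4b It | 262.1K | - | $0.07 | $0.34 | +2 |
| Google | Models/gemma 4 31b It | 262.1K | - | $0.13 | $0.38 | +2 |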
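To compare models for a specific workload, the listed prices can be applied to an expected input and output token mix. The sketch below ranks a handful of the models above by estimated cost. The selection of models and the helper `rank_by_cost` are illustrative, and flat per-token pricing is assumed.

```python
# (input, output) prices in USD per 1M tokens, copied from the listings on this page.
PRICES = {
    "Gemini 3.1 Flash Lite": (0.25, 1.50),
    "Gemini 2.5 Flash": (0.30, 2.50),
    "Gemini 2.5 Flash Lite": (0.10, 0.40),
    "Gemini 2.5 PRO": (1.25, 10.00),
    "Gemini 3.1 PRO Preview": (2.00, 12.00),
}


def rank_by_cost(input_tokens, output_tokens, prices=PRICES):
    """Return (model, estimated USD cost) pairs, cheapest first (hypothetical helper).

    Assumption: flat per-token pricing with no caching or volume discounts.
    """
    costs = {
        name: (input_tokens * inp + output_tokens * out) / 1_000_000
        for name, (inp, out) in prices.items()
    }
    return sorted(costs.items(), key=lambda item: item[1])


if __name__ == "__main__":
    # Example workload: 50M input tokens and 5M output tokens per month.
    for name, cost in rank_by_cost(50_000_000, 5_000_000):
        print(f"{name}: ${cost:,.2f}")
```

Price is only one axis of the comparison. Context window, output limit, and capabilities also differ between the models listed.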

Start building with Gemini 3.1 Flash Lite

Access Gemini 3.1 Flash Lite and every other model through Inworld Router or Realtime API. Create your free account and start routing in minutes.
Copyright © 2021-2026 Inworld AI