
Gemini 2.5 Flash

Gemini 2.5 Flash is a reasoning model from Google that supports vision, function calling, web search, prompt caching, and structured output. It is designed for complex, multi-step problem solving where accuracy matters more than speed. It has a 1M-token context window and can produce up to 65.5K tokens of output. Pricing is $0.30 per million input tokens and $2.50 per million output tokens. Gemini 2.5 Flash is available through Inworld Router, which provides OpenAI SDK compatibility, built-in failover, and intelligent routing across providers.
Capabilities: Function Calling, Web Search, Reasoning, Prompt Caching, Response Schema, Vision

Step 1 — Create Router

curl --location 'https://api.inworld.ai/router/v1/routers' \
  --header 'Authorization: Basic <your-api-key>' \
  --header 'Content-Type: application/json' \
  --data '{
    "name": "compare-frontier-models",
    "defaultRoute": {
      "route_id": "default",
      "variants": [
        {
          "variant": {
            "variant_id": "google-vertex/gemini-2.5-flash-a",
            "model_id": "google-vertex/gemini-2.5-flash"
          },
          "weight": 33
        },
        {
          "variant": {
            "variant_id": "google/gemini-2.5-flash-b",
            "model_id": "google/gemini-2.5-flash"
          },
          "weight": 33
        },
        {
          "variant": {
            "variant_id": "google-ai-studio/gemini-2.5-flash-c",
            "model_id": "google-ai-studio/gemini-2.5-flash"
          },
          "weight": 34
        }
      ]
    }
  }'
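
The request above can also be assembled programmatically. The following is a minimal Python sketch using only the standard library: it rebuilds the same payload, reports each variant's share of traffic, and prepares the request without sending it. It assumes that weights act as relative proportions of traffic, which this page does not state explicitly. The helper names build_router_payload, traffic_shares, and build_request are hypothetical and are not part of any Inworld SDK.

```python
import json
import urllib.request

ROUTER_URL = "https://api.inworld.ai/router/v1/routers"

# (variant_id, model_id, weight) triples copied from the curl example.
VARIANTS = [
    ("google-vertex/gemini-2.5-flash-a", "google-vertex/gemini-2.5-flash", 33),
    ("google/gemini-2.5-flash-b", "google/gemini-2.5-flash", 33),
    ("google-ai-studio/gemini-2.5-flash-c", "google-ai-studio/gemini-2.5-flash", 34),
]


def build_router_payload(name, variants):
    """Build the JSON body for the router-creation request (hypothetical helper)."""
    return {
        "name": name,
        "defaultRoute": {
            "route_id": "default",
            "variants": [
                {"variant": {"variant_id": vid, "model_id": mid}, "weight": weight}
                for vid, mid, weight in variants
            ],
        },
    }


def traffic_shares(payload):
    """Return each variant's fraction of traffic.

    Assumption: weights are relative proportions of requests.
    """
    variants = payload["defaultRoute"]["variants"]
    total = sum(v["weight"] for v in variants)
    return {v["variant"]["variant_id"]: v["weight"] / total for v in variants}


def build_request(payload, api_key):
    """Prepare (but do not send) the POST request shown in the curl example."""
    return urllib.request.Request(
        ROUTER_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Basic {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


payload = build_router_payload("compare-frontier-models", VARIANTS)
request = build_request(payload, "<your-api-key>")

for variant_id, share in traffic_shares(payload).items():
    print(f"{variant_id}: {share:.0%}")

# To actually create the router, pass `request` to urllib.request.urlopen().
```

Keeping payload construction separate from sending makes it easy to inspect or adjust the weights before any request is made.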

Step 2 — Chat Completion

curl --location 'https://api.inworld.ai/v1/chat/completions' \
  --header 'Authorization: Basic <your-api-key>' \
  --header 'Content-Type: application/json' \
  --data '{
    "model": "inworld/compare-frontier-models",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
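
The same chat completion call can be sketched in Python with only the standard library. This sketch makes two assumptions. First, it assumes a router is addressed as "inworld/" followed by the router name, inferred from the example above rather than from a documented rule. Second, it reads the API key from an environment variable named INWORLD_API_KEY, a name chosen here for illustration. The request is only sent when that variable is set, and the response is printed as raw JSON because this page does not describe the response schema.

```python
import json
import os
import urllib.request

CHAT_URL = "https://api.inworld.ai/v1/chat/completions"


def build_chat_request(router_name, user_message, api_key):
    """Prepare the chat completion request from the curl example (hypothetical helper)."""
    body = {
        # Assumption: routers are addressed as "inworld/<router name>".
        "model": f"inworld/{router_name}",
        "messages": [{"role": "user", "content": user_message}],
    }
    return urllib.request.Request(
        CHAT_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Basic {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request):
    """Send the request and return the decoded JSON response."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


# INWORLD_API_KEY is a hypothetical variable name used for this sketch.
api_key = os.environ.get("INWORLD_API_KEY")
request = build_chat_request(
    "compare-frontier-models", "Hello!", api_key or "<your-api-key>"
)

if api_key:
    # Only contact the API when a real key is available.
    print(json.dumps(send(request), indent=2))
```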

Gemini 2.5 Flash pricing and providers

Access Gemini 2.5 Flash through Inworld Router or the Realtime API via three inference providers. Compare pricing and specs across providers below. By using the AI provider(s) below, you acknowledge that you have reviewed and agree to their terms, listed in the Legal section under each AI provider's name.
Provider           Context  Max output  Input ($/1M tokens)  Output ($/1M tokens)  Additional capabilities
Google Vertex      1M       65.5K       $0.30                $2.50                 +5
Google             1M       65.5K       $0.30                $2.50                 +4
Google AI Studio   1M       65.5K       $0.30                $2.50                 +5
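
As a worked example of the listed prices, the sketch below estimates the cost of a single request at $0.30 per million input tokens and $2.50 per million output tokens. It assumes billing is strictly linear in token count, with no caching discounts, tiers, or minimums, none of which this page specifies. The function name estimate_cost is hypothetical.

```python
from decimal import Decimal

# Listed Gemini 2.5 Flash prices, in US dollars per one million tokens.
INPUT_PRICE_PER_MILLION = Decimal("0.30")
OUTPUT_PRICE_PER_MILLION = Decimal("2.50")
MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens):
    """Estimate request cost in dollars, assuming purely linear per-token billing."""
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_MILLION
    return (input_cost + output_cost) / MILLION


# 200,000 input tokens cost $0.06 and 10,000 output tokens cost $0.025.
cost = estimate_cost(200_000, 10_000)
print(f"${cost:.4f}")  # prints $0.0850
```

Decimal is used instead of float so that small per-token amounts add up without binary rounding error.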

Other Google models available through Inworld

Compare Gemini 2.5 Flash with other Google models available through Inworld Router or the Realtime API.
Model                                  Context  Max output  Input ($/1M tokens)  Output ($/1M tokens)  Additional capabilities
Gemini 2.5 Flash Image                 32.8K    32.8K       $0.30                $2.50                 -
Gemini 2.5 Flash Lite                  1M       65.5K       $0.10                $0.40                 +5
Gemini 2.5 Flash Lite Preview 09 2025  1M       65.5K       $0.10                $0.40                 +5
Gemini 2.5 PRO                         1M       65.5K       $1.25                $10.00                +5
Gemini 3 Flash Preview                 1M       65.5K       $0.50                $3.00                 +5
Gemini 3.1 Flash Image Preview         65.5K    32.8K       $0.50                $3.00                 +2
Gemini 3.1 Flash Lite                  1M       65.5K       $0.25                $1.50                 +5
Gemini 3.1 PRO Preview                 1M       65.5K       $2.00                $12.00                +5
Gemini 3.1 PRO Preview Customtools     1M       65.5K       $2.00                $12.00                +5
Gemini 3.5 Flash                       1M       65.5K       $1.50                $9.00                 +5
Gemma 4 26b A4b It Maas                128K     -           $0.15                $0.60                 +2
Google/gemma 3 12b It                  131.1K   131.1K      $0.05                $0.15                 +1
Google/gemma 3 27b It                  131.1K   131.1K      $0.08                $0.16                 +1
Google/gemma 3 4b It                   131.1K   131.1K      $0.05                $0.10                 -
Google/gemma 4 26B A4B It              262.1K   -           $0.07                $0.34                 +3
Google/gemma 4 31B It                  262.1K   -           $0.13                $0.38                 +3
Models/gemma 4 26b A4b It              262.1K   -           $0.07                $0.34                 +2
Models/gemma 4 31b It                  262.1K   -           $0.13                $0.38                 +2

Start building with Gemini 2.5 Flash

Access Gemini 2.5 Flash and every other model through Inworld Router or the Realtime API. Create your free account and start routing in minutes.
Copyright © 2021-2026 Inworld AI