Gemini API
Official Gemini 3.5 Flash and 3.1 Pro API billed per token—separate from gemini.google consumer subscription quotas
Core models
Fast API flagship (gemini-3.5-flash): $1.50 input / $9 output per MTok, frontier intelligence + search grounding—Free tier for trials.
Strongest Pro preview (gemini-3.1-pro-preview): $2/$12 per MTok (≤200K), multimodal agents and vibe-coding—Paid only.
Best value (gemini-3.1-flash-lite): $0.25/$1.50 per MTok—first choice for high-volume agents and translation.
Prior Pro (gemini-2.5-pro): $1.25/$10 per MTok (≤200K), coding and complex reasoning—prefer 3.x for new work.
Image generation API (gemini-3.1-flash-image): $0.50 per MTok text input, image output priced by resolution (~$0.045–$0.151 per image); billed separately from chat.
Additional models are listed under Plan details; see the latest official pricing page for full details.
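As a rough illustration of per-token billing, the sketch below estimates the cost of a single request from the Standard prices listed above. The function name estimate_cost, the PRICES_PER_MTOK table, and the sample token counts are assumptions made for this example and are not part of any official SDK; the two Pro entries use only the ≤200K-prompt rates.

```python
# Prices in USD per 1M tokens as (input, output), copied from the list above.
# For the two Pro models these are the <=200K-prompt rates only.
PRICES_PER_MTOK = {
    "gemini-3.5-flash": (1.50, 9.00),
    "gemini-3.1-pro-preview": (2.00, 12.00),
    "gemini-3.1-flash-lite": (0.25, 1.50),
    "gemini-2.5-pro": (1.25, 10.00),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at Standard rates."""
    input_price, output_price = PRICES_PER_MTOK[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 10,000 input tokens and 2,000 output tokens.
    for model in PRICES_PER_MTOK:
        print(f"{model}: ${estimate_cost(model, 10_000, 2_000):.4f}")
```

Multiplying the per-request figure by expected daily request volume gives a first-order budget; image, audio, and Live usage are billed differently and are not covered by this sketch.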
Plan details
Gemini 3.5 Flash
Flagship speed (Recommended)
Default for search-grounded fast agent loops and daily production API use; escalate to 3.1 Pro for complex multimodal agents.
Gemini 3.1 Pro
Flagship Pro
Same model family as gemini.google subscription 3.1 Pro, but the API bills per token, independently of Plus/Pro/Ultra usage multipliers.
Gemini 3.1 Flash-Lite
Best value
Audio input costs $0.50 per MTok (Standard) or $0.25 per MTok (Batch); estimate speech pipelines separately.
Gemini 2.5 Pro
Prior-gen Pro
Existing 2.5 Pro integrations can keep running at current billing; migration plans should weigh the multimodal agent gains of 3.1 Pro.
Gemini 2.5 Flash
Hybrid reasoning
Suited to production workloads that need controllable thinking depth and 1M context without 3.5 Flash pricing.
Gemini 3.1 Flash Image
Image generation
Gemini 3.1 Flash Live
Real-time dialogue
Notes
- Prices listed here are for Paid tier Standard processing, in USD per 1M tokens. The Free tier in AI Studio offers free input/output on select models (submitted content may be used to improve products); upgrade to Paid for production.
- Gemini 3.1 Pro and 2.5 Pro use tiered pricing at ≤200K vs >200K prompt tokens (e.g. 3.1 Pro Standard: $2/$12 vs $4/$18 per MTok).
- Batch API is ~50% off input/output; context caching adds storage fees (typically $0.50–$4.50 per 1M tokens/hour, model-dependent).
- Image generation (3.1 Flash Image, etc.) and Live API audio/video use different billing from plain text chat—estimate per use case before integration.
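The tiering, Batch discount, and caching storage fee described in the notes above can be combined into a quick estimate. This is a minimal sketch under stated assumptions: the higher 3.1 Pro rate is applied to the whole request once the prompt exceeds 200K tokens, the Batch discount is taken as exactly 50% (the note only says roughly 50%), and the cache storage rate is supplied by the caller because it varies by model. All function and constant names are hypothetical.

```python
TIER_THRESHOLD = 200_000  # prompt tokens; boundary between the two price tiers

# 3.1 Pro Standard rates from the note, USD per 1M tokens as (input, output).
PRO_31_LOW_TIER = (2.00, 12.00)   # prompt <= 200K tokens
PRO_31_HIGH_TIER = (4.00, 18.00)  # prompt > 200K tokens

BATCH_DISCOUNT = 0.5  # assumption: exactly 50% off input and output


def pro_31_request_cost(prompt_tokens: int, output_tokens: int,
                        batch: bool = False) -> float:
    """Estimate the USD cost of one gemini-3.1-pro-preview request.

    Assumes the tier is chosen by prompt size and applies to every token
    in the request, rather than only to tokens above the threshold.
    """
    if prompt_tokens <= TIER_THRESHOLD:
        input_price, output_price = PRO_31_LOW_TIER
    else:
        input_price, output_price = PRO_31_HIGH_TIER
    cost = (prompt_tokens * input_price + output_tokens * output_price) / 1_000_000
    return cost * BATCH_DISCOUNT if batch else cost


def cache_storage_cost(cached_tokens: int, hours: float,
                       rate_per_mtok_hour: float) -> float:
    """Estimate the USD storage fee for keeping cached context alive.

    rate_per_mtok_hour is model-dependent; the note gives a typical range
    of $0.50 to $4.50 per 1M tokens per hour.
    """
    return cached_tokens / 1_000_000 * hours * rate_per_mtok_hour


if __name__ == "__main__":
    print(f"150K prompt, Standard: ${pro_31_request_cost(150_000, 5_000):.4f}")
    print(f"250K prompt, Standard: ${pro_31_request_cost(250_000, 5_000):.4f}")
    print(f"250K prompt, Batch:    ${pro_31_request_cost(250_000, 5_000, batch=True):.4f}")
    print(f"500K cached for 3 h at $4.50: ${cache_storage_cost(500_000, 3, 4.50):.4f}")
```

Crossing the 200K threshold changes the rate for the entire request under this assumption, so trimming a prompt just below the boundary can cut its cost noticeably; check the official pricing page before relying on that behavior.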
Pricing and model data sourced from official vendor websites