TokenCost

Gemini 3 Flash

Last verified: April 24, 2026

$0.500/1M input · $3.00/1M output · 1.0M context · Google

budget · real-time · chat · speed · high-volume

When to use Gemini 3 Flash

Google's speed-optimized model. Excellent for real-time applications where latency matters. The thinking budget feature lets you dial reasoning up or down per request.

  • Fast inference with low latency
  • 1M token context window
  • Configurable thinking budget
  • Very competitive pricing at $0.50/1M input

Pricing Details

Provider: Google. Available in preview only (gemini-3-flash-preview). Thinking is on by default at the high level, and thinking tokens are billed as output.

  • Batch: $0.25/1M input, $1.50/1M output
  • Context cache: $0.05/1M tokens read, $1.00/MTok-hour
  • Audio input: $1.00/1M tokens

Token counts are estimated (the model uses a different tokenizer).

  • Input / 1M tokens: $0.50
  • Output / 1M tokens: $3.00
  • Context Window: 1.0M tokens
  • Max Output: 66K tokens
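
The rates above translate into per-request and monthly figures with simple arithmetic. The following is a minimal sketch, not an official calculator: the function and class names are hypothetical, the 30-day month is an assumption, and "$1.00/MTok-hour" is read here as a storage charge per 1M cached tokens per hour. Because thinking tokens are billed as output, they are added to the visible output tokens before pricing.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class Rates:
    """USD per 1M tokens, copied from the pricing details on this page."""
    input_per_m: float
    output_per_m: float


STANDARD = Rates(input_per_m=0.50, output_per_m=3.00)
BATCH = Rates(input_per_m=0.25, output_per_m=1.50)

CACHE_READ_PER_M = 0.05          # USD per 1M cached tokens read
CACHE_STORAGE_PER_M_HOUR = 1.00  # assumed meaning of "$1.00/MTok-hour"


def request_cost(input_tokens, output_tokens, thinking_tokens=0, rates=STANDARD):
    """Cost in USD of one request; thinking tokens are priced as output."""
    billed_output = output_tokens + thinking_tokens
    total = input_tokens * rates.input_per_m + billed_output * rates.output_per_m
    return total / TOKENS_PER_MILLION


def monthly_cost(requests_per_day, input_tokens, output_tokens,
                 thinking_tokens=0, rates=STANDARD, days=30):
    """Cost in USD of a steady workload over `days` days (30 is an assumption)."""
    per_request = request_cost(input_tokens, output_tokens, thinking_tokens, rates)
    return requests_per_day * days * per_request


def cache_cost(cached_tokens, reads, hours_stored):
    """Cost in USD of reading a cached context `reads` times and storing it."""
    millions = cached_tokens / TOKENS_PER_MILLION
    return millions * (reads * CACHE_READ_PER_M
                       + hours_stored * CACHE_STORAGE_PER_M_HOUR)


if __name__ == "__main__":
    # Example workload: 2,000 input, 500 visible output, 300 thinking tokens.
    print(f"standard request: ${request_cost(2000, 500, 300):.4f}")
    print(f"batch request:    ${request_cost(2000, 500, 300, rates=BATCH):.4f}")
    print(f"monthly at 10k requests/day: ${monthly_cost(10_000, 2000, 500, 300):.2f}")
    print(f"100k-token cache, 50 reads, 2 hours: ${cache_cost(100_000, 50, 2):.2f}")
```

For the example workload, one request comes to $0.0034 at standard rates and $0.0017 in batch, of which the 300 thinking tokens account for $0.0009 and $0.00045 respectively. Since token counts are only estimated, treat the results as approximations.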

Price History

Launched at current price on 2025-12-17. No price changes recorded.
