Gemini 3 Flash
Last verified: April 24, 2026
$0.50/1M input · $3.00/1M output · 1.0M context · Google
Tags: budget · real-time · chat · speed · high-volume
When to use Gemini 3 Flash
Gemini 3 Flash is Google's speed-optimized model, suited to real-time applications where latency matters. The thinking budget feature lets you dial reasoning up or down per request.
- Fast inference with low latency
- 1M token context window
- Configurable thinking budget
- Competitive pricing at $0.50/1M input
Pricing Details
Provider: Google. Preview only (gemini-3-flash-preview). Thinking is on by default at the high level; thinking tokens are billed as output. Batch: $0.25/1M input, $1.50/1M output. Context cache: $0.05/1M read, $1.00/MTok-hour storage. Audio input: $1.00/1M. Token counts are estimates (different tokenizer).
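As a rough illustration of the context cache rates above, the sketch below compares sending a shared prompt prefix at the standard input rate on every request against reading it from the cache. The function and constant names are hypothetical. It assumes cached reads replace standard input billing for the prefix, and it ignores any cost of creating the cache, which this page does not state.

```python
# Rates taken from this page (USD). Everything else is an illustrative assumption.
INPUT_PER_M = 0.50               # standard input, per 1M tokens
CACHE_READ_PER_M = 0.05          # cached read, per 1M tokens
CACHE_STORAGE_PER_M_HOUR = 1.00  # cache storage, per 1M tokens per hour


def uncached_prefix_cost(prefix_tokens: int, requests: int) -> float:
    """Cost of resending the same prefix as normal input on every request."""
    return prefix_tokens * requests * INPUT_PER_M / 1_000_000


def cached_prefix_cost(prefix_tokens: int, requests: int, hours: float) -> float:
    """Cost of reading the prefix from cache, plus storage for `hours`.

    Assumption: the initial cache write is not billed separately here.
    """
    read = prefix_tokens * requests * CACHE_READ_PER_M / 1_000_000
    storage = prefix_tokens * hours * CACHE_STORAGE_PER_M_HOUR / 1_000_000
    return read + storage


if __name__ == "__main__":
    prefix, requests, hours = 200_000, 100, 1.0
    print(f"uncached: ${uncached_prefix_cost(prefix, requests):.2f}")
    print(f"cached:   ${cached_prefix_cost(prefix, requests, hours):.2f}")
```

Under these assumptions, caching pays off when the prefix is reused often enough within the storage window to offset the hourly storage charge.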
Input / 1M tokens: $0.50
Output / 1M tokens: $3.00
Context Window: 1.0M
Max Output: 66K
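The per-request arithmetic implied by these rates can be sketched as follows. This is a minimal estimate, not an official calculator: the names `Rates` and `request_cost` are hypothetical, the batch rates are read as input/output prices, and thinking tokens are added to the output count because this page states they are billed as output.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rates:
    """USD per 1M tokens, as listed on this page."""
    input_per_m: float
    output_per_m: float


STANDARD = Rates(input_per_m=0.50, output_per_m=3.00)
BATCH = Rates(input_per_m=0.25, output_per_m=1.50)  # assumed input/output order


def request_cost(
    input_tokens: int,
    visible_output_tokens: int,
    thinking_tokens: int = 0,
    rates: Rates = STANDARD,
) -> float:
    """Estimate the cost of one request in USD.

    Thinking tokens are billed at the output rate, so they are added
    to the visible output before pricing.
    """
    billed_output = visible_output_tokens + thinking_tokens
    total = input_tokens * rates.input_per_m + billed_output * rates.output_per_m
    return total / 1_000_000


if __name__ == "__main__":
    one = request_cost(10_000, 1_000, thinking_tokens=2_000)
    print(f"per request: ${one:.4f}")
    print(f"per 100k requests/month: ${one * 100_000:,.2f}")
    print(f"same request via batch: ${request_cost(10_000, 1_000, 2_000, BATCH):.4f}")
```

Because thinking is on by default, leaving `thinking_tokens` at zero will understate the output charge for most requests; token counts are themselves estimates when a different tokenizer is used.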
Price History
Launched at current price on 2025-12-17. No price changes recorded.