WhichAITry via Vercel AI Gateway
Google · released 2026-01

Gemini 3 Flash

Cheap, fast, 2M-token context.

context: 2M tokens
max output: 32K tokens
input: $0.30 /M tokens
output: $1.20 /M tokens
cached input: $0.07 /M tokens
modalities: T·I·A·V (text, image, audio, video)
Verdict

What it's actually good at

Strengths

  • Unbeatable cost for the context window
  • Very fast

Weaknesses

  • Reasoning lags the Pro/Opus tier

Our scores

reasoning: 7.0
coding: 7.0
writing: 7.0
speed: 10.0
value: 10.0
Flash tier with near-Pro quality on most workloads at a fraction of the cost. Excellent default for high-throughput multimodal apps.
Pricing

Cost at common volumes

Monthly volume   Input       Output      Est. monthly
Side project     1M tok      0.3M tok    $0.66
Growing app      50M tok     15M tok     $33.00
Production       500M tok    150M tok    $330.00
Scale            5,000M tok  1,500M tok  $3,300.00

Estimates assume uncached input. Prompt caching and batch APIs can cut this by 50–90% for many workloads.
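The table figures follow directly from the listed rates ($0.30/M input, $1.20/M output, $0.07/M cached input). A minimal sketch of that arithmetic, with a hypothetical `cached_fraction` parameter to show how prompt caching changes the input side of the bill:

```python
# Per-million-token rates as listed above for Gemini 3 Flash.
INPUT_PER_M = 0.30
OUTPUT_PER_M = 1.20
CACHED_INPUT_PER_M = 0.07

def monthly_cost(input_tokens: float, output_tokens: float,
                 cached_fraction: float = 0.0) -> float:
    """Estimate monthly spend in USD.

    cached_fraction is the share of input tokens served from the
    prompt cache (0.0 = all uncached, matching the table above).
    """
    fresh = input_tokens * (1 - cached_fraction)
    cached = input_tokens * cached_fraction
    cost = (fresh * INPUT_PER_M
            + cached * CACHED_INPUT_PER_M
            + output_tokens * OUTPUT_PER_M) / 1_000_000
    return round(cost, 2)

print(monthly_cost(1e6, 0.3e6))     # side project → 0.66
print(monthly_cost(500e6, 150e6))   # production   → 330.0
print(monthly_cost(500e6, 150e6, cached_fraction=0.8))  # → 238.0
```

At the Production volume, serving 80% of input from the cache cuts the input line from $150 to $58, so most of the remaining bill is output tokens.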
