Gemini 3.5 Flash

Generally available (stable). The what's-new page reports the output token limit as 'up to 65,000'; the model spec convention is 65,536, so the limit is recorded here as 65,536. The what's-new page also describes additional output capabilities (images, audio, structured outputs), but the spec table for the core text model lists text output. The model is listed as the recommended replacement for gemini-2.5-flash and the gemini-2.0-flash line. Context caching also incurs a per-hour storage fee, which is not reflected in the prices given here. The what's-new page was last updated 2026-06-24 UTC.

Provider: Google
Status: GA
Input price: $1.50 / 1M tokens
Output price: $9.00 / 1M tokens
Cached input: $0.15 / 1M tokens
Blended price: $3.38 / 1M tokens
Context window: 1,048,576 tokens (1.0M)
Max output: 65,536 tokens
Modality: text, image, video, audio, PDF
Knowledge cutoff: 2025-01
Released: 19 May 2026
API string: gemini-3.5-flash

Source: Google official documentation
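
The listed prices can be turned into a rough per-request estimate. The sketch below is illustrative only and rests on two assumptions that the page does not state: that the blended price is a 3:1 input-to-output weighted average (this reproduces the listed $3.38 after rounding), and that cached tokens are a subset of the input tokens, billed at the cached rate instead of the regular input rate. The names `Pricing`, `blended_price`, and `request_cost` are hypothetical helpers, not part of any Google SDK, and the per-hour cache storage fee is left out.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class Pricing:
    """USD per 1M tokens, copied from the spec table above."""
    input_per_m: float = 1.50
    output_per_m: float = 9.00
    cached_input_per_m: float = 0.15


def blended_price(p: Pricing, input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens.

    Assumption: a 3:1 input:output mix, which matches the listed
    blended figure; the page does not state the ratio it uses.
    """
    total_parts = input_parts + output_parts
    weighted = input_parts * p.input_per_m + output_parts * p.output_per_m
    return weighted / total_parts


def request_cost(p: Pricing, input_tokens: int, output_tokens: int,
                 cached_tokens: int = 0) -> float:
    """Estimated USD cost of one request.

    Assumption: cached tokens are part of input_tokens and are billed
    at the cached rate. The per-hour cache storage fee is NOT included.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * p.input_per_m
             + cached_tokens * p.cached_input_per_m
             + output_tokens * p.output_per_m)
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    pricing = Pricing()
    print(f"blended: ${blended_price(pricing):.3f} / 1M tokens")
    print(f"request: ${request_cost(pricing, 100_000, 2_000, cached_tokens=80_000):.4f}")
```

Under these assumptions the blended figure comes to $3.375 per 1M tokens, which rounds to the listed $3.38. A request with 100,000 input tokens (80,000 of them cached) and 2,000 output tokens would cost about $0.06 before any cache storage fee.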