Models / Google

Gemini 2.0 Flash-Lite

Retired

RETIRED/shut down 2026-06-01 (deprecations page, alongside gemini-2.0-flash-lite-001). No longer callable as of 2026-06-28. Context caching not offered (cached input null). Max output 8,192 tokens. Knowledge cutoff not separately confirmed on a dedicated spec page; inferred August 2024 same as gemini-2.0-flash family - LOW confidence, treat as approximate. Modalities inferred from the 2.0 family - moderate confidence. Replacement gemini-3.1-flash-lite. Released 2025-02-25 per deprecations page.

Lifecycle warning

This model has been retired. Scheduled shutdown: 1 Jun 2026. Recommended migration: gemini-3.1-flash-lite.

Provider
Google
Status
Retired
Input price
$0.075 / 1M tokens
Output price
$0.3 / 1M tokens
Cached input
Blended price
$0.131 / 1M tokens
Context window
1,048,576 tokens (1.0M)
Max output
8,192 tokens
Modality
text, image, video, audio
Knowledge cutoff
2024-08
Released
25 Feb 2025
API string
gemini-2.0-flash-lite

Source: Google official documentation ↗