Models / DeepSeek

DeepSeek-V4-Pro

Preview

Larger/most-capable V4 model (~1.6T total / ~49B active params per HuggingFace model card + authoritative third-party reports; MIT License; mixed FP4/FP8). Context length 1M, max output 384K tokens. Input price $0.435/M cache-miss, $0.003625/M cache-hit; output $0.87/M (USD). Supports three reasoning-effort modes (non-think / think high / think max), JSON output, tool calls; FIM completion non-thinking-mode only. Concurrency limit 500. Part of the 'DeepSeek V4 Preview' generation (released 2026-04-24), hence status=preview. Knowledge cutoff NOT officially published by DeepSeek -> left null. Pr

Provider
DeepSeek
Status
Preview
Input price
$0.435 / 1M tokens
Output price
$0.87 / 1M tokens
Cached input
$0.004 / 1M tokens
Blended price
$0.544 / 1M tokens
Context window
1,000,000 tokens (1M)
Max output
384,000 tokens
Modality
text
Knowledge cutoff
Released
24 Apr 2026
API string
deepseek-v4-pro

Source: DeepSeek official documentation ↗