Models / DeepSeek

DeepSeek-V4-Pro

Preview

Larger/most-capable V4 model (~1.6T total / ~49B active params per HuggingFace model card + authoritative third-party reports; MIT License; mixed FP4/FP8). Context length 1M, max output 384K tokens. Input price $0.435/M cache-miss, $0.003625/M cache-hit; output $0.87/M (USD). Supports three reasoning-effort modes (non-think / think high / think max), JSON output, tool calls; FIM completion non-thinking-mode only. Concurrency limit 500. Part of the 'DeepSeek V4 Preview' generation (released 2026-04-24), hence status=preview. Knowledge cutoff NOT officially published by DeepSeek -> left null. Pr

Provider

DeepSeek

Status

Preview

Input price

$0.435 / 1M tokens

Output price

$0.87 / 1M tokens

Cached input

$0.004 / 1M tokens

Blended price

$0.544 / 1M tokens

Context window

1,000,000 tokens (1M)

Max output

384,000 tokens

Modality

text

Knowledge cutoff

—

Released

24 Apr 2026

API string

deepseek-v4-pro

Source: DeepSeek official documentation ↗

Compare DeepSeek-V4-Pro with…

DeepSeek-V4-Pro vs Claude Opus 4.8→

$0.544 vs $10 blended /M

DeepSeek-V4-Pro vs Claude Opus 4.7→

$0.544 vs $10 blended /M

DeepSeek-V4-Pro vs Claude Opus 4.6→

$0.544 vs $10 blended /M

DeepSeek-V4-Pro vs Claude Opus 4.5→

$0.544 vs $10 blended /M

DeepSeek-V4-Pro

Compare DeepSeek-V4-Pro with…

Track DeepSeek-V4-Pro price & status changes