Models / Alibaba

Qwen3-Max

GA

Previous flagship Max, alias = qwen3-max-2026-01-23. Official International TIERED pricing: 0<token<=32K: $1.2 in / $6 out; 32K<token<=128K: $2.4 / $12; 128K<token<=256K: $3 / $15. Non-Thinking and Thinking modes. Context window 262,144 (256K) and max output 32,768 per OpenRouter (Alibaba spec table is SPA-rendered, not directly fetchable); knowledge cutoff Jun 2025 per OpenRouter. SCHEDULED DEPRECATION: the qwen3-max alias and qwen3-max-preview are listed for deprecation 2026-09-08 00:00:00 -> replacement qwen3.7-max; dated snapshots qwen3-max-2026-01-23 and qwen3-max-2025-09-23 are listed fo

Lifecycle warning

This model is deprecated. Scheduled shutdown: 8 Sep 2026. Recommended migration: qwen3.7-max.

Provider
Alibaba
Status
GA
Input price
$1.20 / 1M tokens
Output price
$6 / 1M tokens
Cached input
Blended price
$2.40 / 1M tokens
Context window
262,144 tokens (262K)
Max output
32,768 tokens
Modality
text
Knowledge cutoff
2025-06
Released
23 Sep 2025
API string
qwen3-max

Source: Alibaba official documentation ↗