Rerank 4 Fast (rerank-v4.0-fast)

Low-latency "light" version of Rerank 4 with a 32K-token context window. Pricing is per search rather than per token: one search is 1 query plus up to 100 documents, at roughly $0.002 per search (third-party figures from eesel.ai, pecollective, and aipricing.guru, 2026). The per-token price fields are therefore left blank. A dedicated Model Vault Medium instance costs $5.00/hr or $3,250/mo (cohere.com/pricing). Cohere has not published a knowledge cutoff or release date.
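The per-search pricing above can be turned into a rough monthly estimate. The sketch below is illustrative only: the function names are hypothetical, the $0.002 figure is the approximate third-party number quoted above, and it assumes a query with more than 100 documents is billed as additional searches in blocks of 100, which this page does not confirm.

```python
import math
from decimal import Decimal

PRICE_PER_SEARCH_USD = Decimal("0.002")   # approximate third-party figure quoted above
DOCS_PER_SEARCH = 100                     # one search = 1 query + up to 100 documents
DEDICATED_MONTHLY_USD = Decimal("3250")   # Model Vault Medium monthly rate quoted above


def searches_billed(docs_per_query: int) -> int:
    """Searches billed for one query.

    Assumption (not confirmed by this page): documents beyond the first 100
    are billed as extra searches, one per additional block of up to 100.
    """
    return max(1, math.ceil(docs_per_query / DOCS_PER_SEARCH))


def monthly_cost(queries_per_month: int, docs_per_query: int) -> Decimal:
    """Estimated pay-per-search cost in USD for one month of traffic."""
    return queries_per_month * searches_billed(docs_per_query) * PRICE_PER_SEARCH_USD


def break_even_searches() -> int:
    """Monthly search count at which per-search spend equals the dedicated monthly rate."""
    return int(DEDICATED_MONTHLY_USD / PRICE_PER_SEARCH_USD)


if __name__ == "__main__":
    print(monthly_cost(50_000, 250))
    print(break_even_searches())
```

Under these assumptions, the dedicated instance only becomes cheaper than pay-per-search pricing at a very high monthly search volume; throughput limits of the dedicated instance are not modeled here.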

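Provider: Cohere
Status: GA
Input price: n/a (priced per search)
Output price: n/a (priced per search)
Cached input: n/a (priced per search)
Blended price: n/a (priced per search)
Context window: 32,000 tokens (32K)
Max output: not listed
Modality: text
Knowledge cutoff: not published
Released: not published
API string: rerank-v4.0-fast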

Source: Cohere official documentation
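As a hedged illustration of where the API string is used, the sketch below builds (but does not send) a rerank request with the Python standard library. The endpoint URL, header names, and payload fields (`query`, `documents`, `top_n`) reflect Cohere's v2 rerank API as commonly documented and should be checked against the official API reference; `build_rerank_request` is a hypothetical helper, not part of any SDK.

```python
import json
import urllib.request

# Assumed endpoint; verify against Cohere's API reference before use.
RERANK_URL = "https://api.cohere.com/v2/rerank"
MODEL_ID = "rerank-v4.0-fast"  # API string from this page


def build_rerank_request(api_key: str, query: str, documents: list[str],
                         top_n: int = 3) -> urllib.request.Request:
    """Build a POST request asking the model to rank `documents` against `query`.

    Hypothetical helper for illustration; nothing is sent over the network here.
    """
    payload = {
        "model": MODEL_ID,
        "query": query,
        "documents": documents,  # up to 100 documents counts as one search
        "top_n": top_n,          # number of top-ranked results to return
    }
    return urllib.request.Request(
        RERANK_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_rerank_request(
        "YOUR_API_KEY",
        "What is the capital of France?",
        ["Paris is the capital of France.", "Berlin is in Germany."],
        top_n=1,
    )
    print(req.full_url, req.get_method())
```

With a valid key, the request could be sent with `urllib.request.urlopen(req)`; the exact response schema should be taken from Cohere's documentation rather than from this sketch.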