Models / Meta

Llama 3.1 405B Instruct

GA

Open-weight, text-only, 405B dense, 128k context. License: Llama 3.1 Community License. Released 2024-07-23 alongside 8B/70B. Hosted price NOT reliably extractable in this pass: OpenRouter slug is 'meta-llama/llama-3.1-405b-instruct' (131k ctx) but the per-token figures did not render; Together AI reportedly does NOT serve 405B on its serverless API (their 405B model page states it is not available serverless), and third-party reports put serverless 405B around $3.00-$3.50 per 1M elsewhere (unverified). Prices left null rather than guessed. Source URL is the Llama 3.1 collection card (lists 8B

Provider
Meta
Status
GA
Input price
Output price
Cached input
Blended price
Context window
128,000 tokens (128K)
Max output
Modality
text
Knowledge cutoff
2023-12
Released
23 Jul 2024
API string
meta-llama/Llama-3.1-405B-Instruct

Source: Meta official documentation ↗