Models / Meta

Llama 3.3 8B Instruct (Llama API)

GA

Listed on the official Meta Llama API models page as a lightweight, ultra-fast text-only variant with 128k context (model ID 'Llama-3.3-8B-Instruct'). NOTE/UNCERTAINTY: there is no corresponding standalone 'Llama-3.3-8B' open-weight checkpoint on Meta's Hugging Face org (the 8B open weight in this generation is Llama-3.1-8B-Instruct); this ID appears specific to the hosted Llama API. Pricing, exact release date, and knowledge cutoff not published on the API models page (null). Not separately priced on OpenRouter/Together under this exact name.

Provider
Meta
Status
GA
Input price
Output price
Cached input
Blended price
Context window
128,000 tokens (128K)
Max output
Modality
text
Knowledge cutoff
Released
API string
Llama-3.3-8B-Instruct

Source: Meta official documentation ↗