The lineup

All Meta models

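| Model | Vendor · modalities | Status | Context | Input $/M | Output $/M | Blended $/M | Cutoff |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Llama 4 Maverick (17B-128E Instruct) | Meta · text, image | GA | 1M | $0.15 | $0.6 | $0.262 | 2024-08 |
| Llama 4 Scout (17B-16E Instruct) | Meta · text, image | GA | 10M | $0.1 | $0.3 | $0.15 | 2024-08 |
| Llama 3.3 70B Instruct | Meta · text | GA | 128K | $0.1 | $0.32 | $0.155 | 2023-12 |
| Llama 3.1 8B Instruct | Meta · text | GA | 128K | $0.02 | $0.03 | $0.022 | 2023-12 |
| Llama 3.3 8B Instruct (Llama API) | Meta · text | GA | 128K | - | - | - | - |
| Llama 3.1 405B Instruct | Meta · text | GA | 128K | - | - | - | 2023-12 |
| Llama 3.1 70B Instruct | Meta · text | GA | 128K | - | - | - | 2023-12 |
| Llama 3.2 90B Vision Instruct | Meta · text, image | GA | 128K | - | - | - | 2023-12 |
| Llama 3.2 11B Vision Instruct | Meta · text, image | GA | 128K | - | - | - | 2023-12 |
| Llama 3.2 3B Instruct | Meta · text | GA | 128K | - | - | - | 2023-12 |
| Llama 3.2 1B Instruct | Meta · text | GA | 128K | - | - | - | 2023-12 |
| Llama 4 Behemoth (preview) | Meta · text, image | Preview | - | - | - | - | - |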

Blended = 0.75 × input + 0.25 × output, in $ per 1M tokens. The weights correspond to a 3:1 ratio of input to output tokens, which gives a fair single-number cost proxy.
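As a minimal sketch, the blended formula can be written out in Python. The function name `blended_price` and its parameter names are illustrative and do not come from this page; the sample prices are the Llama 4 Maverick and Llama 3.1 8B Instruct figures listed in the table.

```python
def blended_price(input_per_m: float, output_per_m: float) -> float:
    """Blended $/M tokens: 75% weight on input price, 25% on output price."""
    return 0.75 * input_per_m + 0.25 * output_per_m


# Prices in $ per 1M tokens, taken from the table above.
maverick = blended_price(0.15, 0.6)   # Llama 4 Maverick
llama_8b = blended_price(0.02, 0.03)  # Llama 3.1 8B Instruct

print(f"Maverick blended: ${maverick:.4f}/M")
print(f"Llama 3.1 8B blended: ${llama_8b:.4f}/M")
```

The table shows these values to three decimal places, so a computed figure may differ from the listed one in the last digit.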

FAQ

Meta pricing & models

What is the cheapest Meta model?

Llama 3.1 8B Instruct is the cheapest generally-available Meta model we track, at $0.02 per 1M input tokens and $0.03 per 1M output tokens ($0.022/1M blended).

What is Meta's flagship model?

Llama 4 Maverick (17B-128E Instruct) is Meta's most prominent model in our catalog, with a 1M-token context window and pricing of $0.15/$0.6 per 1M input/output tokens.

How many Meta models are there?

We track 12 Meta models, of which 11 are generally available.
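The FAQ answers above can be re-derived from the table. The sketch below is illustrative only: the `Model` class, the `CATALOG` list, and the helper `blended` are hypothetical names, not part of any published API, and the data is copied by hand from the table (models without listed prices carry `None`).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Model:
    name: str
    status: str                            # "GA" or "Preview"
    input_price: Optional[float] = None    # $ per 1M input tokens
    output_price: Optional[float] = None   # $ per 1M output tokens


# Hand-copied from the table; prices are None where the table lists none.
CATALOG = [
    Model("Llama 4 Maverick (17B-128E Instruct)", "GA", 0.15, 0.6),
    Model("Llama 4 Scout (17B-16E Instruct)", "GA", 0.1, 0.3),
    Model("Llama 3.3 70B Instruct", "GA", 0.1, 0.32),
    Model("Llama 3.1 8B Instruct", "GA", 0.02, 0.03),
    Model("Llama 3.3 8B Instruct (Llama API)", "GA"),
    Model("Llama 3.1 405B Instruct", "GA"),
    Model("Llama 3.1 70B Instruct", "GA"),
    Model("Llama 3.2 90B Vision Instruct", "GA"),
    Model("Llama 3.2 11B Vision Instruct", "GA"),
    Model("Llama 3.2 3B Instruct", "GA"),
    Model("Llama 3.2 1B Instruct", "GA"),
    Model("Llama 4 Behemoth (preview)", "Preview"),
]


def blended(m: Model) -> float:
    """Blended $/M tokens, using the 0.75 / 0.25 weighting from this page."""
    return 0.75 * m.input_price + 0.25 * m.output_price


ga_models = [m for m in CATALOG if m.status == "GA"]
# Only models with both prices listed can be compared on cost.
priced = [m for m in ga_models
          if m.input_price is not None and m.output_price is not None]
cheapest = min(priced, key=blended)

print(f"{len(CATALOG)} models tracked, {len(ga_models)} generally available")
print(f"Cheapest priced GA model: {cheapest.name}")
```

Note that "cheapest" here can only be decided among the four models with listed prices; the unpriced rows are excluded rather than treated as free.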