Llama 3.1 8B (Groq) API Pricing

Groq · 128K context window · Best for ultra-fast 8B inference on Groq LPU

Last updated April 25, 2026 · Data refreshed every 6 hours
Input: $0.05
Output: $0.08
Context: 128K
Vendor: Groq

All prices in USD per million tokens. Live as of April 25, 2026.

Cost Examples

Workload                                                                  Tokens   Cost
1M chat messages (~500 tok in / 500 tok out)                              1000M    $65.00
Process 10K PDFs (~2K tok in each, 200 tok out)                           22M      $1.16
Generate 100K tweets (~50 tok prompt, 280 tok out)                        33M      $2.49
1 chatbot conversation per user × 10K users (~5K tok each, half in/out)   50M      $3.25
Summarize 1K technical articles (~5K tok in / 500 tok out)                5.5M     $0.29
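
The figures above follow from multiplying input and output token counts by their respective per-million rates. Below is a minimal sketch of that arithmetic, assuming the listed prices ($0.05 input, $0.08 output) still apply. The function name `estimate_cost` and the `WORKLOADS` list are illustrative and not part of any Groq SDK.

```python
# Prices in USD per million tokens, copied from the listing above.
# These are assumptions that go stale whenever the vendor reprices.
INPUT_PRICE_PER_M = 0.05
OUTPUT_PRICE_PER_M = 0.08


def estimate_cost(requests: int, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of `requests` calls, each using the given
    number of input and output tokens."""
    total_in = requests * input_tokens
    total_out = requests * output_tokens
    return (total_in * INPUT_PRICE_PER_M + total_out * OUTPUT_PRICE_PER_M) / 1_000_000


# Workloads from the table: (label, requests, input tokens, output tokens).
WORKLOADS = [
    ("1M chat messages", 1_000_000, 500, 500),
    ("10K PDFs", 10_000, 2_000, 200),
    ("100K tweets", 100_000, 50, 280),
    ("1K article summaries", 1_000, 5_000, 500),
]

if __name__ == "__main__":
    for label, n, tok_in, tok_out in WORKLOADS:
        print(f"{label}: ${estimate_cost(n, tok_in, tok_out):.2f}")
```

Run as a script, this reproduces the $65.00, $1.16, $2.49, and $0.29 rows. The chatbot row only gives a combined 5K tokens per conversation; its $3.25 cost corresponds to 2,500 input and 2,500 output tokens per user.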