Gemma 3 4B API Pricing

Name: Gemma 3 4B API
Brand: Google (open weights)
Price: 0.04 USD

Google (open weights) · 128K context window · Best for tiny Google open model, edge use

Last updated May 26, 2026 · Data refreshed every 6 hours

Input

$0.04

Output

$0.04

Context

128K

Vendor

Google (open weights)

All prices in USD per million tokens. Live as of May 26, 2026.

Cost Examples

Workload	Tokens	Cost
1M chat messages (~500 tok in / 500 tok out)	1000M	$40.00
Process 10K PDFs (~2K tok each, output 200 tok)	22M	$0.88
Generate 100K tweets (~280 tok out, 50 tok prompt)	33M	$1.32
1 chatbot conversation per user × 10K users (~5K tok each)	50M	$2.00
Summarize 1K technical articles (~5K in / 500 out)	6M	$0.22

Because Gemma 3 4B has open weights, you can self-host on cloud GPUs instead of paying API rates. Estimated cost on a single H100 spot instance:

API cost / 1M tokens

$0.04

Self-host / 1M tokens (H100 spot)

~$0.667

Self-hosting wins when your sustained throughput justifies a dedicated GPU. See H100 pricing.

Related Tools & Pages