Qwen 3 Coder API Pricing
Alibaba (open weights) · 128K context window · Best for open code model competitive with Codestral
Last updated April 25, 2026 · Data refreshed every 6 hours
Input
$0.40
Output
$1.20
Context
128K
Vendor
Alibaba (open weights)
All prices in USD per million tokens. Live as of April 25, 2026.
Cost Examples
| Workload | Tokens | Cost |
|---|---|---|
| 1M chat messages (~500 tok in / 500 tok out) | 1000M | $800.00 |
| Process 10K PDFs (~2K tok each, output 200 tok) | 22M | $10.40 |
| Generate 100K tweets (~280 tok out, 50 tok prompt) | 33M | $35.60 |
| 1 chatbot conversation per user × 10K users (~5K tok each) | 50M | $40.00 |
| Summarize 1K technical articles (~5K in / 500 out) | 6M | $2.60 |
Self-Hosting on Cloud GPUs
Because Qwen 3 Coder has open weights, you can self-host on cloud GPUs instead of paying API rates. Estimated cost on a single H100 spot instance:
API cost / 1M tokens
$0.80
Self-host / 1M tokens (H100 spot)
~$0.667
Self-hosting wins when your sustained throughput justifies a dedicated GPU. See H100 pricing.
Related Tools & Pages