Nemotron 3 Super API Pricing
NVIDIA · 128K context window · Best for NVIDIA-optimized for TensorRT-LLM
Last updated April 25, 2026 · Data refreshed every 6 hours
Input
$0.20
Output
$0.20
Context
128K
Vendor
NVIDIA
All prices in USD per million tokens. Live as of April 25, 2026.
Cost Examples
| Workload | Tokens | Cost |
|---|---|---|
| 1M chat messages (~500 tok in / 500 tok out) | 1000M | $200.00 |
| Process 10K PDFs (~2K tok each, output 200 tok) | 22M | $4.40 |
| Generate 100K tweets (~280 tok out, 50 tok prompt) | 33M | $6.60 |
| 1 chatbot conversation per user × 10K users (~5K tok each) | 50M | $10.00 |
| Summarize 1K technical articles (~5K in / 500 out) | 6M | $1.10 |
Related Tools & Pages