16GB VRAM GPU Cloud Rental for AI & LLMs
14 GPU models with approximately 16GB VRAM, prices from $0.013/hr. Best for: Stable Diffusion, ~7B model inference, fine-tuning small models.
Last updated May 26, 2026 · Data refreshed every 6 hours
GPU Models
14
Cheapest
$0.013/hr
Best $/GB
$0.001/GB-hr
Total Instances
2243
Will this VRAM tier run my model?
16GB VRAM is best for Stable Diffusion, ~7B model inference, fine-tuning small models. For LLM inference, leave headroom for quantization format, batch size, and KV cache: long context windows can add meaningful VRAM pressure even when the base model fits.
All 16GB VRAM GPUs
| GPU Model | Providers | Instances | From / GB | From |
|---|---|---|---|---|
| V100 | 9 | 515 | $0.001/GB-hr | $0.013/hr |
| RTX5060Ti | 1 | 2 | $0.003/GB-hr | $0.053/hr |
| RTX4070STi | 1 | 4 | $0.004/GB-hr | $0.067/hr |
| T4 | 8 | 1557 | $0.004/GB-hr | $0.068/hr |
| RTX5080 | 2 | 20 | $0.007/GB-hr | $0.107/hr |
| RTX5070Ti | 1 | 10 | $0.007/GB-hr | $0.129/hr |
| RTX2000Ada | 1 | 2 | $0.009/GB-hr | $0.140/hr |
| RTX A4000 | 4 | 9 | $0.009/GB-hr | $0.150/hr |
| RTX4080 | 1 | 4 | $0.010/GB-hr | $0.160/hr |
| RTX4080SUPER | 1 | 20 | $0.011/GB-hr | $0.170/hr |
| RTX 4080 | 1 | 1 | $0.020/GB-hr | $0.320/hr |
| A16 | 1 | 29 | $0.029/GB-hr | $0.471/hr |
| v5litepod-1 | 1 | 10 | $0.032/GB-hr | $0.510/hr |
| P100 | 1 | 60 | $0.040/GB-hr | $0.637/hr |
Other VRAM Tiers