16GB VRAM GPU Cloud Rental for AI & LLMs

14 GPU models with approximately 16GB VRAM, priced from $0.013/hr. Best for: Stable Diffusion, ~7B model inference, and fine-tuning small models.

Last updated May 26, 2026 · Data refreshed every 6 hours
GPU Models: 14
Cheapest: $0.013/hr
Best $/GB: $0.001/GB-hr
Total Instances: 2243

Will this VRAM tier run my model?

16GB VRAM is best suited to Stable Diffusion, inference on models of roughly 7B parameters, and fine-tuning small models. For LLM inference, leave headroom beyond the model weights: the quantization format sets the size of the weights, while batch size and the KV cache add memory on top. Long context windows can add significant VRAM pressure even when the base model fits.
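The headroom rule above can be made concrete with a rough estimate. The sketch below is a back-of-the-envelope calculation, not a measurement: the function names are hypothetical, the model shape (32 layers, 32 KV heads, head dimension 128) is an assumed 7B-class configuration, and the 1 GiB runtime overhead is a placeholder that varies by framework.

```python
GIB = 1024 ** 3


def weight_bytes(n_params, bits_per_param):
    """Memory for the model weights at a given precision."""
    return n_params * bits_per_param / 8


def kv_cache_bytes(n_layers, n_kv_heads, head_dim, context_len,
                   batch_size=1, bytes_per_value=2):
    """Memory for the KV cache: one key and one value tensor per layer."""
    return (2 * n_layers * n_kv_heads * head_dim
            * context_len * batch_size * bytes_per_value)


def estimate_gib(n_params, bits_per_param, n_layers, n_kv_heads, head_dim,
                 context_len, batch_size=1, overhead_gib=1.0):
    """Weights + KV cache + an assumed fixed runtime overhead, in GiB."""
    total = (weight_bytes(n_params, bits_per_param)
             + kv_cache_bytes(n_layers, n_kv_heads, head_dim,
                              context_len, batch_size))
    return total / GIB + overhead_gib


def fits_in_vram(vram_gib, **model):
    """True if the rough estimate stays within the given VRAM budget."""
    return estimate_gib(**model) <= vram_gib


# Assumed 7B-class shape; real models differ (e.g. fewer KV heads with GQA).
SHAPE_7B = dict(n_params=7_000_000_000, n_layers=32, n_kv_heads=32,
                head_dim=128, context_len=4096)

if __name__ == "__main__":
    for bits in (16, 8, 4):
        need = estimate_gib(bits_per_param=bits, **SHAPE_7B)
        verdict = "fits" if need <= 16 else "does not fit"
        print(f"{bits}-bit weights: ~{need:.1f} GiB, {verdict} in 16 GiB")
```

Under these assumptions, 16-bit weights for a 7B model leave almost no room for the KV cache at a 4096-token context, while 8-bit or 4-bit quantization leaves several GiB free. Doubling the context length or batch size doubles the KV cache term.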

All 16GB VRAM GPUs

GPU Model      Providers  Instances  From ($/GB-hr)  From ($/hr)
V100           9          515        $0.001/GB-hr    $0.013/hr
RTX5060Ti      1          2          $0.003/GB-hr    $0.053/hr
RTX4070STi     1          4          $0.004/GB-hr    $0.067/hr
T4             8          1557       $0.004/GB-hr    $0.068/hr
RTX5080        2          20         $0.007/GB-hr    $0.107/hr
RTX5070Ti      1          10         $0.007/GB-hr    $0.129/hr
RTX2000Ada     1          2          $0.009/GB-hr    $0.140/hr
RTX A4000      4          9          $0.009/GB-hr    $0.150/hr
RTX4080        1          4          $0.010/GB-hr    $0.160/hr
RTX4080SUPER   1          20         $0.011/GB-hr    $0.170/hr
RTX 4080       1          1          $0.020/GB-hr    $0.320/hr
A16            1          29         $0.029/GB-hr    $0.471/hr
v5litepod-1    1          10         $0.032/GB-hr    $0.510/hr
P100           1          60         $0.040/GB-hr    $0.637/hr
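
The page does not state how the per-GB price is derived. One plausible reading, shown here as an assumption, is the lowest hourly price divided by the VRAM size. The sketch below applies that to a few rows from the table; the function name is hypothetical, and because the tier is only approximately 16GB, some rows may round differently.

```python
def price_per_gb_hour(hourly_price_usd, vram_gb=16):
    """Assumed derivation: hourly price spread evenly across VRAM."""
    return hourly_price_usd / vram_gb


# (GPU model, lowest hourly price in USD), copied from the table above.
ROWS = [
    ("V100", 0.013),
    ("T4", 0.068),
    ("RTX A4000", 0.150),
    ("A16", 0.471),
]

if __name__ == "__main__":
    # List the cheapest GPUs per GB-hour first.
    for name, hourly in sorted(ROWS, key=lambda row: price_per_gb_hour(row[1])):
        print(f"{name}: ${price_per_gb_hour(hourly):.3f}/GB-hr at ${hourly:.3f}/hr")
```

Within a single VRAM tier the per-GB ranking matches the hourly ranking, so this figure is mainly useful for comparing against GPUs in other tiers.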
Other VRAM Tiers