40GB VRAM GPU Cloud Rental for AI & LLMs

3 GPU models with approximately 40GB of VRAM, priced from $0.080/hr. Best for: 30B model inference and fine-tuning medium-sized models.

Last updated May 26, 2026 · Data refreshed every 6 hours
GPU Models: 3
Cheapest: $0.080/hr
Best $/GB: $0.002/GB-hr
Total Instances: 244

Will this VRAM tier run my model?

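40GB VRAM is best suited to 30B model inference and to fine-tuning medium-sized models. A 30B model in FP16 needs about 60GB for its weights alone, so fitting it in this tier generally means 8-bit or 4-bit quantization. For LLM inference, leave headroom beyond the weights: the quantization format, batch size, and KV cache all consume VRAM, and long context windows can add substantial pressure even when the base model itself fits.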
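
The headroom reasoning above can be sketched as a rough back-of-the-envelope estimate. This is a minimal sketch, not a sizing tool: the function name `estimate_vram_gb`, the 10% overhead allowance, and the model shape (60 layers, 8 KV heads, head dimension 128) are all assumptions chosen for illustration and do not describe any particular model. It adds the weight footprint to the KV cache footprint and compares the total against a 40GB tier.

```python
def estimate_vram_gb(params_billion, bytes_per_param, n_layers, n_kv_heads,
                     head_dim, context_len, batch_size=1, kv_bytes_per_value=2,
                     overhead_fraction=0.10):
    """Rough VRAM estimate in GB (1 GB = 1e9 bytes) for LLM inference."""
    # Weights: parameter count times storage width of the quantization format.
    weight_bytes = params_billion * 1e9 * bytes_per_param
    # KV cache: one key and one value vector per layer, per KV head, per token.
    kv_cache_bytes = (2 * n_layers * n_kv_heads * head_dim
                      * context_len * batch_size * kv_bytes_per_value)
    # Flat allowance for activations and runtime buffers (assumed, not measured).
    return (weight_bytes + kv_cache_bytes) * (1 + overhead_fraction) / 1e9


# Hypothetical 30B-parameter model shape, for illustration only.
SHAPE = dict(n_layers=60, n_kv_heads=8, head_dim=128)
TIER_GB = 40

for label, bytes_per_param in [("FP16", 2.0), ("8-bit", 1.0), ("4-bit", 0.5)]:
    need = estimate_vram_gb(30, bytes_per_param, context_len=8192, **SHAPE)
    verdict = "fits" if need <= TIER_GB else "does not fit"
    print(f"{label}: ~{need:.1f} GB, {verdict} in {TIER_GB} GB")
```

Under these assumptions the FP16 case exceeds the tier while the 8-bit and 4-bit cases fit. The KV cache term grows linearly with both `context_len` and `batch_size`, which is why a model that fits at a short context can stop fitting at a long one.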

All 40GB VRAM GPUs

GPU Model   Providers  Instances  From (per GB)   From (hourly)
A100        9          139        $0.002/GB-hr    $0.080/hr
L40S        1          96         $0.007/GB-hr    $0.446/hr
A100 40GB   9          9          $0.021/GB-hr    $0.850/hr