40GB VRAM GPU Cloud Rental for AI & LLMs
3 GPU models with approximately 40GB VRAM, prices from $0.080/hr. Best for: 30B model inference, fine-tuning medium models.
Last updated May 26, 2026 · Data refreshed every 6 hours
GPU Models
3
Cheapest
$0.080/hr
Best $/GB
$0.002/GB-hr
Total Instances
244
Will this VRAM tier run my model?
40GB VRAM is best for 30B model inference, fine-tuning medium models. For LLM inference, leave headroom for quantization format, batch size, and KV cache: long context windows can add meaningful VRAM pressure even when the base model fits.
All 40GB VRAM GPUs
Other VRAM Tiers