40GB VRAM GPU Cloud Rental for AI & LLMs

3 GPU models with approximately 40GB VRAM, prices from $0.080/hr. Best for: 30B model inference, fine-tuning medium models.

Last updated May 26, 2026 · Data refreshed every 6 hours

GPU Models

Cheapest

$0.080/hr

Best $/GB

$0.002/GB-hr

Total Instances

244

Will this VRAM tier run my model?

40GB VRAM is best for 30B model inference, fine-tuning medium models. For LLM inference, leave headroom for quantization format, batch size, and KV cache: long context windows can add meaningful VRAM pressure even when the base model fits.

All 40GB VRAM GPUs

GPU Model	Providers	Instances	From / GB	From
A100	9	139	$0.002/GB-hr	$0.080/hr
L40S	1	96	$0.007/GB-hr	$0.446/hr
A100 40GB	9	9	$0.021/GB-hr	$0.850/hr

Other VRAM Tiers

→ 16GB VRAM → 24GB VRAM → 48GB VRAM → 80GB VRAM → 141GB VRAM → 180GB VRAM