48GB VRAM GPU Cloud Rental for AI & LLMs

7 GPU models with approximately 48GB VRAM, prices from $0.172/hr. Best for: 34B inference, multi-batch serving, image generation pipelines.

Last updated May 26, 2026 · Data refreshed every 6 hours
GPU Models
7
Cheapest
$0.172/hr
Best $/GB
$0.004/GB-hr
Total Instances
192

Will this VRAM tier run my model?

48GB VRAM is best for 34B inference, multi-batch serving, image generation pipelines. For LLM inference, leave headroom for quantization format, batch size, and KV cache: long context windows can add meaningful VRAM pressure even when the base model fits.

All 48GB VRAM GPUs

GPU Model Providers Instances From / GB From
A6000 1 8 $0.004/GB-hr $0.172/hr
A40 5 32 $0.004/GB-hr $0.200/hr
RTX6000Ada 2 22 $0.006/GB-hr $0.289/hr
L40S 18 84 $0.007/GB-hr $0.320/hr
RTX A6000 10 27 $0.010/GB-hr $0.490/hr
L40 4 10 $0.014/GB-hr $0.690/hr
RTX 6000 Ada 3 9 $0.012/GB-hr $0.970/hr
Other VRAM Tiers