48GB VRAM GPU Cloud Rental for AI & LLMs

7 GPU models with approximately 48GB VRAM, prices from $0.172/hr. Best for: 34B inference, multi-batch serving, image generation pipelines.

Last updated May 26, 2026 · Data refreshed every 6 hours

GPU Models

Cheapest

$0.172/hr

Best $/GB

$0.004/GB-hr

Total Instances

192

Will this VRAM tier run my model?

48GB VRAM is best for 34B inference, multi-batch serving, image generation pipelines. For LLM inference, leave headroom for quantization format, batch size, and KV cache: long context windows can add meaningful VRAM pressure even when the base model fits.

All 48GB VRAM GPUs

GPU Model	Providers	Instances	From / GB	From
A6000	1	8	$0.004/GB-hr	$0.172/hr
A40	5	32	$0.004/GB-hr	$0.200/hr
RTX6000Ada	2	22	$0.006/GB-hr	$0.289/hr
L40S	18	84	$0.007/GB-hr	$0.320/hr
RTX A6000	10	27	$0.010/GB-hr	$0.490/hr
L40	4	10	$0.014/GB-hr	$0.690/hr
RTX 6000 Ada	3	9	$0.012/GB-hr	$0.970/hr

Other VRAM Tiers

→ 16GB VRAM → 24GB VRAM → 40GB VRAM → 80GB VRAM → 141GB VRAM → 180GB VRAM