48GB VRAM GPU Cloud Rental for AI & LLMs
7 GPU models with approximately 48GB VRAM, prices from $0.172/hr. Best for: 34B inference, multi-batch serving, image generation pipelines.
Last updated May 26, 2026 · Data refreshed every 6 hours
GPU Models: 7 · Cheapest: $0.172/hr · Best $/GB: $0.004/GB-hr · Total Instances: 192
Will this VRAM tier run my model?
A 48GB card suits inference on models of roughly 34B parameters, multi-batch serving, and image generation pipelines. For LLM inference, leave headroom beyond the weights themselves: the quantization format sets the weight footprint, and the KV cache grows with batch size and context length, so a long context window can exhaust VRAM even when the base model fits.
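The headroom question can be roughed out with back-of-the-envelope arithmetic. The sketch below is an estimate, not a measurement: the function names are hypothetical, the model shape (48 layers, 8 KV heads, head dimension 128) is an illustrative assumption and not a specific model, and the 10% reserve for activations and runtime overhead is a guess that varies by serving stack.

```python
def weights_gb(params_billion: float, bits_per_param: float) -> float:
    """Approximate weight memory in GB (1 GB = 1e9 bytes)."""
    return params_billion * 1e9 * bits_per_param / 8 / 1e9


def kv_cache_gb(layers: int, kv_heads: int, head_dim: int,
                context_len: int, batch_size: int,
                bytes_per_value: int = 2) -> float:
    """Approximate KV cache size in GB.

    Each layer stores one K and one V tensor (hence the factor of 2),
    each holding kv_heads * head_dim values per token per sequence.
    """
    values = 2 * layers * kv_heads * head_dim * context_len * batch_size
    return values * bytes_per_value / 1e9


def fits(total_gb: float, vram_gb: float = 48.0,
         overhead_fraction: float = 0.10) -> bool:
    """True if total_gb fits after reserving a fraction of VRAM (assumed 10%)."""
    return total_gb <= vram_gb * (1 - overhead_fraction)


if __name__ == "__main__":
    # Hypothetical 34B-parameter model shape, for illustration only.
    cache = kv_cache_gb(layers=48, kv_heads=8, head_dim=128,
                        context_len=16_384, batch_size=4)
    for bits in (16, 8, 4):
        total = weights_gb(34, bits) + cache
        print(f"{bits}-bit weights: {total:.1f} GB total, fits in 48GB: {fits(total)}")
```

Under these assumptions the 16-bit weights alone exceed 48GB, and the 8-bit variant fits on its own but not once the cache for four 16k-token sequences is added, which is the kind of pressure the paragraph above describes.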
All 48GB VRAM GPUs
| GPU Model | Providers | Instances | From ($/GB-hr) | From ($/hr) |
|---|---|---|---|---|
| A6000 | 1 | 8 | $0.004/GB-hr | $0.172/hr |
| A40 | 5 | 32 | $0.004/GB-hr | $0.200/hr |
| RTX6000Ada | 2 | 22 | $0.006/GB-hr | $0.289/hr |
| L40S | 18 | 84 | $0.007/GB-hr | $0.320/hr |
| RTX A6000 | 10 | 27 | $0.010/GB-hr | $0.490/hr |
| L40 | 4 | 10 | $0.014/GB-hr | $0.690/hr |
| RTX 6000 Ada | 3 | 9 | $0.012/GB-hr | $0.970/hr |
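For comparing rows, a per-GB figure can be derived from an hourly price. The sketch below assumes the per-GB value is simply the lowest hourly price divided by 48 GB; the page does not state how its per-GB column is computed, so this formula may not reproduce every listed value. The `price_per_gb_hour` helper is a hypothetical name, and the prices are copied from the table above.

```python
VRAM_GB = 48  # nominal VRAM for every card in this tier

# (model label as listed, lowest hourly price in USD) from the table above
ROWS = [
    ("A6000", 0.172),
    ("A40", 0.200),
    ("RTX6000Ada", 0.289),
    ("L40S", 0.320),
    ("RTX A6000", 0.490),
    ("L40", 0.690),
    ("RTX 6000 Ada", 0.970),
]


def price_per_gb_hour(hourly_usd: float, vram_gb: int = VRAM_GB) -> float:
    """Hourly price divided by VRAM size (assumed derivation, in $/GB-hr)."""
    return hourly_usd / vram_gb


if __name__ == "__main__":
    # List the cheapest cards first, with the derived per-GB price.
    for model, hourly in sorted(ROWS, key=lambda row: row[1]):
        print(f"{model:<14} ${hourly:.3f}/hr  ${price_per_gb_hour(hourly):.4f}/GB-hr")
```

Because every card in this tier has the same nominal VRAM, ranking by hourly price and ranking by this derived per-GB price give the same order.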
Other VRAM Tiers