141GB VRAM GPU Cloud Rental for AI & LLMs

2 GPU models with approximately 141GB VRAM, prices from $1.19/hr. Best for: 70B at full precision, multi-batch 70B serving.

Last updated May 26, 2026 · Data refreshed every 6 hours
GPU Models
2
Cheapest
$1.19/hr
Best $/GB
$0.008/GB-hr
Total Instances
136

Will this VRAM tier run my model?

141GB VRAM is best for 70B at full precision, multi-batch 70B serving. For LLM inference, leave headroom for quantization format, batch size, and KV cache: long context windows can add meaningful VRAM pressure even when the base model fits.

All 141GB VRAM GPUs

GPU Model Providers Instances From / GB From
H200 9 133 $0.008/GB-hr $1.19/hr
H200 SXM 3 3 $0.025/GB-hr $3.50/hr
Other VRAM Tiers