141GB VRAM GPU Cloud Rental for AI & LLMs
2 GPU models with approximately 141GB VRAM, prices from $1.19/hr. Best for: 70B at full precision, multi-batch 70B serving.
Last updated May 26, 2026 · Data refreshed every 6 hours
GPU Models
2
Cheapest
$1.19/hr
Best $/GB
$0.008/GB-hr
Total Instances
136
Will this VRAM tier run my model?
141GB VRAM is best for 70B at full precision, multi-batch 70B serving. For LLM inference, leave headroom for quantization format, batch size, and KV cache: long context windows can add meaningful VRAM pressure even when the base model fits.
All 141GB VRAM GPUs
Other VRAM Tiers