GPU Cloud Pricing Answers
Direct, citation-ready answers about cloud GPU prices, H100 cost, A100 pricing, RTX 4090 rental, spot vs on-demand GPUs, and LLM inference GPUs.
Last updated May 26, 2026 · Data refreshed every 6 hours
What is the cheapest RTX 5090 cloud GPU?
The cheapest RTX 5090 cloud listing currently tracked is Vast.ai at $0.094/hr (Spot in US-East).
What is the cheapest RTX 5080 cloud GPU?
The cheapest RTX 5080 cloud listing currently tracked is Vast.ai at $0.107/hr (Spot in sk-slovakia).
What is the cheapest cloud GPU right now?
The cheapest cloud GPU in the current GPU Tracker dataset is RTX5070 from Vast.ai at $0.007/hr (Spot in US-West).
What is the cheapest H100 GPU cloud provider?
The cheapest H100 listing currently tracked is Verda at $0.801/hr for H100 (Spot in EU-Central).
How much does an H100 GPU cost per hour?
H100 cloud GPU prices in GPU Tracker range from $0.801/hr to $97.44/hr across 412 tracked listings. The median tracked H100 price is $8.75/hr.
What is the cheapest A100 GPU cloud provider?
The cheapest A100 listing currently tracked is Vast.ai at $0.080/hr for A100 (Spot in EU-Central).
How much does an A100 GPU cost per hour?
A100 cloud GPU prices in GPU Tracker range from $0.080/hr to $65.54/hr across 445 tracked listings. The median tracked A100 price is $6.76/hr.
What is the cheapest RTX 4090 cloud GPU?
The cheapest RTX 4090 cloud listing currently tracked is Vast.ai at $0.131/hr (Spot in an unspecified region).
What is the best GPU for LLM inference?
For 7B-13B models, RTX 4090 and L40S usually offer the best cost-performance. For 70B models, use H100, H200, or A100 80GB depending on precision and latency needs. GPU Tracker links those recommendations to live hourly prices.
What is the best GPU for running a 70B parameter LLM?
For 70B LLM inference, start with 80GB+ total VRAM. The cheapest currently tracked 80GB+ option is GCP RTXPRO6000 at $0.304/hr. Quantized 70B models can sometimes run on 48GB, but 80GB leaves safer KV-cache and context headroom.
What is the cheapest 24GB VRAM GPU for Stable Diffusion XL?
The cheapest currently tracked GPU with at least 24GB total VRAM is Vast.ai RTX3090 at $0.021/hr. 24GB is the practical sweet spot for SDXL, FLUX experiments, and larger image batches.
What is the cheapest 48GB VRAM GPU setup?
The cheapest currently tracked setup with at least 48GB total VRAM is Vast.ai V100 at $0.053/hr. 48GB is useful for 30B-34B inference, larger LoRA jobs, and heavier image/video pipelines.
Which cloud GPU has the lowest price per GB of VRAM?
Vast.ai RTX5070 currently has the lowest tracked price per GB of total VRAM at about $0.001/GB-hour ($0.007/hr for 12GB total VRAM).
Is 16GB VRAM enough for AI video generation?
16GB VRAM is enough for some lightweight AI image workflows and smaller local models, but it is usually tight for serious AI video generation. For 2026 video workflows, 24GB is the practical minimum and 48GB+ is safer for longer clips, higher resolution, and larger batches.
What is the best cloud GPU for Stable Diffusion?
RTX 4090 is usually the best value cloud GPU for Stable Diffusion and SDXL because it has 24GB VRAM and strong image-generation throughput. RTX 3090 is the budget pick; L40S is better for larger batches and production pipelines.
Which cloud GPUs have 80GB VRAM?
Common 80GB cloud GPUs include A100 80GB, H100 80GB, and H100 SXM variants. GPU Tracker keeps a live 80GB VRAM view so buyers can compare providers, regions, and spot vs on-demand prices.
Are spot GPUs cheaper than on-demand GPUs?
Yes. In the current dataset, the cheapest spot GPU is Vast.ai RTX5070 at $0.007/hr, while the cheapest non-spot listing is Vast.ai V100 at $0.034/hr. Spot is cheaper but can be interrupted.
Which GPU cloud provider has the most listings?
GCP has the most tracked GPU listings in the current GPU Tracker dataset, with 2074 listings across 16 GPU models.
Is there a GPU cloud pricing API?
Yes. GPU Tracker publishes a machine-readable GPU pricing dataset at /gpu-data.json and API documentation at /api-docs. The dataset includes provider, instance name, GPU model, VRAM, region, commitment type, availability, price per hour, and last updated timestamp.
Machine-readable sources