When will Rubin GPUs be available in the cloud?

Vera Rubin NVL72 racks begin shipping in H2 2026. AWS, Azure, GCP, OCI, CoreWeave, and Lambda Labs have all confirmed Rubin deployments.

How much cheaper is Rubin vs Blackwell for inference?

NVIDIA claims up to 10x lower inference token cost and 4x fewer GPUs for MoE training compared to Blackwell NVL72 systems. These are vendor figures, not independent benchmarks.

Should I wait for Rubin or buy Blackwell now?

It depends on urgency. If you can wait 6 months, Rubin's 10x token economics make waiting rational. If you need compute now, Blackwell is shipping and available.

NVIDIA Vera Rubin Platform

Rubin is coming.
Your GPU strategy should be ready.

Up to 10x cheaper tokens. 4x fewer GPUs for MoE training. The Vera Rubin platform ships in H2 2026 and is likely to reshape cloud GPU pricing. Here's what you need to know and what to do right now.


10× cheaper: Token Cost vs Blackwell
4× fewer: GPUs for MoE Training
288 GB HBM4: HBM per GPU
H2 2026 shipping: Availability

Per-GPU Comparison

Rubin vs Blackwell vs Hopper

Spec	Rubin	Blackwell (B200)	Hopper (H100)
FP4 Inference	~50 PFLOPS	~9 PFLOPS	N/A
FP8 Compute	~25 PFLOPS	~4.5 PFLOPS dense (~9 PFLOPS sparse)	~1.98 PFLOPS dense
HBM Capacity	288 GB HBM4	192 GB HBM3e	80 GB HBM3
Memory Bandwidth	22 TB/s	8 TB/s	3.35 TB/s
NVLink Bandwidth	3.6 TB/s	1.8 TB/s	900 GB/s
Process Node	TSMC 3 nm	TSMC 4 nm	TSMC 4 nm
Transistors	~336B	~208B	~80B
Token Cost (est.)	~$0.005/1M	~$0.05/1M	~$0.20/1M
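
To make the token-cost row concrete, the sketch below turns the estimated per-1M-token prices from the table into a monthly bill. The prices are the table's own estimates; the function names and the 5-billion-token monthly volume are hypothetical, chosen only for illustration.

```python
# Estimated cost per 1M tokens in USD, copied from the table above.
COST_PER_MILLION_TOKENS = {
    "Rubin": 0.005,
    "Blackwell (B200)": 0.05,
    "Hopper (H100)": 0.20,
}


def monthly_token_cost(tokens_per_month: float, cost_per_million: float) -> float:
    """Return the monthly spend in USD for a given token volume."""
    return tokens_per_month / 1_000_000 * cost_per_million


def compare(tokens_per_month: float) -> dict[str, float]:
    """Monthly spend per GPU generation at the table's estimated prices."""
    return {
        gpu: monthly_token_cost(tokens_per_month, price)
        for gpu, price in COST_PER_MILLION_TOKENS.items()
    }


if __name__ == "__main__":
    volume = 5_000_000_000  # hypothetical workload: 5B tokens per month
    for gpu, usd in compare(volume).items():
        print(f"{gpu:<18} ${usd:,.2f}/month")
```

At that assumed volume the estimates work out to roughly $25 per month on Rubin, $250 on Blackwell, and $1,000 on Hopper. Real bills will depend on the model, batch size, and provider pricing.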

Rack-Scale Comparison

NVL72 Rack: Rubin vs Blackwell

Metric	Vera Rubin NVL72	Blackwell NVL72
Total GPUs	72 Rubin	72 B200
FP4 Inference	3.6 EFLOPS	~720 PFLOPS
HBM Capacity	20.7 TB	13.8 TB
HBM Bandwidth	~1.58 PB/s	~576 TB/s
NVLink Rack BW	260 TB/s	130 TB/s
Power Draw	~120–130 kW	~120 kW
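
Most rack-level figures above are simply the per-GPU specs multiplied by 72. The sketch below reproduces that arithmetic using the per-GPU table's values and decimal units (1 TB = 1,000 GB); the dictionary keys and function name are illustrative, not part of any NVIDIA tooling.

```python
GPUS_PER_RACK = 72

# Per-GPU figures taken from the per-GPU comparison table.
PER_GPU = {
    "Rubin": {"fp4_pflops": 50, "hbm_gb": 288, "hbm_tbps": 22, "nvlink_tbps": 3.6},
    "Blackwell (B200)": {"fp4_pflops": 9, "hbm_gb": 192, "hbm_tbps": 8, "nvlink_tbps": 1.8},
}


def rack_totals(spec: dict, gpus: int = GPUS_PER_RACK) -> dict:
    """Scale per-GPU specs to a full rack, converting to larger units."""
    return {
        "fp4_eflops": spec["fp4_pflops"] * gpus / 1000,  # PFLOPS -> EFLOPS
        "hbm_tb": spec["hbm_gb"] * gpus / 1000,          # GB -> TB
        "hbm_pbps": spec["hbm_tbps"] * gpus / 1000,      # TB/s -> PB/s
        "nvlink_tbps": spec["nvlink_tbps"] * gpus,       # TB/s
    }


if __name__ == "__main__":
    for name, spec in PER_GPU.items():
        totals = rack_totals(spec)
        print(name, {key: round(value, 3) for key, value in totals.items()})
```

For Rubin this gives 3.6 EFLOPS, about 20.7 TB of HBM, about 1.58 PB/s of HBM bandwidth, and about 259 TB/s of NVLink bandwidth, matching the table. For Blackwell the memory and NVLink rows match as well, but 72 × 9 PFLOPS is 648 PFLOPS; the table's ~720 PFLOPS implies roughly 10 PFLOPS per GPU, so the two tables appear to use slightly different per-GPU FP4 figures.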

Cloud Availability

Who's getting Rubin and when

Rubin entered full production January 2026. Volume is constrained to 200K–300K GPUs by TSMC 3nm and HBM4 supply.

Provider	Availability	Details
AWS	H2 2026	1M+ NVIDIA GPUs (Blackwell + Rubin)
Oracle OCI	H2 2026	Rubin NVL72 Superclusters, 17+ zettaFLOPS
Azure	H2 2026	Rubin NVL72 in Fairwater AI superfactories
Google Cloud	H2 2026	Rubin deployments confirmed
CoreWeave	H2 2026	10x token throughput vs Blackwell
Lambda Labs	H2 2026	Rubin NVL72 in Superintelligence Cloud
Nebius	2027	Rubin deployments announced
Nscale	2027	Rubin deployments announced

Decision Tool

Should you wait for Rubin?

Answer three questions. We'll give you a concrete recommendation.

1. What's your primary workload?

2. When do you need compute?

3. Budget sensitivity?
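
The three questions can be read as inputs to a simple rule. The sketch below is one possible encoding, not the logic of the actual tool: the six-month threshold and the "Blackwell is available now" fallback come from the guidance on this page, while the workload labels, the function name, and the way budget sensitivity breaks ties are assumptions made for illustration.

```python
WAIT_THRESHOLD_MONTHS = 6  # from the page's guidance: "if you can wait 6 months"


def recommend(workload: str, months_until_needed: int, budget_sensitive: bool) -> str:
    """Return a rough recommendation from the three answers.

    workload is one of "inference", "moe_training", or "other"
    (hypothetical labels; the real tool's options are not shown here).
    """
    # Compute needed soon: Blackwell is shipping and available today.
    if months_until_needed < WAIT_THRESHOLD_MONTHS:
        return "Use Blackwell now: it is shipping and available."
    # Able to wait: pick the NVIDIA claim most relevant to the workload.
    if workload == "inference":
        return "Wait for Rubin: NVIDIA claims up to 10x lower token cost."
    if workload == "moe_training":
        return "Wait for Rubin: NVIDIA claims 4x fewer GPUs for MoE training."
    # Assumption: for other workloads, budget sensitivity decides.
    if budget_sensitive:
        return "Wait for Rubin: pricing pressure is likely to favor waiting."
    return "Either works: Blackwell now is reasonable if budget is flexible."


if __name__ == "__main__":
    print(recommend("inference", months_until_needed=9, budget_sensitive=True))
    print(recommend("moe_training", months_until_needed=1, budget_sensitive=False))
```

The timing question is checked first because availability is a hard constraint, whereas workload and budget only affect how much the wait is worth.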

Available Now

Blackwell & Hopper: what you can rent today

While you wait for Rubin, these are the cheapest Blackwell and Hopper instances across our tracked providers.

GPU	Provider	VRAM	Gen	Price/hr
RTX 5070	Vast.ai	12 GB	Blackwell	$0.0067
RTX 5070	Vast.ai	12 GB	Blackwell	$0.0400
RTX 5060 Ti	Vast.ai	16 GB	Blackwell	$0.0533
RTX 5070	Vast.ai	12 GB	Blackwell	$0.0773
RTX 5090	Vast.ai	32 GB	Blackwell	$0.0940
RTX 5090	Vast.ai	32 GB	Blackwell	$0.100
RTX 5080	Vast.ai	16 GB	Blackwell	$0.107
2× RTX 5070	Vast.ai	24 GB	Blackwell	$0.107

View all 5,213 instances

Track the Rubin rollout with us

We'll add Rubin instances to our tracker as soon as cloud providers launch them. Check back or read our deep-dive article for the full GTC 2026 breakdown.

