Elon Web Services: What SpaceX's $15B Anthropic Deal Means for Cloud GPU Pricing

The Deal at a Glance

$1.25B

Monthly payment

~$15B

Annual run rate

May 2029

Contract end

90-day notice

Cancellation

SpaceX's S-1 filing — the one filed ahead of its 2026 IPO — quietly disclosed something that reshapes the cloud GPU map. Anthropic has signed a cloud services agreement to pay SpaceX $1.25 billion per month through May 2029. That's a roughly $15 billion per year compute commitment from one customer, on one contract, to a company that doesn't appear in any GPU pricing comparison most buyers run today.

The filing language is unusually direct. SpaceX writes that the structure "allows us to monetize unused compute capacity in our infrastructure while still permitting a reallocation of the capacity for our own internal initiatives if needed in the future." That is the Amazon Web Services playbook, almost word for word, repurposed for the AI era. Investors have already nicknamed it Elon Web Services (EWS).

We track 5,000+ live GPU instances across 54 cloud providers, and the EWS contract changes our read on H100, H200, and B200 inference pricing for the next 24 months. Here's the deal decoded — and what it means for anyone renting cloud GPUs in 2026.

What the S-1 Actually Says

The Anthropic disclosure sits inside SpaceX's related-party and material-contracts section. The exact terms:

Cloud services agreement signed May 2026.
Anthropic pays $1.25 billion per month through May 2029.
Capacity ramps through May and June 2026 at a reduced introductory fee.
Either party can terminate on 90 days' notice.
Anthropic retains full ownership of its content, models, and data.
The same filing states SpaceX is "in discussions with other companies to do the same" — meaning Anthropic is the first announced customer, not the only intended one.

Three things matter here. First, $15B/yr from a single inference customer is roughly the same scale as Microsoft's entire 2024 Azure-OpenAI infrastructure commitment. Second, the 90-day exit clause means this is rented capacity, not a colocation deal — Anthropic doesn't own the hardware. Third, the explicit "reallocate for our own initiatives" carve-out tells you SpaceX considers internal AI training (xAI's Grok) the primary user, with external customers absorbing surplus.

$1.25B/Month Translated Into GPUs

The most useful exercise for anyone shopping cloud GPUs is to convert that monthly check into actual hardware. Using the live median H100 SXM price across the 13 providers we track ($2.49/hr on-demand as of this writing), the math:

Metric	Value	Notes
Monthly payment	$1,250,000,000	Per S-1 disclosure
H100 SXM median (market)	$2.49/hr	13 providers we track
GPU-hours purchased / month	~502 million	$1.25B ÷ $2.49
Equivalent H100s running 24/7	~690,000 GPUs	502M ÷ 730 hours
At $0.80/hr (cheapest tracked)	~2.1 million GPUs	Hypothetical floor
At $7.97/hr (most expensive)	~215,000 GPUs	Hyperscaler ceiling

Anthropic almost certainly isn't paying $2.49/hr. At hyperscale, custom contracts land somewhere between the cheapest H100 spot rate (~$0.80/hr on Verda, Vast.ai) and the H100 SXM cluster rates that CoreWeave and Lambda Labs quote enterprise customers ($1.80–$2.50/hr fully loaded). If you assume Anthropic's effective rate is in the $1.50–$2.00/hr range — reasonable for a 3-year, $45B commitment — that's somewhere between 850,000 and 1.1 million H100-equivalent GPUs running continuously for the duration of the contract.

For context: NVIDIA's total estimated 2024 H100 production was ~2 million units. The EWS-to-Anthropic pipe alone represents roughly half of one year's worldwide H100 output, dedicated to one inference customer for three years. That is the scale that justifies the "new hyperscaler" framing.

The Elon Web Services Playbook

Amazon didn't set out to build AWS. Bezos's team built excess data center capacity to run amazon.com, found most of it sitting idle on weekdays, and rented the surplus. By 2024, AWS generated more operating income than Amazon's retail business combined. The 2026 SpaceX filing reads like the same memo, re-pointed at AI:

Stage	Amazon (2003–2010)	SpaceX / xAI (2024–2029)
Initial buildout	Retail data centers for amazon.com	Colossus cluster (Memphis) for Grok training
First excess capacity	CPU/storage idle on weekends	H100 racks not saturated by xAI workloads
First external customer	Internal Amazon teams, then SmugMug	Anthropic ($1.25B/month)
Pricing model	Per-second EC2 instances	Multi-year fixed compute commitments
Sister-business funding	Retail margin	Starlink ($11.4B revenue, $4.4B operating income in 2025)
Endgame	AWS = ~$120B revenue in 2024	"AI compute as a service" at planetary scale

The structural similarity that makes this work: Starlink is to SpaceX what amazon.com was to AWS — a high-margin subscription business that throws off enough cash to fund the speculative compute buildout, and shields the AI bet from quarterly earnings pressure. Starlink's 10.3 million subscribers (up from 5M in 2024) cover the rocket and data-center capex while the AI line scales into break-even.

Why Anthropic Chose Rent Over Build

Anthropic could have spent $45B building its own data centers over three years. It did the math and chose to rent. Three reasons that should matter to anyone deciding between long-term commit pricing and on-demand:

Power. SpaceX already controls grid-connected capacity that would take 24–36 months to permit elsewhere. The Colossus expansion in Memphis added 150MW in 2025 on infrastructure SpaceX had pre-staged. New AI data centers in 2026 are bottlenecked on substations, not silicon.
Hardware mix. The first Colossus cluster was assembled from a heterogeneous mix of H100, H200, and AMD MI300X units — fine for inference, suboptimal for clean frontier-model training. xAI uses the newer Blackwell-class racks ("Macrohard" and "Macroharder") for Grok training and rents the older, mixed inventory to Anthropic. Both parties get what they need from the same building.
Time-to-deploy. Anthropic gets capacity in 30 days, not 30 months. For an inference workload growing 10× year-over-year, that's worth a 20–30% premium over self-build economics.

This is the same reasoning that drives most teams in our pricing data to choose 1-month or 6-month commitments over 3-year reserved instances. Optionality is worth more than the headline discount when your workload trajectory is uncertain.

What EWS Does to Cloud GPU Pricing

A million H100-equivalents taken off the spot market — even gradually, even with 90-day exits — is a measurable supply shock. Here's what we expect to show up in the data over the next 18 months, based on the existing dataset:

Segment	Current price band	Expected direction
H100 SXM (specialty clouds)	$1.49–$2.49/hr	Flat-to-down: more H100 supply released as hyperscalers rebalance toward Rubin
H100 SXM (hyperscalers)	$4.50–$7.97/hr	Down 10–20%: enterprise rates compress as alternatives mature
H100 spot	$0.80–$1.50/hr	Volatile: large bulk contracts pull spot inventory; prices spike on demand events
H200 / B200	$1.67–$3.50/hr	Stable: Anthropic-style deals lock newer hardware away from spot
L40S / RTX 6000 Ada	$0.79–$2.49/hr	Down: inference workloads migrate up-stack, leaving mid-tier soft
A100 80GB	$0.99–$1.79/hr	Down: oldest commercial AI chip, replaced fastest in big-contract clusters

Counterintuitive read: the EWS deal is bullish for small-buyer pricing in 2026, not bearish. When a hyperscaler-class buyer absorbs 500K–1M GPUs of mid-generation supply, it forces the rest of the inventory pyramid down a tier. CoreWeave and Lambda push customers toward H200/B200 to maintain margin; specialty clouds drop H100 spot pricing to fill empty racks. Indie builders and small startups should benefit.

Who Else Could Become a Hyperscaler This Way

The EWS pattern — internal AI buildout + sister-business cash flow + rent the surplus — has a small but specific set of candidates. The required ingredients: a multi-billion-dollar non-AI revenue line, an existing data center footprint or land bank, and an AI workload that doesn't fully saturate planned capacity. Our shortlist:

Company	Funding business	Likelihood
SpaceX / xAI (confirmed)	Starlink ($11.4B/yr)	Live
Meta	Ads ($150B+/yr)	High — Llama API already public, infrastructure overflow likely
Tesla (energy / Dojo)	Vehicles + Megapack	Medium — Dojo positioned for FSD, capacity could be sold
Oracle (with NVIDIA)	Database + cloud	High — already running OCI Rubin Superclusters
Apple	Devices ($380B/yr)	Low — strategic preference for first-party Private Cloud Compute
ByteDance	TikTok ads	Medium — US/EU export controls limit reach

Every one of these companies represents potential new H100/H200/B200 supply that doesn't exist on any GPU pricing site today. If even two of them follow SpaceX into "AI compute as a service" by 2027, the number of meaningful providers in this market roughly doubles — and the spot/on-demand price spread compresses materially.

Should You Lock in a Long-Term Contract Now?

This is the question most teams should be asking after the filing dropped. Our pricing data points to a clear answer: no — keep optionality through end of 2026. Three reasons:

Hopper supply expansion is still arriving. The EWS deal is one of several large hyperscaler rebalancings happening through 2026 as Rubin (NVL72) racks come online in H2. The same H100s being sold to Anthropic today are being released from frontier training clusters at competing labs. Expect 2027 H100 on-demand pricing to be 25–40% below today's median.
Rubin will reset the floor. NVIDIA's claimed 10× token-cost reduction on Rubin doesn't mean H100 becomes obsolete — it means H100 becomes the "value tier" the way A100 is today. Long contracts at 2026 prices look expensive in 2027.
90-day exits are the new standard. The fact that even a $45B contract between SpaceX and Anthropic includes a 90-day notice clause tells you something. Buyers with leverage are no longer signing 3-year fixed deals. If a $1.25B/month customer keeps optionality, a $5K/month customer should too.

Practical translation: for any inference workload under $50K/month, stay on monthly or 6-month commitments. Use our live pricing table to rebid every quarter — the median H100 cluster price moved by 18% in the last two quarters alone.

The Orbital Compute Footnote

The S-1 also discloses something further out: SpaceX has filed for permission to launch up to one million orbital data center satellites, with first deployments targeted for 2028. The pitch — unlimited solar power, free cooling from the vacuum of space, and the only company on Earth that can put that much mass in orbit cheaply — has been kicking around for years. The filing puts it in writing for the first time.

For GPU pricing in 2026? It doesn't matter. For GPU pricing in 2029? It might matter a lot. NVIDIA already announced the Vera Rubin Space-1 module at GTC 2026 — a radiation-tolerant Rubin-derived compute unit purpose-built for orbital deployment, delivering roughly 25× the AI compute of an H100. The supply chain for "compute that isn't on Earth" is now a real category, with at least one anchor customer (SpaceX itself) and a stated 2028 launch window.

The Bottom Line for GPU Buyers

The five takeaways that matter if you're renting cloud GPUs in 2026:

A new hyperscaler exists. SpaceX/xAI is now in the same tier as AWS, GCP, Azure, and Oracle for raw AI compute output. It just doesn't sell to you directly yet.
Spot supply gets weirder. When 1M H100-equivalents move on a single contract, day-to-day spot pricing becomes more volatile. Build that into capacity planning.
Hopper prices keep falling. The EWS deal accelerates the H100 → H200/B200 → Rubin migration cycle. Lock in 2026 inference on H100; plan 2027 on Blackwell.
Optionality beats commitment. If a $45B buyer keeps a 90-day exit clause, you should too. Re-shop your H100 contracts quarterly.
Expect EWS competitors. Meta, Tesla, Oracle, and maybe ByteDance are the most likely next entrants to "AI compute as a service." Add new providers to your shortlist every six months.

The deeper story is that "GPU cloud pricing" is becoming a moving target the way oil pricing did in the 1980s — a market where weekly supply announcements from a handful of producers reset the curve for everyone downstream. The cheapest H100 today is on Verda at $0.80/hr. By next quarter, the cheapest H100 may live somewhere none of us have heard of yet, on infrastructure originally built to train someone else's model. That's the EWS pattern, repeating.

Track Cloud GPU Pricing as the EWS Pattern Plays Out

H100 prices →H200 prices →B200 prices →Cheapest GPU now →All providers →Price trends →