Skip to main content
analysistrendsinference

Elon Web Services: What SpaceX's $15B Anthropic Deal Means for Cloud GPU Pricing

SpaceX's IPO filing reveals a $1.25B/month compute deal with Anthropic — a new AI hyperscaler hiding inside a rocket company. Here's the GPU economics, the AWS playbook parallel, and what it does to H100 and H200 rental prices.

May 25, 202614 min read
The Deal at a Glance
$1.25B
Monthly payment
~$15B
Annual run rate
May 2029
Contract end
90-day notice
Cancellation

SpaceX's S-1 filing — the one filed ahead of its 2026 IPO — quietly disclosed something that reshapes the cloud GPU map. Anthropic has signed a cloud services agreement to pay SpaceX $1.25 billion per month through May 2029. That's a roughly $15 billion per year compute commitment from one customer, on one contract, to a company that doesn't appear in any GPU pricing comparison most buyers run today.

The filing language is unusually direct. SpaceX writes that the structure "allows us to monetize unused compute capacity in our infrastructure while still permitting a reallocation of the capacity for our own internal initiatives if needed in the future." That is the Amazon Web Services playbook, almost word for word, repurposed for the AI era. Investors have already nicknamed it Elon Web Services (EWS).

We track 5,000+ live GPU instances across 54 cloud providers, and the EWS contract changes our read on H100, H200, and B200 inference pricing for the next 24 months. Here's the deal decoded — and what it means for anyone renting cloud GPUs in 2026.

What the S-1 Actually Says

The Anthropic disclosure sits inside SpaceX's related-party and material-contracts section. The exact terms:

  • Cloud services agreement signed May 2026.
  • Anthropic pays $1.25 billion per month through May 2029.
  • Capacity ramps through May and June 2026 at a reduced introductory fee.
  • Either party can terminate on 90 days' notice.
  • Anthropic retains full ownership of its content, models, and data.
  • The same filing states SpaceX is "in discussions with other companies to do the same" — meaning Anthropic is the first announced customer, not the only intended one.

Three things matter here. First, $15B/yr from a single inference customer is roughly the same scale as Microsoft's entire 2024 Azure-OpenAI infrastructure commitment. Second, the 90-day exit clause means this is rented capacity, not a colocation deal — Anthropic doesn't own the hardware. Third, the explicit "reallocate for our own initiatives" carve-out tells you SpaceX considers internal AI training (xAI's Grok) the primary user, with external customers absorbing surplus.

$1.25B/Month Translated Into GPUs

The most useful exercise for anyone shopping cloud GPUs is to convert that monthly check into actual hardware. Using the live median H100 SXM price across the 13 providers we track ($2.49/hr on-demand as of this writing), the math:

MetricValueNotes
Monthly payment$1,250,000,000Per S-1 disclosure
H100 SXM median (market)$2.49/hr13 providers we track
GPU-hours purchased / month~502 million$1.25B ÷ $2.49
Equivalent H100s running 24/7~690,000 GPUs502M ÷ 730 hours
At $0.80/hr (cheapest tracked)~2.1 million GPUsHypothetical floor
At $7.97/hr (most expensive)~215,000 GPUsHyperscaler ceiling

Anthropic almost certainly isn't paying $2.49/hr. At hyperscale, custom contracts land somewhere between the cheapest H100 spot rate (~$0.80/hr on Verda, Vast.ai) and the H100 SXM cluster rates that CoreWeave and Lambda Labs quote enterprise customers ($1.80–$2.50/hr fully loaded). If you assume Anthropic's effective rate is in the $1.50–$2.00/hr range — reasonable for a 3-year, $45B commitment — that's somewhere between 850,000 and 1.1 million H100-equivalent GPUs running continuously for the duration of the contract.

For context: NVIDIA's total estimated 2024 H100 production was ~2 million units. The EWS-to-Anthropic pipe alone represents roughly half of one year's worldwide H100 output, dedicated to one inference customer for three years. That is the scale that justifies the "new hyperscaler" framing.

The Elon Web Services Playbook

Amazon didn't set out to build AWS. Bezos's team built excess data center capacity to run amazon.com, found most of it sitting idle on weekdays, and rented the surplus. By 2024, AWS generated more operating income than Amazon's retail business combined. The 2026 SpaceX filing reads like the same memo, re-pointed at AI:

StageAmazon (2003–2010)SpaceX / xAI (2024–2029)
Initial buildoutRetail data centers for amazon.comColossus cluster (Memphis) for Grok training
First excess capacityCPU/storage idle on weekendsH100 racks not saturated by xAI workloads
First external customerInternal Amazon teams, then SmugMugAnthropic ($1.25B/month)
Pricing modelPer-second EC2 instancesMulti-year fixed compute commitments
Sister-business fundingRetail marginStarlink ($11.4B revenue, $4.4B operating income in 2025)
EndgameAWS = ~$120B revenue in 2024"AI compute as a service" at planetary scale

The structural similarity that makes this work: Starlink is to SpaceX what amazon.com was to AWS — a high-margin subscription business that throws off enough cash to fund the speculative compute buildout, and shields the AI bet from quarterly earnings pressure. Starlink's 10.3 million subscribers (up from 5M in 2024) cover the rocket and data-center capex while the AI line scales into break-even.

Why Anthropic Chose Rent Over Build

Anthropic could have spent $45B building its own data centers over three years. It did the math and chose to rent. Three reasons that should matter to anyone deciding between long-term commit pricing and on-demand:

  1. Power. SpaceX already controls grid-connected capacity that would take 24–36 months to permit elsewhere. The Colossus expansion in Memphis added 150MW in 2025 on infrastructure SpaceX had pre-staged. New AI data centers in 2026 are bottlenecked on substations, not silicon.
  2. Hardware mix. The first Colossus cluster was assembled from a heterogeneous mix of H100, H200, and AMD MI300X units — fine for inference, suboptimal for clean frontier-model training. xAI uses the newer Blackwell-class racks ("Macrohard" and "Macroharder") for Grok training and rents the older, mixed inventory to Anthropic. Both parties get what they need from the same building.
  3. Time-to-deploy. Anthropic gets capacity in 30 days, not 30 months. For an inference workload growing 10× year-over-year, that's worth a 20–30% premium over self-build economics.

This is the same reasoning that drives most teams in our pricing data to choose 1-month or 6-month commitments over 3-year reserved instances. Optionality is worth more than the headline discount when your workload trajectory is uncertain.

What EWS Does to Cloud GPU Pricing

A million H100-equivalents taken off the spot market — even gradually, even with 90-day exits — is a measurable supply shock. Here's what we expect to show up in the data over the next 18 months, based on the existing dataset:

SegmentCurrent price bandExpected direction
H100 SXM (specialty clouds)$1.49–$2.49/hrFlat-to-down: more H100 supply released as hyperscalers rebalance toward Rubin
H100 SXM (hyperscalers)$4.50–$7.97/hrDown 10–20%: enterprise rates compress as alternatives mature
H100 spot$0.80–$1.50/hrVolatile: large bulk contracts pull spot inventory; prices spike on demand events
H200 / B200$1.67–$3.50/hrStable: Anthropic-style deals lock newer hardware away from spot
L40S / RTX 6000 Ada$0.79–$2.49/hrDown: inference workloads migrate up-stack, leaving mid-tier soft
A100 80GB$0.99–$1.79/hrDown: oldest commercial AI chip, replaced fastest in big-contract clusters

Counterintuitive read: the EWS deal is bullish for small-buyer pricing in 2026, not bearish. When a hyperscaler-class buyer absorbs 500K–1M GPUs of mid-generation supply, it forces the rest of the inventory pyramid down a tier. CoreWeave and Lambda push customers toward H200/B200 to maintain margin; specialty clouds drop H100 spot pricing to fill empty racks. Indie builders and small startups should benefit.

Who Else Could Become a Hyperscaler This Way

The EWS pattern — internal AI buildout + sister-business cash flow + rent the surplus — has a small but specific set of candidates. The required ingredients: a multi-billion-dollar non-AI revenue line, an existing data center footprint or land bank, and an AI workload that doesn't fully saturate planned capacity. Our shortlist:

CompanyFunding businessLikelihood
SpaceX / xAI (confirmed)Starlink ($11.4B/yr)Live
MetaAds ($150B+/yr)High — Llama API already public, infrastructure overflow likely
Tesla (energy / Dojo)Vehicles + MegapackMedium — Dojo positioned for FSD, capacity could be sold
Oracle (with NVIDIA)Database + cloudHigh — already running OCI Rubin Superclusters
AppleDevices ($380B/yr)Low — strategic preference for first-party Private Cloud Compute
ByteDanceTikTok adsMedium — US/EU export controls limit reach

Every one of these companies represents potential new H100/H200/B200 supply that doesn't exist on any GPU pricing site today. If even two of them follow SpaceX into "AI compute as a service" by 2027, the number of meaningful providers in this market roughly doubles — and the spot/on-demand price spread compresses materially.

Should You Lock in a Long-Term Contract Now?

This is the question most teams should be asking after the filing dropped. Our pricing data points to a clear answer: no — keep optionality through end of 2026. Three reasons:

  1. Hopper supply expansion is still arriving. The EWS deal is one of several large hyperscaler rebalancings happening through 2026 as Rubin (NVL72) racks come online in H2. The same H100s being sold to Anthropic today are being released from frontier training clusters at competing labs. Expect 2027 H100 on-demand pricing to be 25–40% below today's median.
  2. Rubin will reset the floor. NVIDIA's claimed 10× token-cost reduction on Rubin doesn't mean H100 becomes obsolete — it means H100 becomes the "value tier" the way A100 is today. Long contracts at 2026 prices look expensive in 2027.
  3. 90-day exits are the new standard. The fact that even a $45B contract between SpaceX and Anthropic includes a 90-day notice clause tells you something. Buyers with leverage are no longer signing 3-year fixed deals. If a $1.25B/month customer keeps optionality, a $5K/month customer should too.

Practical translation: for any inference workload under $50K/month, stay on monthly or 6-month commitments. Use our live pricing table to rebid every quarter — the median H100 cluster price moved by 18% in the last two quarters alone.

The Orbital Compute Footnote

The S-1 also discloses something further out: SpaceX has filed for permission to launch up to one million orbital data center satellites, with first deployments targeted for 2028. The pitch — unlimited solar power, free cooling from the vacuum of space, and the only company on Earth that can put that much mass in orbit cheaply — has been kicking around for years. The filing puts it in writing for the first time.

For GPU pricing in 2026? It doesn't matter. For GPU pricing in 2029? It might matter a lot. NVIDIA already announced the Vera Rubin Space-1 module at GTC 2026 — a radiation-tolerant Rubin-derived compute unit purpose-built for orbital deployment, delivering roughly 25× the AI compute of an H100. The supply chain for "compute that isn't on Earth" is now a real category, with at least one anchor customer (SpaceX itself) and a stated 2028 launch window.

The Bottom Line for GPU Buyers

The five takeaways that matter if you're renting cloud GPUs in 2026:

  • A new hyperscaler exists. SpaceX/xAI is now in the same tier as AWS, GCP, Azure, and Oracle for raw AI compute output. It just doesn't sell to you directly yet.
  • Spot supply gets weirder. When 1M H100-equivalents move on a single contract, day-to-day spot pricing becomes more volatile. Build that into capacity planning.
  • Hopper prices keep falling. The EWS deal accelerates the H100 → H200/B200 → Rubin migration cycle. Lock in 2026 inference on H100; plan 2027 on Blackwell.
  • Optionality beats commitment. If a $45B buyer keeps a 90-day exit clause, you should too. Re-shop your H100 contracts quarterly.
  • Expect EWS competitors. Meta, Tesla, Oracle, and maybe ByteDance are the most likely next entrants to "AI compute as a service." Add new providers to your shortlist every six months.

The deeper story is that "GPU cloud pricing" is becoming a moving target the way oil pricing did in the 1980s — a market where weekly supply announcements from a handful of producers reset the curve for everyone downstream. The cheapest H100 today is on Verda at $0.80/hr. By next quarter, the cheapest H100 may live somewhere none of us have heard of yet, on infrastructure originally built to train someone else's model. That's the EWS pattern, repeating.

Stay ahead on GPU pricing

Get weekly GPU price reports, new hardware analysis, and cost optimization tips. Join engineers and researchers who save thousands on cloud compute.

No spam. Unsubscribe anytime. We respect your inbox.

Find the cheapest GPU for your workload

Compare real-time prices across tracked cloud providers and marketplaces with 5,000+ instances. Updated every 6 hours.

Compare GPU Prices →

Related Articles