VC
Value Add VC
⚑HomePulse⚑Helpful AppsπŸ“Blog
Home/Blog/Amazon AWS AI Capex 2025: $105B Total, ~$80B to AI, and the Custom Chip Strategy Explained
AI & TechnologyJune 22, 2026Β·10 min readΒ·Last updated: June 22, 2026

Amazon AWS AI Capex 2025: $105B Total, ~$80B to AI, and the Custom Chip Strategy Explained

Amazon guided to roughly $105B in 2025 capital expenditure β€” the most of any company on earth. Here is where the money goes, why most of it is AWS AI infrastructure, and why Amazon designs its own Trainium and Inferentia chips instead of just writing checks to Nvidia.

TC
Trace Cohen
Co-Founder & GP at Six Point Ventures Β· 3x founder (BrandYourself, Launch.it, SPOT) Β· 65+ investments Β· Based in Boca Raton, FL
@Trace_CohenΒ·t@nyvp.comΒ·South Florida Advisory

Quick Answer

$105B in 2025 capital expenditure is Amazon's guidance, the highest of any company globally, with the large majority directed at AWS AI infrastructure β€” data centers, networking, and custom Trainium2 and Inferentia chips. AWS runs roughly a 30–38% operating margin, so this capex is funded by the most profitable cloud business in the world rather than by external debt.

Amazon guided to roughly $105B in 2025 capital expenditure β€” the largest of any company on earth β€” and the majority of it goes to AWS AI infrastructure.

That's the short answer. The longer answer is more interesting, because Amazon is not just buying GPUs and pouring concrete. It is running a two-track bet: rent Nvidia's best chips to keep customers happy today, while designing its own Trainium and Inferentia silicon to own the cost curve tomorrow. The capex number is the headline. The chip strategy is the actual story.

Amazon AWS AI capex 2025: the numbers

Amazon's AWS AI capex in 2025 totals roughly $105B in companywide capital expenditure, the highest of any hyperscaler, with the large majority directed at AWS data centers, networking, and AI compute. That is up from about $83B in 2024 and roughly $48B in 2023 β€” a near doubling in two years, driven almost entirely by generative AI demand rather than retail fulfillment, which was the historical driver of Amazon's capex line.

YearTotal CapexYoY ChangePrimary Driver
2021~$55Bβ€”Retail fulfillment + logistics
2022~$59B+7%Fulfillment, early AWS buildout
2023~$48Bβˆ’19%Capex discipline, retail pullback
2024~$83B+73%AWS + generative AI infrastructure
2025E~$105B+27%AWS AI: chips, data centers, power
2026E$115B++10%Continued AI capacity expansion

Figures are approximate, blending reported actuals and management guidance. The shape matters more than the decimal: capex roughly doubled from 2023 to 2025, and the marginal dollar is now AI, not fulfillment. Track the full hyperscaler picture on the AI Spending Dashboard.

Where the ~$80B in AI capex actually goes

When Amazon says "the majority" of $105B supports AWS, the AI-attributable slice is roughly $75–85B. It breaks down across four buckets, and only one of them is the GPUs everyone talks about.

AI compute (GPUs + custom silicon)

~45%

Nvidia H100/H200 plus Amazon's own Trainium2 and Inferentia2 β€” the single largest line

Data center shells + power

~30%

New regions, multi-gigawatt sites, and long-lead electrical infrastructure

Networking + interconnect

~15%

High-bandwidth fabric to cluster hundreds of thousands of accelerators

Storage + general AWS capacity

~10%

Object storage, databases, and non-AI cloud growth

The reason power keeps showing up in capex calls is that GPUs are useless without electricity to run them and water or air to cool them. A single modern AI data center campus can draw 500MW to 1GW β€” enough for a mid-size city. Amazon has signed nuclear, solar, and grid deals to lock in that power, including a multi-year agreement tied to a Pennsylvania nuclear plant. The constraint on AI in 2026 is increasingly megawatts, not chips.

The custom chip strategy: why Amazon builds Trainium and Inferentia

Here is the part of Amazon's AWS AI capex that separates it from a company simply buying Nvidia. Amazon designs two families of custom silicon: Trainium for training and Inferentia for inference. The logic is brutal and simple β€” Nvidia's data center GPUs carry gross margins north of 70%, so every dollar AWS spends on an H200 is roughly 70 cents of Nvidia profit. Designing your own chip claws that margin back.

ChipPurposeStatus (2026)Claimed advantage
Trainium2Model trainingVolume production30–40% better price-performance vs comparable GPU
Trainium3Model trainingAnnounced / ramping~2x performance, ~40% more efficient than Trn2
Inferentia2Inference servingGenerally availableUp to 40% better price-performance for inference
Graviton4General CPU computeGenerally availablePowers non-GPU AWS workloads at lower cost
Nvidia H200Training + inferencePurchased at scaleIndustry-standard, CUDA ecosystem
Nvidia GB200Frontier trainingDeployed in clustersHighest raw performance, premium price

The strategic point is not that Trainium beats Nvidia on raw performance β€” it usually doesn't. The point is total cost of ownership. If AWS can serve a customer's inference workload on Inferentia at 60% of the cost of an Nvidia instance, AWS either keeps that margin or passes the savings on to win the deal. Amazon doesn't need Trainium to be the fastest chip; it needs it to be good enough at a structurally lower cost, and to give AWS leverage in every Nvidia negotiation.

The Anthropic deal and Project Rainier: Amazon's anchor customer

A custom chip is worthless without a flagship customer willing to bet a frontier model on it. That customer is Anthropic. Amazon has committed roughly $8B to Anthropic across two tranches, making AWS Anthropic's primary training and cloud partner. In exchange, Anthropic trains on Trainium at massive scale through Project Rainier β€” a cluster reported to involve several hundred thousand Trainium2 chips, one of the largest AI training deployments in the world.

This is a circular-economy bet that the whole industry now runs on: Amazon invests in Anthropic, Anthropic spends that capital on AWS compute, and AWS validates its custom silicon against a real frontier workload. Critics call it round-tripping revenue. Amazon calls it anchoring demand. Both are true. What matters is that Trainium now has a customer that forces it to actually work β€” and Anthropic's Claude models, available through Amazon Bedrock, give AWS enterprise customers a top-tier model without building one in-house.

For more on how the model layer is priced, see our breakdown of AI company valuations and how the largest private AI labs justify their numbers.

Amazon AWS AI capex vs Microsoft, Google, and Meta

Amazon's ~$105B leads the hyperscaler pack in 2025, but the gap is narrower than the headline suggests because the others are catching up fast. Here is how the four largest spenders compare.

Company2025 Capex (est.)Custom AI siliconFlagship AI customer/product
Amazon (AWS)~$105BTrainium, InferentiaAnthropic / Bedrock
Alphabet (Google)~$85BTPU v6/v7Gemini / Vertex AI
Microsoft~$80B+Maia 100OpenAI / Copilot
Meta~$66–72BMTIALlama (internal + open)
Oracle~$25B+None (Nvidia-heavy)OCI / OpenAI compute
Combined top 4$330B+β€”β€”

The common thread: every hyperscaler now designs its own AI chip. Google has TPUs (the most mature, now in their sixth-plus generation), Microsoft has Maia, Meta has MTIA, and Amazon has Trainium. Nvidia still wins the raw-performance crown and the developer ecosystem, but the buyers are systematically building escape hatches. See the side-by-side in our Big Tech Earnings Dashboard.

The bull and bear case on Amazon's AI capex

Bull Case

  • βœ“ AWS runs a ~30–38% operating margin and funds capex from cash flow
  • βœ“ Trainium/Inferentia claw back Nvidia's 70%+ margins on custom workloads
  • βœ“ The $8B Anthropic deal anchors real frontier-model demand
  • βœ“ AWS is the #1 cloud, so AI demand lands on existing distribution
  • βœ“ Owning the chip stack lowers long-run cost per token structurally

Bear Case

  • βœ• $105B in capex means rising depreciation pressuring AWS margins by 2027
  • βœ• Trainium adoption outside Anthropic is still unproven at scale
  • βœ• Nvidia's CUDA lock-in makes custom silicon a hard sell to customers
  • βœ• The Anthropic loop risks looking like circular revenue if growth slows
  • βœ• Power and grid constraints could strand capacity that's already paid for

The $105B capex number is the headline. The chip is the strategy.

Amazon isn't just renting the AI boom from Nvidia β€” it's spending $105B to own the layer underneath it, one Trainium chip at a time.

Track real-time AI infrastructure and capex data on the AI Spending Dashboard, AI Valuations, and Big Tech Earnings Dashboard at Value Add VC. See also: Microsoft $80B AI Capex and The SoftBank–OpenAI Stargate Deal.

ShareXLinkedInEmail

Frequently Asked Questions

How much is Amazon spending on AI capex in 2025?

Amazon guided to approximately $105B in total 2025 capital expenditure, up from about $83B in 2024 and $48B in 2023. The company has said the majority of this spend supports AWS, and within AWS the largest growth driver is generative AI infrastructure β€” data centers, custom silicon, and networking. That makes Amazon the single largest capex spender among the big tech hyperscalers in 2025.

What is AWS Trainium and how does it compare to Nvidia GPUs?

Trainium is Amazon's custom AI training chip, with Trainium2 now in volume and Trainium3 announced. Amazon claims Trainium2 delivers 30–40% better price-performance than comparable Nvidia GPU instances for many workloads. Amazon still buys large volumes of Nvidia H100 and H200 GPUs, but custom silicon lets it lower cost per token and reduce dependence on a single supplier whose chips can carry 70%+ gross margins.

How does Amazon's AI capex relate to its Anthropic investment?

Amazon has committed roughly $8B to Anthropic across two tranches and is Anthropic's primary cloud and training partner. Anthropic trains models on AWS Trainium chips, including the multi-hundred-thousand-chip cluster known as Project Rainier. The deal anchors demand for Amazon's custom silicon and gives AWS a flagship frontier-model customer to validate its hardware against Nvidia.

Is Amazon's AI capex hurting AWS profit margins?

Not yet meaningfully. AWS posted roughly a 30–38% operating margin through 2025 even as capex surged, and AWS generates the bulk of Amazon's total operating income. Depreciation from the new infrastructure will pressure margins over the next few years, but Amazon argues the spend tracks real customer demand and that under-building would be the bigger risk than over-building.

How does Amazon's 2025 capex compare to Microsoft, Google, and Meta?

Amazon's ~$105B leads the group. Microsoft guided to roughly $80B+ for its fiscal year, Google (Alphabet) to about $85B, and Meta to roughly $66–72B for 2025. Combined, the four hyperscalers are spending well over $300B in a single year on AI infrastructure, and Amazon is the largest single line in that total.

Related Tools & Dashboards

πŸ’ΈAI SpendingπŸ’ΉBig Tech EarningsπŸ€–AI Valuations

Keep Reading

πŸͺŸMicrosoft $80B AI Infrastructure SpendπŸ—οΈThe SoftBank–OpenAI Stargate Deal: $500B in AI InfrastructureπŸ’ΈAI Hyperscaler Capex Compared

Explore 45+ free VC tools, dashboards, and recommended startup software.

Explore DashboardsHelpful Apps & Platforms

Trace Cohen is a serial founder, investor and data geek. Please feel free to reach out t@nyvp.com

VC
Value Add VC
Helpful AppsTwitterContact