Baseten has raised a $1.5 billion Series F at a $13 billion valuation, the single largest venture round of the week, according to Crunchbase News. The round was led by Altimeter Capital and Conviction Partners, with Spark Capital, Sands Capital and Wellington Management joining -- a roster that mixes growth crossover investors with deep-pocketed institutions.
Baseten sells AI inference infrastructure: the systems that take trained models and run them in production at scale, handling the latency, reliability and cost-optimization challenges that come with serving millions of requests. As enterprises move from experimenting with AI to deploying it, inference -- not training -- becomes the recurring, ever-growing compute bill, and the companies that make serving models cheaper and faster are capturing that flow.
“It sits within a broader funding spree dominated by AI infrastructure and applied AI.”
The valuation puts Baseten among the most valuable AI-infrastructure startups outside the chipmakers themselves, a category that has become one of venture's favorite bets precisely because it monetizes AI adoption regardless of which model wins. It competes with a crowded field including Together AI, Fireworks, Modal and the hyperscalers' own inference offerings, where the differentiators are performance, cost-per-token and developer experience.
The timing is telling. Baseten's mega-round landed the same week Groq raised $650 million for its inference-cloud technology -- two large bets on the serving layer in a matter of days, a clear signal that investors see inference demand compounding as agentic AI and always-on applications multiply the number of model calls. It sits within a broader funding spree dominated by AI infrastructure and applied AI.
The bear case is competition and commoditization: inference is a margin-sensitive business where hyperscalers can bundle and undercut, and a $13 billion valuation prices in years of continued hypergrowth. What to watch: Baseten's revenue trajectory and gross margins, whether it can defend pricing as custom silicon and cheaper models proliferate, and how it positions against both startups and the cloud giants racing into the same lane.