Groq Raises $650M to Scale Its AI Inference Cloud and LPU Chips

AI inference chipmaker Groq raised $650 million from investors including Infinitum and Disruptive to scale its LPU-based inference cloud, betting that purpose-built silicon can undercut GPUs on the cost and speed of serving large models. The round is part of a week dominated by bets on the inference layer, from Baseten's $1.5B to AI-networking deals.

$650M

Raised

Infinitum, Disruptive

Investors

LPU inference chips + cloud

Product

Nvidia GPUs

Rival

Trace Cohen

Early-stage VC & angel · Founder, New York Venture Partners

June 26, 2026

2 min read

Groq has raised $650 million, with Infinitum and Disruptive among the investors, to scale its inference cloud and the LPU (language processing unit) chips that power it, according to Crunchbase News. Founded by former Google TPU architect Jonathan Ross, Groq has staked its business on the idea that a chip designed specifically to run large models -- rather than a general-purpose GPU -- can deliver dramatically faster and cheaper token generation.

The bet rides the most important shift in AI economics. Training a model is a one-time, capital-intensive event; serving it is a perpetual, ever-growing cost that scales with usage. As enterprises move from experimenting with AI to deploying it -- and as agentic systems multiply the number of model calls per task -- inference becomes the dominant line item, and whoever makes it cheaper captures that flow. Groq's pitch is that custom silicon wins on the metrics that matter most for production: latency and cost per token.

“Training a model is a one-time, capital-intensive event; serving it is a perpetual, ever-growing cost that scales with usage.”

The round lands in a week thick with inference bets. Baseten raised $1.5 billion at a $13 billion valuation for its inference software, and AI-networking startup Upscale AI hit a $2 billion valuation -- a cluster of capital flowing into the plumbing beneath AI applications. The market is voting that serving models, not training them, is where durable revenue and margin live, and that the layer is big enough to support multiple winners across chips, software and networking.

Groq's challenge is the incumbent. Nvidia dominates both training and inference, with a deep software moat in CUDA and an installed base that is hard to dislodge, and it is racing to optimize its own chips for serving. Other custom-silicon challengers -- from Cerebras to SambaNova to the hyperscalers' in-house accelerators like Google's TPUs and Amazon's Inferentia -- are chasing the same opening. Groq's differentiation rests on its architecture and the developer experience of its cloud.

The bear case is steep: competing on silicon against Nvidia is brutally capital-intensive, requires winning developers away from CUDA, and faces relentless price pressure as cheaper models and custom chips proliferate. What to watch: Groq's deployed capacity and customer adoption, independent benchmarks of its cost-per-token versus GPUs, and whether purpose-built inference silicon can carve out durable share before the giant closes the gap.

Groq Raises $650M to Scale Its AI Inference Cloud and LPU Chips

$650M

Raised

Infinitum, Disruptive

Investors

LPU inference chips + cloud

Product

Nvidia GPUs

Rival

Trace Cohen

Early-stage VC & angel · Founder, New York Venture Partners

June 26, 2026

2 min read

“Training a model is a one-time, capital-intensive event; serving it is a perpetual, ever-growing cost that scales with usage.”

Groq Raises $650M to Scale Its AI Inference Cloud and LPU Chips

Markets Now

Read Next

AppsFlyer Raises $1B Series E With Unity, Meta and Google Backing

General Intuition Raises $320M Series A at $2.3B to Build Spatial-Reasoning AI From Game Video

Defense Startup Stark Raises $569M Led by Founders Fund and Sequoia

Groq Raises $650M to Scale Its AI Inference Cloud and LPU Chips

Markets Now

Read Next

AppsFlyer Raises $1B Series E With Unity, Meta and Google Backing

General Intuition Raises $320M Series A at $2.3B to Build Spatial-Reasoning AI From Game Video

Defense Startup Stark Raises $569M Led by Founders Fund and Sequoia