Etched, the AI chip startup building silicon purpose-designed for transformer inference, disclosed on June 30, 2026 that it has crossed a $5 billion valuation and booked more than $1 billion in customer contracts for its chips and systems, according to TechCrunch. The milestone follows TSMC's successful manufacture of Etched's Sohu chip earlier in the year and caps a dramatic turnaround for a company that was reportedly running month-to-month and struggling to raise capital as recently as 2023.
Etched's product is what it calls 'frontier inference clusters' -- bundles of its Sohu chips with custom racks and software built specifically to run inference for large language models faster, cheaper and with better power efficiency than general-purpose GPUs. The company claims a single 8-chip Sohu server can process roughly 500,000 tokens per second running Meta's Llama 70B, outperforming a 160-GPU H100 cluster while consuming less power and physical space.
The founders' path here is unusual. Two Harvard dropouts and Thiel Fellows wrote a 30-page memo in 2023 arguing that AI inference would eventually demand chips built for transformers specifically, not general-purpose GPUs repurposed for the job -- and every major investor they pitched passed. The company survived close to running out of cash before conviction (and capital) caught up: $800 million raised across several rounds, culminating in December's $500 million round at the same $5 billion post-money mark now validated by real revenue.
โThe competitive landscape for AI inference chips is filling in fast.โ
The competitive landscape for AI inference chips is filling in fast. Nvidia remains dominant through CUDA and its GPU franchise, but Groq has raised $650 million for its own inference-specific chips, and hyperscalers including Google, Amazon and Microsoft are all deploying custom silicon for inference workloads. Etched's bet is that transformer-specific ASICs can beat both general-purpose GPUs and hyperscaler in-house chips on cost-per-token for the specific, enormous workload of running today's dominant model architecture.
$1 billion in disclosed bookings is a meaningfully different signal than a funding round -- it is revenue commitment from real customers, not investor optimism, and it arrives the same week Anthropic and Google are racing to cut inference costs, which only increases demand for cheaper, more efficient inference hardware. For a chip company, that is the best possible market backdrop.
The bear case is architecture risk: betting an entire company on the transformer architecture is dangerous if the industry shifts to a different model paradigm, and Nvidia's software moat (CUDA) and manufacturing scale remain formidable even against superior point-solution hardware. What to watch: named hyperscaler or neocloud customers beyond the disclosed bookings, independent benchmarks validating Etched's performance claims, and whether Etched raises a fresh round to fund the capital-intensive scale-up that turning $1B in orders into delivered systems requires.