Arena, the crowdsourced leaderboard that has become the AI industry's default scoreboard, has reached a $100 million annualized revenue run-rate as of June 29, 2026 -- more than tripling from roughly $30 million in January -- according to TechCrunch. The growth comes from a commercial AI-evaluations service, launched in September 2025, that turns the platform's more than 10 million human model comparisons into paid benchmarking and testing for model labs and enterprises.
Arena's path is a textbook research-to-company story. It began in 2023 as a UC Berkeley project where users compared two anonymous model outputs and voted on the better one, producing the crowd-sourced rankings that labs now cite in launch announcements. It incorporated in April 2025, and its founders include CEO Anastasios Angelopoulos, CTO Wei-Lin Chiang and Databricks co-founder and Berkeley professor Ion Stoica.
“It incorporated in April 2025, and its founders include CEO Anastasios Angelopoulos, CTO Wei-Lin Chiang and Databricks co-founder and Berkeley professor Ion Stoica.”
The business thesis is that as models multiply and every lab claims state-of-the-art, independent, trusted evaluation becomes scarce and valuable infrastructure. Arena raised a $150 million Series A in January 2026 at a $1.7 billion post-money valuation -- bringing total funding to $250 million -- from a deep bench including Andreessen Horowitz, Lightspeed, Kleiner Perkins, Felicis, The House Fund, Laude Ventures and UC Investments.
The competitive landscape spans open benchmarks, internal lab evals, and a wave of AI-testing startups, but Arena's moat is mindshare: its leaderboard is the one developers and labs already point to, giving it distribution and credibility that are hard to manufacture. The consumption-based revenue model means growth tracks real usage rather than seat licenses, which cuts both ways.
The bear case is durability: benchmarks can be gamed, leaderboard prominence can fade, and a consumption model is volatile if a few large labs pull back. There is also the awkward tension of being both the neutral referee and a paid vendor to the teams it ranks. What to watch: whether Arena's evaluations become a procurement standard, how it preserves perceived neutrality, and whether the $100 million run-rate holds as the novelty of public leaderboards matures.