VC
Value Add VC
⚡HomePulse⚡Helpful Apps📝Blog
← Value Add PulseAI

The 'Software Factory' Myth: AI Is Helping Companies Ship Bugs Faster

A widely shared analysis argues that most enterprises adopting AI coding tools to build a 'software factory' are really just shipping bugs faster: AI accelerates code production, but downstream testing, review and CI/CD don't scale with it, so defects and incidents climb. Data cited from Faros AI shows developer throughput up sharply -- but incidents and bugs rising even faster.

+33.7% per developer
Dev Throughput
+16.2%
PR Merge Rate
+242.7%
Incidents-to-PR Ratio
+54%
Bugs per Developer
Faros AI
Data Source
TC
Trace Cohen
Early-stage VC & angel · Founder, New York Venture Partners
June 26, 2026
2 min read
KEY TAKEAWAYS FOR VCs & FOUNDERS
1

The productivity story around AI coding has a quality counter-story that boards are starting to notice

2

If incidents rise faster than output, AI coding can destroy value instead of creating it

3

It creates demand for AI-era testing, review and reliability tooling -- a new picks-and-shovels layer

4

It reframes the real bottleneck from writing code to verifying it

TC
The VC Read · Trace's TakeTrace Cohen

This is the most important contrarian data point in enterprise AI right now: throughput up 34%, but incidents up 243%. AI didn't remove the bottleneck, it moved it -- from writing code to verifying it -- and most orgs haven't reallocated a dollar to the new constraint. That gap is the investable thesis: AI-native testing, review and reliability tooling is the picks-and-shovels layer underneath the coding-assistant boom. Founders pitching 'we make engineers faster' should have an answer for 'faster at what, exactly' -- because boards are about to start asking whether the speed is shipping value or shipping bugs.

🤖 AI Landscape →Enterprise AI Agents →

A widely circulated analysis is puncturing one of enterprise AI's favorite narratives -- the idea that AI coding tools turn engineering organizations into high-output 'software factories.' The argument: AI dramatically speeds up the writing of code, but the downstream parts of the software lifecycle -- testing, code review, deployment safeguards and quality control -- don't automatically scale with it, so the net result for many companies is simply shipping bugs and incidents faster.

The data gives the thesis teeth. Figures cited from Faros AI show task throughput per developer up 33.7% and pull-request merge rate up 16.2% -- real productivity gains. But over the same period, the ratio of incidents to pull requests jumped 242.7% and bugs per developer rose 54%. In other words, the defects are climbing far faster than the output, suggesting that AI-accelerated coding without commensurate investment in quality controls can erode reliability rather than improve it.

“Figures cited from Faros AI show task throughput per developer up 33.7% and pull-request merge rate up 16.2% -- real productivity gains.”

The finding lands amid a broader reckoning over how to actually capture value from generative AI in the enterprise. The early phase was about adoption and raw productivity metrics; the maturing phase is about whether that productivity translates into better, more reliable software or just more churn. It connects to the same discipline behind efficient agent memory and rigorous agent evaluation -- the unglamorous engineering that separates production systems from impressive demos.

The competitive implication is a new market opportunity. If writing code is no longer the bottleneck, verifying it becomes the constraint -- creating demand for AI-native testing, automated code review, observability and reliability tooling. Companies like the agent-evaluation startups drawing fresh venture funding, alongside established devops and quality vendors, are positioned to sell the guardrails that AI-accelerated teams now need. The bottleneck moving from production to verification is itself an investable thesis.

The bear case for the alarm: a single vendor's dataset can be unrepresentative, the quality dip may be a transitional growing pain as teams adapt their processes, and better AI review tools could close the gap. What to watch: whether independent studies corroborate the incident surge, how engineering leaders rebalance investment toward quality controls, and whether 'AI for verifying code' becomes as big a category as 'AI for writing code.'

ShareXLinkedInEmail

Originally reported by VentureBeat. Analysis and editorial commentary by Value Add Pulse.

← Back to Pulse

Markets Now

live
SPCX▲+0.75%
$234.85
CBRS▼-0.92%
$257.40
SPY▲+0.16%
5,961.80
QQQ▲+0.19%
20,098.50
NVDA▼-0.99%
$150.60
MSFT▼-0.52%
$480.10
GOOGL▲+0.57%
$210.30
META▲+0.38%
$657.90

Read Next

AI

DeepSeek Open-Sources DSpark, Claiming 60-85% Faster Generation

DeepSeek released DSpark, a 'semi-parallel' speculative-decoding module for its DeepSeek-V4 Flash and Pro models, and open-sourced DeepSpec, a full codebase for training and evaluating speculative-decoding draft models. The company claims generation runs 60-85% faster on Flash and 57-78% faster on Pro at the same throughput -- a fresh reminder that the open-weight challengers keep narrowing the efficiency gap.

AI

Liquid AI's Tiny LFM2.5-230M Beats Models 4x Its Size and Runs Anywhere

Liquid AI released LFM2.5-230M, its smallest model yet, which the company says outperforms models four times its size at data extraction while being small enough to run 'anywhere' -- including phones, laptops and edge devices. The release advances the counter-narrative to ever-larger models: that efficient, specialized small models can win on the tasks enterprises actually run at scale.

AI

OpenAI's Updated GPT-5.5 Instant Gets Better at Shopping and Complex Constraints

OpenAI shipped an updated GPT-5.5 Instant that is better at shopping, handling complex constraints, and understanding user intent -- and it's already live in the API. The release sharpens OpenAI's fast, low-latency tier for agentic and commerce use cases, even as the company's more powerful new GPT-5.6 family sits behind a government-vetted gate.

@Trace_Cohen·t@nyvp.com