The 'Software Factory' Myth: AI Is Helping Companies Ship Bugs Faster

A widely shared analysis argues that most enterprises adopting AI coding tools to build a 'software factory' are really just shipping bugs faster: AI accelerates code production, but downstream testing, review and CI/CD don't scale with it, so defects and incidents climb. Data cited from Faros AI shows developer throughput up sharply -- but incidents and bugs rising even faster.

+33.7% per developer

Dev Throughput

+16.2%

PR Merge Rate

+242.7%

Incidents-to-PR Ratio

+54%

Bugs per Developer

Faros AI

Data Source

Trace Cohen

Early-stage VC & angel · Founder, New York Venture Partners

June 26, 2026

2 min read

A widely circulated analysis is puncturing one of enterprise AI's favorite narratives -- the idea that AI coding tools turn engineering organizations into high-output 'software factories.' The argument: AI dramatically speeds up the writing of code, but the downstream parts of the software lifecycle -- testing, code review, deployment safeguards and quality control -- don't automatically scale with it, so the net result for many companies is simply shipping bugs and incidents faster.

The data gives the thesis teeth. Figures cited from Faros AI show task throughput per developer up 33.7% and pull-request merge rate up 16.2% -- real productivity gains. But over the same period, the ratio of incidents to pull requests jumped 242.7% and bugs per developer rose 54%. In other words, the defects are climbing far faster than the output, suggesting that AI-accelerated coding without commensurate investment in quality controls can erode reliability rather than improve it.

“Figures cited from Faros AI show task throughput per developer up 33.7% and pull-request merge rate up 16.2% -- real productivity gains.”

The finding lands amid a broader reckoning over how to actually capture value from generative AI in the enterprise. The early phase was about adoption and raw productivity metrics; the maturing phase is about whether that productivity translates into better, more reliable software or just more churn. It connects to the same discipline behind efficient agent memory and rigorous agent evaluation -- the unglamorous engineering that separates production systems from impressive demos.

The competitive implication is a new market opportunity. If writing code is no longer the bottleneck, verifying it becomes the constraint -- creating demand for AI-native testing, automated code review, observability and reliability tooling. Companies like the agent-evaluation startups drawing fresh venture funding, alongside established devops and quality vendors, are positioned to sell the guardrails that AI-accelerated teams now need. The bottleneck moving from production to verification is itself an investable thesis.

The bear case for the alarm: a single vendor's dataset can be unrepresentative, the quality dip may be a transitional growing pain as teams adapt their processes, and better AI review tools could close the gap. What to watch: whether independent studies corroborate the incident surge, how engineering leaders rebalance investment toward quality controls, and whether 'AI for verifying code' becomes as big a category as 'AI for writing code.'

The 'Software Factory' Myth: AI Is Helping Companies Ship Bugs Faster

+33.7% per developer

Dev Throughput

+16.2%

PR Merge Rate

+242.7%

Incidents-to-PR Ratio

+54%

Bugs per Developer

Faros AI

Data Source

Trace Cohen

Early-stage VC & angel · Founder, New York Venture Partners

June 26, 2026

2 min read

“Figures cited from Faros AI show task throughput per developer up 33.7% and pull-request merge rate up 16.2% -- real productivity gains.”

The 'Software Factory' Myth: AI Is Helping Companies Ship Bugs Faster

Markets Now

Read Next

DeepSeek Open-Sources DSpark, Claiming 60-85% Faster Generation

Liquid AI's Tiny LFM2.5-230M Beats Models 4x Its Size and Runs Anywhere

OpenAI's Updated GPT-5.5 Instant Gets Better at Shopping and Complex Constraints

The 'Software Factory' Myth: AI Is Helping Companies Ship Bugs Faster

Markets Now

Read Next

DeepSeek Open-Sources DSpark, Claiming 60-85% Faster Generation

Liquid AI's Tiny LFM2.5-230M Beats Models 4x Its Size and Runs Anywhere

OpenAI's Updated GPT-5.5 Instant Gets Better at Shopping and Complex Constraints