In 2026, you can’t pry AI coding instruments out of builders’ vise grip, researchers have found.
However whereas AI is undoubtedly serving to coders produce code quicker, it will not be producing higher code, different researchers warn. And that would trigger issues down the highway for them.
Particularly, in February 2026, revered AI analysis lab METR published a surprising revelation: Most builders received’t work, even on a restricted variety of duties, with out AI anymore.
METR had hoped to supply an replace to some groundbreaking research published a number of months earlier, in 2025, on AI coding productiveness. In it, researchers measured how a lot time open supply builders took to do duties by hand versus with AI.
Whereas builders in that examine reported that AI was making them extra productive, they have been shocked to be taught it really slowed them down. Positive, it generated code quicker, however then they spent additional time discovering and fixing errors, steering the AI and ready on it to finish duties.
When METR got down to repeat the experiment to measure advances in AI and coder proficiency, they couldn’t.
Devs weren’t prepared to take part “as a result of they don’t want to work with out AI” even only for the examine, the researchers confessed.
As a substitute, METR published a survey in Could that allowed technical staff to self-report their AI productiveness beneficial properties. Not surprisingly, they perceived that AI made them twice as precious to their organizations.
However current headlines about the wild expense of so-called tokenmaxxing, coupled with a smattering of current analysis, make such self-perceptions doubtful.
Tokenmaxxing, or utilizing the variety of tokens an individual makes use of as a proxy for productiveness with AI, has been the development of 2026 to this point. And it could already be over.
Amazon shut down its inside token-tracking leaderboard known as Kirorank after staff have been gaming it by utilizing AI brokers excessively, and operating up prices, the Financial Times reported this week. The staff proved that AI use doesn’t robotically translate to elevated productiveness.
Uber blew by means of its 2026 AI finances throughout the first 4 months of the yr, The Information reported. COO Andrew Macdonald just lately mentioned on a podcast that such spending hadn’t led to a measurable increase in tasks or productiveness.
AI-generated code additionally doesn’t essentially scale back ongoing code upkeep wants and will even improve it, programmer and writer James Shore elegantly argued in a blog post that went viral on Hacker Information.
“You write code twice as fast now? Higher hope you’ve halved your upkeep prices,” he wrote. “In any other case, you’re screwed. You’re buying and selling a short lived pace enhance for everlasting indenture.”
There’s different proof that AI can improve code upkeep woes.
A viral tweet from Aiswarya Sankar, founder and CEO of reliability engineering agent startup Entelligence AI, proclaims that corporations are spending 44% of their tokens on bug fixes that their AI generated. In the meantime, code-reviewing software firm CodeRabbit says it analyzed open supply pull requests and located that AI produced 1.7x extra issues than human code.
These are, admittedly, self-serving stats from these making an attempt to promote AI code reviewing instruments.
But unbiased researchers have additionally discovered such points. Researchers from the revered Singapore Administration College published a report in April warning that “AI-generated code can introduce long-term upkeep prices into actual software program tasks.”
Provided that programmers love their AI assistants, what’s the answer?
Properly, those that need to promote you AI coding brokers say devs can simply use AI coding brokers to do the bone-wearying duties of fixing code as quick as AI spits it out. That’s what Cognition founder and CEO Scott Wu —the maker of AI coding agent Devin — suggests.
However even he admits that, whereas Devin can work independently, he’d presently charge its ability between a junior and mid-level programmer, relying on the duty. This isn’t a hand-it-off and neglect it resolution.
The SMU researchers counsel a extra human method. Programmers ought to know what duties AI does and doesn’t do nicely as deeply as they know their favourite coding languages. They want robust high quality assurance methods designed for AI and they’re caught with rigorously reviewing the AI’s work as if it have been a junior dev.
In the meantime, the researchers say (and Wu agrees), people ought to nonetheless be doing the big-picture work like software program structure and safety design.
While you buy by means of hyperlinks in our articles, we may earn a small commission. This doesn’t have an effect on our editorial independence.









