· AI Activity to Impact · 16 min read
AI Coding Moved The Bottleneck
AI coding tools can make engineering dramatically faster. That does not automatically make the business faster. The constraint often moves to deciding what is worth building, launching safely, getting adoption, and proving impact.
Click image to open full size TL;DR
AI coding tools change the economics of software delivery. A small team can now generate, debug, test, document, and refactor much faster than it could a couple of years ago. That is real. It matters. But it is also where many leaders make the wrong leap: faster engineering output does not automatically mean faster business impact. In many organizations, AI has moved the bottleneck from “can we build and test it?” to “do we know what is worth building, can we launch it safely, will people adopt it, and can we prove it changed anything that matters?”
This is a flow problem, not just an AI tools problem. If leaders only celebrate local engineering productivity, they may create more output, more work in process, more half-launched tools, and more pressure downstream. The leadership job is changing from “how do we make teams faster?” to “where does this new speed actually improve the flow to value?” That means looking at the whole value stream and aiming human plus artificial intelligence at the current constraint.
Start with the flow of value
The useful place to start is not the AI tool. It is the path from idea to business impact. An idea has to be discovered, shaped, decided, built, tested, launched, adopted, measured, and learned from. That is the flow that matters. If one part of that flow suddenly gets much faster, the whole system does not automatically get faster. The constraint usually moves.
AI coding changes what feels possible in the build part of that flow. Coding agents, copilots, and increasingly capable development environments can reduce the effort required to create software. They can help with boilerplate, tests, refactoring, debugging, documentation, migration work, and even full feature slices when the intent is clear enough. For engineering teams, this is not a minor productivity bump.
But the system still has to absorb the work. Someone still has to decide what problem is worth solving. Someone has to shape the feature well enough that an agent does not produce the wrong thing quickly. Someone has to look at the architecture, risk, quality, security, adoption, support model, and business outcome. Someone has to launch the thing into a real workflow where people already have habits, incentives, and constraints of their own.
AI can help with many of those activities too. This is not a “humans do everything after coding” argument. It is a constraint argument. When one capability scales faster than the others, the bottleneck moves. If you do not notice where it moved, you end up optimizing the wrong part of the system.
AI creates speed, not automatic value
I keep coming back to one line because it explains a lot of what I see in AI transformation work: AI creates speed, not automatic value. Everything gets faster, including the good things and the bad things.
If your product discovery is sharp, AI helps you turn good intent into tested options faster. If your backlog is full of half-baked requests, AI helps you produce more half-baked software faster. If your team has strong engineering discipline, AI can raise the leverage of that discipline. If your team already struggles with integration, ownership, and release quality, AI may increase the volume of work that hits those weak spots.
This is the part that gets lost in local productivity conversations. A developer becoming 30%, 50%, or even 10x faster at certain tasks is meaningful, but the business does not experience value at the moment code is generated. The business experiences value when the right change reaches the right workflow, gets used, and improves something that matters.
That path from idea to impact includes discovery, prioritization, design, build, test, security, release, adoption, measurement, learning, and sometimes stopping. AI coding mostly accelerates one section of that path. If the rest of the path does not accelerate with it, the pressure just moves downstream.
The point is not to downplay the engineering gain. AI agents are getting genuinely useful at coding, debugging, test generation, documentation, and the mechanics around software delivery. The point is that the gain is asymmetric. Code is a relatively concrete medium with fast feedback loops. Product discovery, adoption, behavior change, and value validation are more ambiguous. They have weaker observability and more human context. So AI often scales engineering faster than it scales the rest of the value stream.
That asymmetry creates what I think of as acceleration whiplash. The inner loop of coding gets faster, but the outer loop of deciding, reviewing, integrating, releasing, adopting, and learning does not move at the same pace. The organization feels more energy and more activity, but the whole system also becomes more volatile. More code arrives at the next gate before the gate has learned how to handle the new arrival rate.

Local productivity is not enough
Imagine a value stream that starts with an idea and ends with measurable business impact. Before AI coding, engineering throughput may have been the visible constraint. Leaders asked why it took so long to build features, why estimates were unreliable, and why teams could not get more done. Now AI enters the picture and engineering output improves. More prototypes appear. More pull requests are opened. More experiments become technically possible. The demos look better.

Then the organization starts to feel the new congestion. Product leaders cannot make sense of the expanded option set. Architects and security reviewers are asked to look at more changes. Business stakeholders see more demos but still do not know which ones deserve adoption work. Operations teams are asked to absorb tools that change the workflow but not the surrounding policies. Finance still wants to know what changed in cost, revenue, quality, cycle time, or customer experience.
Code review is one of the first places many engineering organizations are already feeling this. AI makes it easier to produce more code, larger pull requests, and more frequent changes. Review capacity does not automatically scale with it. So PRs start accumulating, waiting for review, or they get rubber-stamped because nobody wants to be the person slowing down the new AI-assisted velocity. That makes the metrics look better for a while, but it quietly jeopardizes quality, architecture, security, and shared understanding.
The research direction here is useful, even if I would be cautious about treating any single benchmark as gospel. The patterns to watch are not exotic: PRs getting bigger, review queues getting longer, more changes touching more files, more copy-paste, more churn, and more defects showing up after the merge. These are flow signals. They tell you whether AI is improving the system or just pushing risk into the next queue.
From the inside, this can feel confusing. Everyone can point to more activity. The tools are working. The teams are excited. The engineering demos are real. And still, the business impact lags.
That gap is the signal. It means the constraint is no longer where you thought it was.
Sometimes the gap is visible immediately. Work piles up before release, before adoption, or before business review. Sometimes it is hidden for a while because engineering has a backlog of useful internal work: technical debt, missing tests, brittle pipelines, quality gaps, documentation cleanup, migration work. AI can help burn down that backlog, and that is a good use of the capacity. But it can also delay the moment when leaders notice the new bottleneck. Validating internal quality work is much easier than validating new customer or business value. At some point, if engineering keeps scaling, the constraint will show up elsewhere.
There is another version of this that is more subtle: the expensive busywork factory. When engineering capacity becomes easier to unleash but validated product direction does not keep up, people find work. They clean up papercuts, build internal utilities, prototype side ideas, generate dashboards, and improve things that may be useful but are not necessarily the best use of scarce attention. Some of that work is good. Some of it is the system manufacturing activity because it does not know what value to pull next.
The AI coding bottleneck is often upstream
One of the more surprising effects of AI coding is that it makes weak intent more expensive, not less. When building is slow, vague ideas sometimes die before they consume too much capacity. When building becomes cheap, vague ideas can turn into working software quickly enough to create real downstream cost.
This is why spec-driven development is becoming more important, not less. The AI agent needs a strong description of the desired behavior, the constraints, the acceptance criteria, the edge cases, and the reason the change matters. Without that, it will happily generate plausible code that may or may not solve the right problem.
So the bottleneck often moves upstream to judgment. Which problem is worth solving? What is the smallest valuable slice? What should we learn before we automate? Which customer behavior are we trying to change? What risk should stay human-reviewed? What would make us stop?
Those are not “requirements gathering” questions in the old bureaucratic sense. They are product judgment questions. AI raises the leverage on them because a clear decision can now travel through the build step much faster. A poor decision can travel just as fast.
The bottleneck can also move downstream
In other organizations, the upstream work is decent, but the bottleneck appears after the software exists. The team can build internal tools, automations, agents, dashboards, workflows, and prototypes faster than the organization can adopt them.
That is not a technology problem. It is a workflow and operating-model problem. People have existing habits. Teams have handoffs. Leaders have approval paths. Metrics reinforce the old way of working. The new tool may be better in isolation and still fail to change the work.
This is where a lot of AI theater comes from. There are pilots, demos, hackathons, prototypes, and a growing sense that the organization is “doing AI.” But business impact does not show up because the work around the tool has not changed. The bottleneck is adoption, process redesign, role clarity, decision rights, or value measurement.
When that is the constraint, adding more AI coding capacity is like widening the entrance to a highway that is jammed three exits later. You will move more cars into the traffic jam. You will not get more people to their destination.
Look at the whole flow, not the AI tool
The useful move is to look at the end-to-end flow. Start with the path from business idea to business impact and ask where work is waiting, where decisions are slow, where feedback arrives late, and where value becomes hard to prove.

A cumulative flow diagram is still one of the simplest ways to make this visible. Look at how much work is accumulating over time in each stage: AI activity, explored ideas, working-tested software, shipped software, adopted software, and business impact. If working-tested software is growing much faster than adopted or impact-producing software, the bottleneck has moved. You may have great DORA metrics and still have weak value flow.
You can use the same lens inside engineering. Look at pull requests opened, pull requests waiting for review, review cycle time, PR size, rework after review, defect escape, and how often reviews are approved without meaningful discussion. If AI increases coding throughput but review quality collapses, the bottleneck did not disappear. It moved into review and validation.
This matters even more for AI-enabled development because observability is not just a reporting problem. If you cannot tell whether the thing you shipped is being used, enjoyed, or creating the outcome you expected, your agents cannot close the loop either. They can help produce the next thing, but they cannot learn what “better” means in your context. The feedback loop from real use to future development becomes the constraint.
In some cases, the constraint really is engineering. Maybe a legacy platform needs modernization, the test suite is weak, or teams are buried in toil that AI can help remove. In that case, focus AI there. Use agents to write tests, improve observability, clean up brittle areas, generate migration scaffolding, or reduce manual support work.
In other cases, the constraint is product discovery. Then the right AI use might be research synthesis, opportunity mapping, option generation, experiment design, or turning messy stakeholder input into sharper product bets.
Sometimes the constraint is launch and adoption. Then AI should help with workflow redesign, enablement, support scripts, telemetry, onboarding, internal communications, and the evidence needed to help leaders make the next decision.
And sometimes the constraint is leadership decision quality. Too many ideas are in flight. Funding is spread too thin. Nobody wants to kill a promising-sounding AI initiative. Every team wants its own pilot. In that case, the useful work is portfolio visibility, staged funding, and sponsor-ready evidence. The problem is not that people need more AI ideas. They need a way to choose.
Apply theory of constraints to AI work
The theory of constraints gives a practical way to avoid the trap. Find the constraint. Decide how to exploit it. Subordinate the rest of the system to it. Elevate it. Then look again, because the constraint will move.
In this context, “subordinate” is the important word. Do not just ask where AI is exciting. Ask where human and artificial intelligence should be aimed so the constraint can scale. If adoption is the bottleneck, use AI to improve onboarding, activation, support, workflow fit, and usage feedback. If product judgment is the bottleneck, use AI to sharpen options, synthesize evidence, stress-test assumptions, and make better stop/scale decisions. If value evidence is the bottleneck, use AI to instrument the workflow and connect usage to the business outcome leaders actually care about.

In the AI coding world, that might sound like this:
- If unclear product intent is the constraint, do not start by buying more coding tools. Improve how teams shape work.
- If code review is the constraint, reduce batch size before you add more coding capacity. Smaller PRs, clearer specs, automated pre-review checks, and explicit review WIP limits will do more than asking reviewers to work harder.
- If quality and validation are the constraint, use AI to strengthen tests, review risk, catch duplication, and shorten feedback loops before increasing feature volume.
- If adoption is the constraint, slow down the demo factory and focus on changing the workflow around fewer, higher-value bets.
- If executive decision-making is the constraint, create a visible AI portfolio with confidence by phase, expected impact, cost of pursuit, and clear stop/go criteria.
- If business impact measurement is the constraint, define the evidence before the build, not after the launch.
The uncomfortable part is that this often means not using AI where it is most exciting. It means using AI where the system is actually constrained. That is a different management discipline than “encourage everyone to experiment” or “roll out copilots and track usage.”
Experimentation is useful. Broad literacy matters. I still want people to have access to the tools. But focused enablement should not be spread evenly just because the org chart is even. If the bottleneck is product, operations, adoption, or users themselves, that is where the deepest AI support should go first. Help the constraint scale. Then look again.
What leaders should ask now
If AI coding is showing up in your organization, I would not start with “how much faster are developers?” I would ask a few harder questions.
Where does work wait after engineering gets faster? Are pull requests getting larger, waiting longer, or being approved with less real review? Are we seeing more churn, duplication, fragile changes, or production surprises? Which decisions are becoming more frequent, more complex, or more consequential? Are we creating more working software than the business can absorb? Are AI-generated prototypes changing real workflows, or just making better demos? Do we know which business metric should move if this work succeeds? Are we funding too many AI bets because each individual bet now looks cheaper?
I would also ask what kind of work your best people are being pulled into. If AI reduces toil, those people should move closer to judgment, customer insight, experimentation, and system-level decision-making. If they simply spend the saved time managing more work in process, you have not improved the system. You have just filled the freed capacity with more traffic.
The promise of AI coding is not that engineers type less. The promise is that scarce human judgment can move closer to the work that matters most. People should spend less time on toil and more time in the places where judgment, creativity, customer insight, and problem-solving matter. But even that move needs to be constraint-focused. Do not only move people into their genius zone in the easiest place to do it. Start where the system is constrained.
A note on the older testing-flow lesson
This is not the first time software organizations have run into this pattern. Years ago, the visible bottleneck was often late testing and painful stabilization. Teams could code faster than the organization could validate, harden, release, and learn. DevOps, CI, continuous delivery, better automated testing, and now AI testing assistants improved that dramatically.
But the bottleneck was not eliminated. It moved. The constraint is less often “can we test working software?” and more often “can we validate real value?” I wrote about the earlier version of this problem in Using Flow To Manage Testing Bottlenecks. The same flow logic applies here, but the center of gravity has moved from release readiness toward value realization.
The practical takeaway
AI coding is a real breakthrough. I am not interested in downplaying it. But the organizations that get business impact from it will not be the ones that only measure local productivity. They will be the ones that notice where the bottleneck moved and redesign the system around that constraint.
Sometimes that means accelerating engineering. Sometimes it means slowing down the idea factory. Sometimes it means improving testing and validation. Sometimes it means helping business leaders make fewer, clearer, better-funded bets. Sometimes it means putting AI to work on adoption and value evidence rather than code.
More output into a constrained system does not create flow. It creates queues. AI just makes the lesson harder to ignore because the output arrives faster.
If AI helped your team build faster, the next question is not “how do we build even more?” It is “where did the bottleneck move?”
About Yuval Yeret
Yuval is a rare practitioner who has shaped the agility path of dozens of organizations and influenced the frameworks used across the industry. He helps product and technology leaders move from agile theater to evidence-informed, outcome-oriented delivery that creates better value sooner, safer, and happier.