Enterprise AI doesn’t have a technology problem. MIT’s headline that 95% of GenAI pilots fail sparked debates on whether AI is ‘ready’ for the enterprise. But recent McKinsey and Atlassian research points to a more uncomfortable truth. While 88% of organizations now use AI in at least one function, 62% remain stuck in experimentation, and 96% report little or no improvement in efficiency.

Most companies are already using AI, yet the majority remain trapped in pilot mode, reporting little measurable efficiency gain. What’s breaking down isn’t AI capability, but the way large organizations make decisions, assign ownership, and move anything new into production.

That gap, the last mile between a promising prototype and a deployed operational system, is where most enterprises lose momentum. So why can so few enterprises get over the finish line, and what are the successful ones doing differently?

Is 95% Failure a Crisis, or Just Early-Stage Evolution?

When looking at this statistic from a historical angle, the 95% failure rate from MIT looks much less like a crisis and more like a familiar stage in enterprise adoption. The same pattern played out during the rise of the Internet. Throughout the late 1990s and early 2000s, most large companies had brochureware sites while keeping their core workflows offline. The impact didn’t appear until much later, once organizations had moved through predictable integration phases: exploration, experimentation, and gradual embedding into core systems.

AI is following this same curve, but much faster. Every major technology wave—web, mobile, cloud—takes about 8–10 years to result in meaningful enterprise-level impact. The issues AI currently faces, such as technical debt, siloed data, security controls and organizational bottlenecks, are actually markers of where enterprises sit on that curve.

However, companies without 30 years of accumulated systems, such as startups and AI-native firms, are already seeing value because they can integrate AI straight into workflows from day one. So for larger enterprises, they shouldn’t be fretting over if they’re failing, but thinking, how do we compress that 8–10 year period?

Why the ‘Last Mile’ is So Hard

Most AI pilots fail in the gap between concept and production. And that gap contains two points that repeatedly stall enterprise progress.

From Idea to POC:

For many businesses, the AI lifecycle begins with months spent analyzing and not executing: reviewing strategy decks, feasibility analyses, ROI spreadsheets, and governance frameworks. By the time a prototype is finally approved, the business context may have already moved on.

When a POC does appear, it often lacks real artifacts like audit rules or regulatory documents. Without concrete examples, AI skepticism can persist, as stakeholders can’t understand the specific business value.

From POC to Production:

This is when many pilots stall indefinitely. Enterprise data lives across decades of systems with inconsistent schema and missing lineage. The truth is, you can’t boil the ocean, so waiting for 100% clean data is a fantasy. At the same time, productionizing AI requires four or five departments, such as IT, the business unit, compliance, and security, to agree. However, each team has different priorities and KPIs, so decision cycles stretch as consensus governance is simply unrealistic.

These are the visible symptoms of the last-mile problem. Underneath is a deeper misalignment between how executives imagine AI works and operational reality.

Bridging C-Suite Ambition and Engineering Reality

Executives tend to view AI as a lever for transformation, a way to unlock new revenue streams and automate workflows. But the teams implementing those visions operate under a different reality: risk controls, strict operational boundaries, and legacy systems. These worlds move at different speeds, optimized for different forms of success.

This creates a structural gap long before any technical issues emerge. Leaders expect AI to behave like a strategic lever; engineering teams treat it like a system that must be stable and auditable. The result is not a disagreement about value but a mismatch in what AI can do today and how fast it can be responsibly deployed.

The same cognitive gap is found in Mary’s room knowledge argument. You can study an AI workflow on paper, but until teams see the system handle real inquiries, their understanding and risk appetite stay limited. This last-mile challenge is the gap between executive ambition and operational confidence, and closing it requires a different model of execution.

Lesson One: AI Doesn’t Fail; Expectations Do

A large part of the perceived 95% failure rate comes from unrealistic expectations about timing. Enterprises are trying to compress a historically 8–10 year adoption curve into a single budget year. It’s this that makes normal early-stage friction appear as organizational failure.

Instead, companies must decide where they want to be ‘AI-native’ and where they’re comfortable being a later adopter, rather than trying to make every workflow AI-enabled all at once. This mirrors Amazon’s early internet strategy. The business didn’t go from selling books to a global conglomerate overnight, but worked on a few processes, refined them, and expanded.

AI adoption is no different. Enterprises’ AI endeavors fail because they expect broad transformation before they’ve operationalized one use case. They must reset expectations around scope and maturity, which is the first step to converting pilots into impact.

Lesson 2: Compress the Cycle, or You’ll Never Scale

The organizations that consistently move AI projects from prototype to production do one thing differently: they compress the cycle. Instead of treating AI like a multi-quarter exercise, they can turn an idea into a prototype and then deploy it in a fast, iterative loop.

Discover, Design, and Develop

A working POC shouldn’t take months. With the current model ecosystems, vector stores, orchestration layers, and partner stack, it should take just 14 days. The key is to use real artifacts, like actual regulatory documents, customer inquiries, and RFPs, so stakeholders can interact with the system. A functioning prototype collapses months of conceptual discussion into days of practical learning.

The fastest-moving teams pick low-risk, high-visibility workflows and let real usage surface the edge cases, and then refine from there.

Deploy and Deliver

AI won’t reach production if teams wait for it to be perfect. Businesses must start with a use case that only needs 10% of the data to be clean and then iterate. Successful organizations build a human-in-the-loop layer around the model.

AI flags the tasks it can’t handle properly, subject matter experts review exceptions and correct outputs, and those corrections flow back into the system to tune prompts, rules, and data. This continuous human improvement loop is how to systematically improve quality without delaying deployment, and should typically take no longer than eight weeks.

Operate Through Decentralized Execution

One of the biggest under-the-surface blockers is that scaling a POC requires agreement from various business units, and consensus becomes a bottleneck.

The fix is decentralized execution. Enterprises must push ownership to the people closest to the workflow, and give them a small cross-functional pod, including a domain lead, an engineer, a data specialist, and a risk partner. This removes decision gridlock and lets teams iterate where the work actually happens.

For most enterprises, the lowest-friction starting point is internal knowledge management, letting teams query RFP libraries, client decks, audits, and documentation using an AI interface. It carries little risk and delivers immediate value. Once one workflow succeeds, new use cases will emerge organically.

Lesson 3: Fix the Last Mile Before Customers Abandon Ship

One reason enterprises can’t afford slow AI adoption is that customer expectations are shifting faster than internal processes. This has happened before. E-commerce really advanced when the iPhone came out, which helped make mobile shopping effortless as customers chose retailers based on experience, not brand.

AI is heading the same way. Consumers are already using AI tools in their daily workflows, from document translation to customer support. As they grow accustomed to that immediacy, they expect the same responsiveness from the enterprises they engage with.

That’s where the competitive risk emerges. AI-native companies, the ones building support systems and onboarding flows around AI from day one, will feel substantially more intuitive. Legacy enterprises that haven’t solved the last mile will look increasingly slow and fragmented by comparison. The last mile is about protecting market share in a world where the customer experience gap can widen overnight.

Final Thoughts

The message across all three lessons is that the last mile isn’t a technical failure or a result of immature models—it’s that expectation and decision structures aren’t built for the speed at which AI now evolves. And the urgency to fix it is increasing. Models that were cutting-edge a year ago now run at a fraction of the cost, with significantly higher accuracy and far lower latency. Tasks that once took minutes now complete in seconds.

As models become cheaper, faster, and more capable, AI-native companies will absorb those gains into their workflows. Businesses that don’t will continue planning while the frontier moves ahead, even as they’re still aligning stakeholders. The technology curve isn’t waiting. The question now is who will compress the adoption cycle and who will be left catching up after the next inflection point arrives.