AI Strategy & Business Transformation

How to Measure ROI from AI Investments: A Framework for Business Leaders

Pillar guide Updated 2026 15 min read By Ricardo Mendoza Castro

An IDC study commissioned by Microsoft found that companies average $3.70 in return for every $1 invested in generative AI. IBM's Institute for Business Value — surveying more than 2,500 executives independently — puts the average enterprise AI ROI at 5.9%, well below the typical 10% cost of capital. MIT's NANDA initiative found that 95% of generative AI pilots deliver no measurable P&L impact.

$3.70
Average return per $1 invested in generative AI
IDC / Microsoft
5.9%
Average enterprise AI ROI — below the 10% cost of capital
IBM IBV
95%
Of GenAI pilots deliver no measurable P&L impact
MIT NANDA

Same technology. Broadly similar companies. Overlapping timeframes. These figures are not contradictory — they measure different things. And that distinction is the starting point for every credible AI ROI measurement exercise.

What ROI can you realistically expect from an AI investment? The evidence points to 2–4 years to payback on most enterprise use cases, with function-level returns visible earlier when baselines and KPIs are defined before deployment. Organizations that scale AI — rather than running perpetual pilots — report measurable returns almost universally, per Deloitte's survey of 1,854 executives. The gap between companies that capture value and those that don't is overwhelmingly a measurement and execution problem, not a technology problem.

This guide gives you a four-layer framework, a step-by-step measurement process, and the calculator structure you need to build an AI business case that will hold up to CFO scrutiny.

The Benchmark Paradox: Why AI ROI Numbers Seem to Contradict Each Other


The headline numbers tell two very different stories, and knowing which one applies to your situation is the first act of measurement discipline.

Vendor-commissioned studies and self-reported executive surveys produce the optimistic picture. IDC/Microsoft report $3.70 per $1 invested on average, with top performers reaching $10.30. Google Cloud — in a survey of 2,500 executives — found 74% of organizations already seeing ROI from generative AI, with 86% of growing GenAI users estimating at least 6% annual revenue improvement. Forrester TEI studies for individual AI products show ROI ranges of 106–457%, depending on the product and composite organization analyzed.

Independent research produces a more measured view. IBM's Institute for Business Value puts average enterprise AI ROI at 5.9%, with only about 25% of initiatives delivering expected returns. McKinsey's State of AI 2025, covering 1,993 respondents across 105 countries, found that 88% of organizations use AI in at least one function but only 39% report any EBIT impact — and just 6% qualify as high performers with more than 5% EBIT attributable to AI. BCG's AI Radar 2026, surveying 2,360 executives, found only 26% can move from proof of concept to tangible value.

Chart 1
What the benchmarks actually say about AI ROI
Independent research
Vendor-commissioned / self-reported
IDC / Microsoft, 2024Return per $1 invested
$3.70 : $1
Google Cloud, 2,500 execsAlready seeing ROI from GenAI
74%
BCG AI Radar 2026Able to move PoC → tangible value
26%
Deloitte, 1,854 execsAchieved payback under 1 year
6%
McKinsey State of AI 2025High performers, >5% EBIT impact
6%
IBM IBV, 2025Average enterprise AI ROI
5.9%

Source: IBM Institute for Business Value, Thinking Forward: Lessons from AI Leaders, 2025 · McKinsey & Company, The State of AI in Early 2025, May 2025 · Deloitte Insights, State of Generative AI in the Enterprise, Q4 2024 · Google Cloud / Ipsos, AI Opportunity Study, 2024 · IDC commissioned by Microsoft, The Business Opportunity of AI, 2024 · BCG, AI Radar 2026, 2026. Bars use mixed units; colour encodes methodology.

These numbers diverge so sharply because they measure at different levels of analysis. Before any ROI conversation, you need to decide which level you are measuring:

  1. Use-case ROI — the return one specific AI system generates within its operational unit.
  2. Business-unit ROI — where multiple use cases compound to produce broader cycle-time, capacity, and compliance effects.
  3. Corporate/P&L ROI — where only attributable, material impacts reach the EBIT and cash flow statement.

Most vendor studies and self-reported surveys measure Level 1. Most executives are asking about Level 3. The distance between those two levels — the time, adoption, and workflow redesign required to bridge them — is the benchmark paradox. No ROI calculation can resolve it without first defining which level is being measured.

Why AI ROI Is Harder to Measure Than Traditional Software


The formula is unchanged:

ROI = (Net Benefits − Total Costs) / Total Costs × 100

What changes for AI is that almost every input is harder to pin down.

Benefits are indirect. When AI helps an analyst produce reports 30% faster, the value shows up as productivity — not as a cost line that disappears from the accounts. The analyst stays employed and handles more work. Proving that the "more work" translates to a measurable business outcome requires measurement infrastructure most organizations never build.

Costs are distributed and ongoing. Licensing is a fraction of the real bill. Data preparation alone consumes 30–50% of a typical AI budget, per PwC. A practical rule of thumb: actual total cost of ownership runs 2–3× headline licensing. For smaller businesses, approximately 60% of generative AI costs come from maintenance, training, and scaling — not the initial implementation.

Baselines are missing. How long did contract review take before AI? What was the error rate on invoice processing? Most organizations deploy first and ask measurement questions later — by which point the baseline is gone and attribution becomes guesswork.

Attribution is genuinely complex. In blended human+AI workflows, "the AI caused this result" is rarely accurate. One practical response: a workflow tagging framework that marks each process stage as machine-generated, human-verified, or human-enhanced, making the contribution of automation visible without overstating it.

The economics are inverted. Traditional SaaS is expensive to build and near-zero to run at the margin. AI is cheap to start and carries high, variable operating costs — compute, tokens, retraining. Stanford HAI data shows the cost of inference at GPT-3.5 quality fell more than 280× between November 2022 and October 2024, while hardware costs fell approximately 30% annually. Any business case spanning 24–36 months should model at least three sensitivities: inference and token costs, effective adoption rate, and the percentage of saved time that actually gets reinvested in productive work.

Not every productivity gain is an accounting saving. If a team saves two hours per week per person but management does not redesign workflows to redirect that capacity, the saving exists only on paper. BCG's analysis identifies this as the root cause of the persistent gap between pilot enthusiasm and enterprise-level EBIT impact.

Chart 2 · The 93/7 Problem
Where AI budgets go vs. where value actually comes from
Where AI budgets typically go
93%Technology
Technology — 93%
People — 7%
BCG prescriptive split for value capture
70%People & process
People & process — 70%
Data & technology — 20%
Algorithms — 10%

Source: Fortune, "AI is booming — so why are companies failing to profit from it?" (quoting Deloitte CTO Bill Briggs), December 2025 · BCG, It's Not a Data Problem: 70% of AI Value Comes from People and Processes (10-20-70 rule, recurring research 2019–2025).

The Three Levels of AI ROI — and Why Mixing Them Is the Core Mistake


Before choosing a measurement method, define the level you are measuring. Mixing levels in a single business case is the most common reason AI investment proposals stall in finance committees.

Level 1 — Use-case ROI answers: does this specific AI system produce measurable value in its operational context? This is the most tractable level. A peer-reviewed randomized controlled trial of GitHub Copilot showed developers completed a defined programming task 55.8% faster — 71 minutes versus 160 minutes — a statistically significant, independently verified result. A BCG and Harvard Business School study of 758 management consultants showed participants completed 12.2% more tasks, worked 25.1% faster, and produced higher-quality output inside the AI's capability frontier. Outside it, they were 19% less likely to reach the correct answer — a finding critical to any ROI model that claims gains regardless of task type.

Level 2 — Business-unit ROI asks: what happens when multiple use cases compound across a team or department? Cycle times shorten. Compliance workloads drop. Throughput rises. Benefits at this level require more complex attribution because multiple factors interact simultaneously.

Level 3 — Corporate/P&L ROI asks: does AI move the EBIT line? McKinsey found only 39% of organizations report any enterprise-level EBIT impact, and most say it falls below 5%. This is not evidence that AI does not work — it reflects the compounding time required to reach Level 3. Deloitte's survey confirmed that typical enterprise AI payback runs 2–4 years, three to four times longer than conventional IT's 7–12 month timeline.

The CFO's three-return framing

A complementary framing used by leading CFOs separates three types of return that prevent premature cancellation of capability-building investments:

  • Realized ROI — proven P&L impact, directly attributable.
  • Trending ROI — leading indicators pointing toward future financial returns.
  • Capability ROI — foundational infrastructure and skills that compound over time, before appearing in EBIT.

The Four-Layer AI ROI Measurement Framework


No single formula measures AI ROI completely. A defensible measurement system requires four distinct layers, each answering a different question.

Chart 3
The four-layer AI ROI measurement framework
1
CFO Business Case
"Is the investment worth it?"
ROINPVPaybackBenefit / cost ratio
Limit: can overpromise if causal attribution is weak.
2
Causal Use-Case Measurement
"Did AI cause the result?"
Incremental liftATEDelta vs. control
Limit: requires experimental design.
3
Risk & Non-Financial Scorecard
"Is the solution sustainable?"
Error rateCSATAdoptionCompliance
Limit: not immediately monetizable.
4
Operational Workflow Framework
"Does AI change how work gets done?"
Cycle timeReworkThroughputDefects
Limit: requires process redesign and exec sponsorship.

Source: Liorant framework synthesized from Forrester Research, Total Economic Impact (TEI) Methodology, 2024 · McKinsey & Company, Rewired, 2023 · NIST, AI Risk Management Framework (AI RMF 1.0), 2023 · ISO/IEC 42001:2023 · Gartner, AI Opportunity Radar, 2025.

Layer 1 — CFO Business Case translates the investment into the financial language boards and finance committees use: ROI, NPV, payback period, and benefit/cost ratio. Its main limit: it can significantly overpromise if the causal attribution feeding it is weak.

Layer 2 — Causal Use-Case Measurement answers the critical question — did the AI cause the result, or would the outcome have happened anyway? This layer requires experimental or quasi-experimental design: an A/B test, holdout group, staged rollout, or difference-in-differences analysis. It provides the strongest causal defense of any individual ROI claim.

Layer 3 — Risk and Non-Financial Scorecard assesses whether the solution is sustainable and trustworthy. Metrics include error and override rates, CSAT, privacy compliance status, fairness indicators, and employee adoption levels. NIST's AI Risk Management Framework and ISO/IEC 42001 both require this layer for production AI systems. In European markets, the EU AI Act adds binding governance requirements for high-risk AI applications — a compliance cost that must appear in TCO. Organizations that measure holistically report ROI 22% higher for capability development and 30% higher for GenAI integration, per IBM research.

Layer 4 — Operational Workflow Framework determines whether AI genuinely changes how work gets done. Metrics are cycle time, rework rate, throughput, and defect rate. This layer captures the workflow effect that ultimately drives P&L outcomes. McKinsey found high performers are 3.6× more likely to pursue end-to-end workflow transformation rather than simply inserting AI into existing processes.

How to Measure AI ROI: A Step-by-Step Process


Chart 4
AI ROI measurement timeline
0 3 mo 6 mo 12 mo 18 mo 24 mo
FoundationWeeks 0–4
PilotMonths 1–3
ScaleMonths 3–6
RealizationMonths 6–12
OptimizationMonths 12–24
Foundation
  • Document baseline
  • Define primary KPI
  • Build TCO model
Pilot
  • A/B or controlled rollout
  • Track adoption weekly
Scale
  • Expand scope
  • Quasi-experimental check
Realization
  • Attribution analysis
  • ROI / NPV calculation
Optimization
  • Rebalance portfolio
  • Add new use cases

Source: Liorant framework informed by McKinsey & Company, The State of AI in Early 2025 · BCG, AI Radar 2026 · Forrester TEI stage-gate methodology · Deloitte Insights, Q4 2024. Diamonds mark stage-gate checkpoints.

Stage 1 — Pre-Investment: Set the Foundation

  1. Document the current process baseline. Before any AI system goes live, spend one structured day per target process recording: average task duration, headcount involved, error and rework rate, weekly volume, and fully-loaded cost per transaction. If you cannot document current time, cost, volume, and error rate for a process, do not deploy yet.
  2. Choose one primary KPI plus 2–3 secondary metrics. The primary KPI answers: "why are we doing this?" It should be the metric leadership actually tracks — cycle time, deflection rate, cost per transaction, revenue per customer — not model accuracy or API response time.
  3. Build a fully-loaded TCO model. Go beyond licensing: compute and token costs, data engineering and preparation (30–50% of budget), integration, security and compliance, change management, and ongoing retraining. Assume TCO will run 2–3× headline licensing.
  4. Match the payback model to the use case. Back-office automation → cost avoidance. Customer service → deflection rate and cost per transaction. Marketing and sales → revenue attribution. Data infrastructure and platform investments → capability ROI, where value compounds rather than crystallizing immediately.
  5. Set conservative, realistic, and stretch targets for each KPI, and translate each into a specific monetary value. Build in a realization factor of 50–70% to reflect the proportion of productivity gains that actually gets redirected into productive work through workflow redesign.

Stage 2 — During Implementation: Track Leading Indicators

  1. Track adoption weekly. Active users, tasks automated, and percentage of target workflows covered. The scope multiplier — adoption rate × realization factor — is the single most sensitive variable in most business cases. If only 5% of employees use the tool, NPV is approximately 5% of the full-deployment projection.
  2. Monitor quality indicators. Hallucination rate, guardrail triggers, override rate, and model drift signals. These are the early-warning system for customer-satisfaction decline — the risk Klarna discovered after its AI customer service rollout produced lower-quality support.
  3. Run milestone-based stop/go reviews. Define explicit metric thresholds at each gate — baseline to pilot, pilot to scale — with clear criteria for continuing, adjusting, or stopping.
  4. Use A/B tests or holdout groups wherever feasible. Where controlled experiments are not possible, apply difference-in-differences or staged rollouts and control for seasonality and external factors.

Stage 3 — Post-Deployment: Prove and Scale

  1. Calculate Realized ROI, payback period, and NPV against the documented baseline. Report using the three-tier view — Realized, Trending, and Capability ROI — so the full picture is visible.
  2. Review operational KPIs monthly and financial ROI quarterly for early-stage deployments, transitioning to annual review once performance is stable.
  3. Watch for stop triggers: data drift, customer-satisfaction signals declining below threshold, missed ROI gates for two consecutive quarters, or a use case where labor savings are offset by quality costs or churn.
  4. Scale by scope deliberately. Once a use case proves out at 10–20% adoption, raising it toward enterprise-wide is where projected NPV actually materializes. The gap between a promising pilot and a value-generating deployment is almost always an adoption and workflow problem — not a technology problem.

The AI ROI Calculator: Formulas, Variables, and the Scope Multiplier


See how Liorant delivers a working AI system in 4 weeks — and builds the measurement framework alongside it. Explore Rapid AI Activation →

A credible AI ROI calculator must separate realized economic value, potential economic value, and non-financial value. It also includes the variable most generic templates omit: the scope multiplier.

Core Formulas

ROI = (Total Attributable Benefits Total Costs) / Total Costs NPV = Σ ( Net Cash Flowₜ / (1 + r)^t ) Payback Period = Total Investment / Annual Net Benefit Productivity savings = Hours saved × cost/hr × adoption × realization Revenue benefit = Impacted revenue × uplift × gross margin × attribution Risk benefit = Avoided incidents × avg cost per incident

The realization factor answers a specific question: of all the time and capacity AI frees up, how much actually gets redirected into productive work? Vendor case studies typically assume 100%. A conservative estimate of 50–70% reflects the reality that unredesigned workflows absorb recovered capacity without generating a measurable business outcome.

The scope multiplier — adoption rate multiplied by realization factor — is the lever that most consistently determines whether a promising pilot produces a defensible business case. Sensitivity analysis on copilot and agent deployments routinely shows that raising adoption from 40% to 80% doubles projected NPV. Halving the realization factor typically cuts NPV by more than all other cost assumptions combined.

Illustrative Example

To demonstrate the method — not a benchmark or promise
  • 80 users with an AI copilot; licenses and operations at €50,000/year; €20,000 initial investment.
  • Each user saves 1.5 hours/week across 48 working weeks. Fully-loaded cost: €35/hour. Realization factor: 55%.
  • Ramp-up: 50% in Year 1, 80% in Year 2, 100% in Year 3.
  • One operational incident avoided per year, valued at €5,000.

Stabilized annual benefit: approximately €116,000. Three-year simple ROI under these assumptions: approximately 59%. Payback falls in Year 2. The calculation is most sensitive to adoption rate and realization factor — not to licensing cost or per-token inference price.

Chart 5
What drives AI ROI: sensitivity by variable
Baseline ROI ≈ 59% · each variable flexed ±20%
Adoption rate
33%
84%
Realization factor
38%
80%
Hours saved / user / wk
45%
73%
Total user count
49%
69%
Licensing / operating cost
54%64%
Inference / token cost
56%62%
← Lower ROIHigher ROI →

Source: Liorant analysis based on the illustrative model above (not an external benchmark). Sensitivity ordering informed by Forrester TEI methodology · BCG, AI Radar 2026 · Dell'Acqua, F. et al. (2023), "Navigating the Jagged Technological Frontier," HBS Working Paper 24-013.

Request the AI ROI Calculator Template — get in touch with Liorant

What Real AI ROI Cases Actually Show


Chart 6
Published ROI from documented AI deployments
Operational decisions (ROI)
Productivity / copilot
Cautionary case
Vallarta SupermarketsAI fresh inventory · Nucleus Research
1,070%
Border StatesML inventory · 1.3-mo payback · Nucleus
976%
Flash.coAzure AI Foundry · 9.6-mo payback · Nucleus
366%
Productivity gains (knowledge work) — peer-reviewed RCTs
GitHub CopilotFaster task completion · peer-reviewed RCT
55.8% faster
BCG / Harvard, 758 consultantsSpeed gain (also 12.2% more tasks) · RCT
25.1% faster
Cautionary case — Klarna

AI customer service handled the equivalent of 700 agents and cut resolution time from 11 minutes to under 2 — then partially reversed in 2025, re-hiring humans after the labor-savings-first approach produced lower-quality support.

Source: Nucleus Research case studies: Border States/GAINS, Vallarta/Logile, Flash.co/Microsoft Azure (2023–2024) · Kalliamvakou, E. (2022), GitHub Blog · Dell'Acqua, F. et al. (2023), HBS Working Paper 24-013 · Klarna press release, Feb 27 2024; Reuters and The Guardian reporting, 2025.

The highest documented returns consistently come from one use-case type: AI that changes repetitive operational decisions — inventory, forecasting, classification, query routing — not simply AI that "saves time."

Border States, a US industrial distributor, implemented ML-based lead-time prediction through GAINS. Result: 976% ROI, a 1.3-month payback period, $21 million in inventory reduction, and $4.8 million in annual savings.

Vallarta Supermarkets deployed AI-powered fresh inventory management. Result: 1,070% ROI and more than $10 million in cumulative benefits by year three.

GitHub Copilot was tested in a peer-reviewed, randomized controlled trial — the most methodologically robust type of AI productivity study available. Developers completed a defined programming task in 71 minutes versus 160 minutes without AI assistance: a 55.8% improvement in task speed, statistically significant.

BCG and Harvard Business School ran a controlled experiment with 758 management consultants using generative AI tools. Inside the AI's capability frontier, participants completed 12.2% more tasks, worked 25.1% faster, and produced higher-quality output. Outside the frontier, they were 19% less likely to reach the correct answer. Any ROI model that applies uniform productivity gains regardless of task type will systematically overestimate returns.

Klarna offers the most instructive cautionary case. In February 2024, the company reported that its AI assistant handled the equivalent of 700 full-time agents, cut resolution time from 11 minutes to under 2 minutes, and was projected to generate $40 million in profit improvement. By 2025, Klarna reversed course, re-hiring human agents after acknowledging that the cost-first approach had produced lower-quality support. A business case that models labor savings without tracking customer-satisfaction signals and churn will overstate returns — sometimes dramatically.

Real cases serve to parameterize your assumptions — not to copy numbers. The right framing: if your context resembles a documented case in workflow type, volume, and adoption profile, use that case to build low, base, and high scenarios for adoption, savings, uplift, and payback. That framing is far more credible to finance teams than claiming a specific ROI figure derived from someone else's deployment.

The Seven Mistakes That Destroy AI ROI Measurement


1. No baseline. The single most fatal error. Without pre-AI measurements, no improvement can be proven. The fix requires discipline: document time, cost, volume, and error rate per process before any AI system goes live.

2. Wrong KPIs. Optimizing for model accuracy when the board cares about euros creates a credibility gap no retrospective analysis can close. Trace the causal chain from model output to the financial metric leadership actually tracks before selecting indicators.

3. Measuring too early. AI returns compound non-linearly. Year-three outcomes often dwarf year-one results. Organizations that evaluate short-term ROI on capability-building investments — data infrastructure, model fine-tuning, agent orchestration — kill projects before they mature.

4. Ignoring the scope multiplier. If 5% of employees use the tool, NPV is approximately 5% of the full-deployment projection. Most ROI templates model adoption as a binary switch. The fix: model adoption as a ramp curve across 12–24 months and include the realization factor explicitly in every scenario.

5. Counting only direct cost savings. Missing indirect value — decision quality, employee retention, customer experience, innovation velocity — understates returns. Missing downside risk — quality degradation, churn, the cost of reversing a failed deployment — overstates net benefits. Organizations that measure holistically report 22–30% higher ROI than those measuring narrowly.

6. Underestimating hidden costs. Data infrastructure, change management, model retraining, and security review are systematically under-budgeted in fast-moving pilots. Apply the 2–3× TCO multiplier and allocate 30–50% of budget to data preparation before counting any benefits.

7. Using vendor figures as defaults. Forrester TEI studies for Microsoft Copilot products show ROI ranges of 106–314%; for Google Workspace with Gemini, Forrester reports 416%. These are built on composite organizations with vendor-optimistic assumptions. Finance teams reject them on first review — not because the analysis is dishonest, but because it does not reflect their organization. Treat vendor figures as input ranges; substitute your own estimates using the formulas in this guide.

How to Report AI ROI to Your CFO and Board


Finance leaders do not reject AI business cases because they distrust AI. They reject them because the cases rest on unverifiable assumptions, single-point estimates, and activity metrics that have not been connected to financial outcomes.

Lead with financial outcomes, not activity. Hours saved is an activity metric. Euros saved — after applying the causal attribution chain, the realization factor, and the scope multiplier — is a financial outcome. Never report hours saved as cash saved without the complete causal chain.

Present a portfolio of use cases, not a single investment line. Classify investments as quick wins (defend existing margin), differentiating applications (competitive advantage), and transformational bets (new business models or revenue streams). Each category warrants a different payback expectation, time horizon, and success metric.

Use scenario ranges with explicit assumptions. Conservative, realistic, and stretch — each with a specific adoption rate, realization factor, and time horizon attached. Single-point estimates signal to finance committees that the analyst did not stress-test the model.

Show the bridge. Map how X% improvement in model accuracy translates to Y% reduction in processing errors, which produces Z euros in avoided rework cost, which shows up as W improvement in gross margin. Without the bridge, the financial claim is undefendable even if the underlying model is sound.

For agentic AI, standard labor-saving formulas do not capture the full economics. Introduce the concept of agent value multiple (AVM) — the ratio of value generated by an AI agent to the cost of running it — rather than forcing orchestrated multi-step workflows into a "time saved per employee" framework.

McKinsey's research is unambiguous on governance: CEO and CFO oversight of AI performance measurement is the factor most correlated with bottom-line impact. Monthly reviews of leading indicators and quarterly ROI reviews for active deployments are the cadence that separates organizations that compound AI value from those that plateau.

Frequently Asked Questions


What ROI can I realistically expect from an AI investment?

Independent research points to a wide range. IBM's Institute for Business Value found the average enterprise AI ROI at 5.9%, below the typical 10% cost of capital. Deloitte's survey of 1,854 executives found that typical enterprise AI payback runs 2–4 years, with only 6% achieving payback in under 12 months. At the function level, customer operations and software engineering consistently produce the clearest, fastest returns. Marketing and sales attract the most AI investment but, per MIT research, often show the slowest confirmed returns. Plan with scenario modeling, not single-point estimates.

How is measuring AI ROI different from measuring software ROI?

The formula — net benefits minus total cost, divided by total cost — is identical. The inputs are harder. AI benefits are indirect rather than direct: they appear as productivity, quality, and decision speed, not as eliminated headcount or canceled licenses. AI costs scale with usage, not seats, and the economics shift fast — inference costs fell more than 280× between November 2022 and October 2024, per Stanford HAI data. Attribution in blended human+AI workflows requires causal design: A/B tests, holdout groups, or difference-in-differences, not simple before/after comparisons.

Why do so many AI projects fail to deliver measurable ROI?

MIT's NANDA initiative found 95% of generative AI pilots deliver no measurable P&L impact. The consistent root causes across McKinsey, BCG, Deloitte, and Gartner research: no documented baseline before deployment, the wrong KPIs, partial adoption that guts NPV through the scope multiplier, and systematic under-investment in people and workflow redesign. BCG's 10-20-70 principle prescribes 70% of effort on people and process. Deloitte found firms spend 93% of their AI budget on technology and only 7% on the people expected to use it.

What is a realistic payback period for enterprise AI?

Deloitte's data puts typical enterprise AI payback at 2–4 years — three to four times longer than conventional IT's 7–12 month norm. Operational AI — inventory prediction, process automation, document classification — produces the fastest payback. AI copilots and agents typically reach payback between Year 1 and Year 2, depending heavily on adoption rate and workflow redesign. A payback projection beyond four years should prompt either a scope reduction or a different use case selection.

Should I trust the ROI figures AI vendors publish?

Use them as input ranges, not defaults. Forrester TEI studies — commissioned by vendors, built on composite organizations with optimistic adoption assumptions — show ranges of 106–457% for major AI copilot products. IBM's independent research puts the enterprise average at 5.9%. Neither figure is wrong in isolation; they measure different things under different conditions. The most methodologically defensible benchmarks come from peer-reviewed randomized controlled trials: GitHub Copilot (55.8% faster task completion) and the BCG/Harvard consultant study (12.2% more tasks, 25.1% faster, higher quality inside the capability frontier) are the current standard for function-level productivity claims.

Build an AI Business Case That Survives Finance Committee Scrutiny

The AI investments that produce real, sustained returns share three traits: they start with a documented baseline, model a fully-loaded TCO, and treat adoption and workflow redesign as the primary value levers — not the AI model itself. Liorant builds the measurement framework before deployment and reports in euros and hours, not uptime metrics.

Start with a free 30-minute AI discovery session. We identify your highest-value automation opportunity and explain exactly how Liorant can help — no slides, no pitch.

Book your free AI discovery session