The gap between buying Copilot and getting a return from it
Two years after launch, Microsoft 365 Copilot has roughly 15 million paid seats. That sounds large until you set it against the 450 million-plus commercial Microsoft 365 seats it could run on. The conversion rate is about 3.3 percent. The tool is everywhere it could be sold and almost nowhere it has been fully adopted.
The licence itself is straightforward to price. Copilot is an add-on rather than a standalone product, so a qualifying Microsoft 365 base licence has to sit underneath it. Once you account for that base, the true per-seat cost often runs two to three times the headline number. That is the first place ROI math goes wrong: teams compare the productivity story against $30, when the real comparison is against the loaded cost of the seat plus the time it takes a person to change how they work.
The harder problem is measurement. Gartner surveyed organisations that had piloted or deployed Copilot and found that only 3 percent described the value as significant, while roughly half landed on "some value, shows promise." Separately, only about 6 percent of enterprises have moved a generative AI project past the pilot stage into production. The technology works. The business case for scaling it is what keeps stalling.
A Copilot licence is a promise of time saved. Without a baseline, that promise has no denominator, and a number with no denominator never survives a budget review.
The seven licensing traps, and what the success pattern looks like instead
The traps below are not edge cases. They are the default path a Copilot rollout takes when nobody owns the ROI question before procurement signs. Each one has a recognisable failure mode and a recognisable fix. The risk badge reflects how much of the investment is exposed when the trap goes unaddressed.
| Decision area | The trap | The success pattern | Exposure |
|---|---|---|---|
| Baseline | Seats approved on a vendor ROI slide. No internal "before" measure of time spent on target tasks | Two-week time study on the specific workflows Copilot will touch, captured before any licence goes live | High |
| Seat allocation | Licences spread evenly across departments to seem fair, including roles with little document or email load | Seats concentrated on high-correspondence, high-document roles where minutes saved compound daily | High |
| Base licence cost | ROI built against the $18–$30 add-on only, ignoring the qualifying M365 seat underneath | Total-cost model that includes base licence, change effort, and the July 2026 price step | Moderate |
| Data readiness | Copilot switched on over a tenant with broad oversharing; answers pull from files people should not see | Permissions and sensitivity labels cleaned up first, so Copilot output is safe to trust and act on | High |
| Adoption ramp | Usage spikes in week one, then plateaus by month three with no prompt training or workflow redesign | 90-day ramp with role-specific prompt libraries and a named champion per team | Moderate |
| Measurement | ROI judged on self-reported satisfaction surveys that finance will not accept | Hours-saved tracked on instrumented tasks plus output-quality and cycle-time metrics tied to a dollar figure | High |
| Renewal discipline | Annual seats auto-renew regardless of usage; dormant licences quietly become shelfware | Quarterly reclaim of seats below a usage threshold, reallocated to people on the waitlist | Low |
Read the trap column again and notice the common thread. None of these are failures of the product. They are failures to decide, before the money goes out, what success would look like and who is responsible for proving it. That decision is cheap to make in advance and expensive to reconstruct after a year of seats have been billed.
Not sure where your Copilot ROI gaps are?
10decoders can review your current seat allocation, data readiness, and usage data, then show you where the return is leaking before your next renewal. Takes about an hour.
Book a Free AI Assessment →Why the ROI number is so hard to pin down
The standard productivity model is seductive because the arithmetic is clean. Take a knowledge worker at a $75,000 fully loaded cost, assume 45 minutes saved a day across 240 working days, and you recover about $5,625 a year per person. Net the licence and you land somewhere between 8x and 15x. On a slide, the payback period reads as days.
The model breaks the moment you apply real adoption. When you assume 60 to 70 percent of seats are genuinely active and add a 90-day ramp before people work differently, the first-year return drops to roughly 5x to 8x. That is still a strong number, but only if the time saved is real and recaptured. Forty-five minutes saved that gets absorbed into a longer lunch is not 45 minutes of recovered capacity, and finance knows the difference.
There is also a governance tax that rarely shows up in the original business case. Around 64 percent of organisations told Gartner that information governance and security work consumed significant time and resources, and 40 percent delayed their rollout by three months or more over data oversharing concerns. A Copilot that surfaces the wrong salary file in a summary does not just create a compliance incident. It teaches employees not to trust the output, which is the fastest way to kill adoption and the ROI that depends on it.
Seats go live
Curiosity is high, usage looks great in the dashboard and everyone feels validated.
Novelty fades
People fall back to old habits, active usage settles far below the seats billed.
Finance asks for the return
Nobody captured a baseline, and the renewal turns into a fight.
The pre-purchase validation that separates spend from return
The checklist below is the minimum due diligence that should happen before a Copilot purchase order is signed, not after the first renewal notice arrives. It is not a compliance framework. It is the practical work that decides whether the investment produces a number you can defend or a line item you have to explain away.
The problem underneath the licensing problem
Copilot ROI rarely fails on the spreadsheet. It fails because the work that would make the spreadsheet true never gets owned. Procurement owns the contract, IT owns the deployment, and the business owns the outcome, so the baseline and the measurement fall into the gap between them.
That is also why "buy seats and see what happens" is such a common and costly approach. It defers every hard decision (who gets a seat, what we measure, how the data gets cleaned up) until after the money is committed, when the people who could have answered those questions have moved on to the next thing. The licence keeps billing. The answers never arrive.
Treating Copilot as a data and workflow project rather than a software purchase changes the order of operations. You decide what to measure, you ready the data, you target the seats, and only then do you scale. It is slower at the start and far cheaper across the year.
The dormant Copilot seat in your tenant is not a Microsoft problem. It is a measurement gap, and it has been billing every month since the day someone approved it without a baseline.
What to do this week
1.Pull your actual Copilot usage data, by user, for the last 90 days
The Microsoft 365 admin centre and the usage reports show who is genuinely active versus who holds a dormant seat. Sort by activity and you will likely find a meaningful share of licences sitting near zero. That gap is your fastest, lowest-effort saving, and it tells you whether the problem is the tool or the rollout.
2.Run a two-week baseline on four workflows before you buy another seat
Pick drafting, summarising, meeting recap, and internal search. Have a small group log the time these take today. This is the denominator every future ROI claim needs, and two weeks of logging is far cheaper than a year of unprovable assertions.
3.Audit your tenant for oversharing before you widen the rollout
Review SharePoint and file permissions and apply sensitivity labels where they are missing. This protects you from the data exposure that delayed 40 percent of rollouts, and it is the difference between Copilot output people trust and output they quietly stop using.
4.Assign one owner for the Copilot ROI number
Name a single person accountable for the baseline, the metrics, and the quarterly seat reclaim. The trap is shared ownership, where everyone assumes someone else is tracking the return. One owner with a dollar target is what turns a pilot into a defensible business case.
Copilot is not a trap, and it is not a guaranteed win. It is a tool whose return depends entirely on the work done around it. The organisations seeing 5x and better are not running different software. They decided what success looked like before they paid for it, readied their data, and held one person accountable for the number. The data in your own usage reports will tell you, this week, which side of that line you are on.
Let 10decoders turn your Copilot spend into measurable ROI
We help enterprises ready their data, target their seats, and instrument the metrics finance will accept, so your Copilot investment produces a number you can defend at renewal rather than a line item you have to explain.