All posts

Field note

How to Avoid "AI Theater" and Ship Real Value

Akshit Kandi
#AI ROI#enterprise AI#AI strategy#agents#CFO
How to Avoid "AI Theater" and Ship Real Value
AI ROI

How to Avoid "AI Theater" and Ship Real Value

SkySync

Most enterprise AI programs are performances staged for the board, not systems that move a number. Here is how to tell the difference before the budget is gone.


A finance leader I respect described her company's AI program in one sentence: "We have eleven pilots and zero things we'd be sad to lose." That is AI theater. The demos work. The slides are beautiful. The press release went out. And if every model were switched off tomorrow, the P&L would not notice.

AI theater is not fraud. Nobody set out to waste money. It is what happens when an organization optimizes for the appearance of progress, because appearance is easier to manufacture than progress. The fix is not more technology. It is a different set of questions, asked earlier, by the person who controls the budget.

Theater has a tell: it never names the number

Real value attaches itself to a line item. It grows revenue, lowers cost, frees capacity, or reduces risk — and someone can point to where on the statement that shows up. Theater floats above the statement. It speaks in "efficiency," "productivity," "transformation," and "enablement." Those words are real, but unattached to a number they are a way of not being measured.

The simplest diagnostic you have is a single question asked of any AI initiative: which line on which statement does this move, by how much, and by when? If the answer is a paragraph instead of a figure, you are funding a performance.

If a project cannot name the line it moves, it is not an investment. It is a subscription to the feeling of being modern.

The four stages most theater never leaves

Useful AI travels a path: it gets ready, it launches, it scales, and someone keeps it running and accountable. Theater gets stuck — usually proudly — at the first two stops. Watch for these stalls:

  • Eternal pilot. The proof-of-concept that has been "validating" for three quarters. A pilot that never graduates is a tell that nobody believes the production case will survive contact with real data.
  • Demo that only works on clean inputs. It dazzles on the curated example and collapses on the messy Tuesday-afternoon record, so it never ships to where the messy reality lives.
  • Launch with no owner. The thing goes live, the project team disbands, and six weeks later quality drifts because no one is accountable for the agent's behavior in production.
  • Scale by headcount, not by system. "It's working, let's hire a team to babysit it" — which means it isn't really working. You've just moved the cost somewhere the demo didn't show.

The pattern underneath all four: the org rewarded the launch and never funded the running. Value lives in the running. A model that answers brilliantly in March and quietly degrades by June produced theater with a long intermission. The four stops are why we built our own delivery around the full arc — Agent Ready, Agent Launch, Agent Scale, Agent Care — instead of stopping at the demo where most engagements end.

Why smart companies stage it anyway

It would be comforting to blame vendors. But the incentives that produce theater are usually internal. A pilot is cheap, safe, and photogenic. It generates a board update without forcing a hard integration into a system of record. It lets a team claim AI experience without owning an outcome. And it can be relabeled a "learning" if it fails, which means it can never quite fail.

Vendors then meet that demand. The market is full of offers to advise on AI and build a prototype — the two phases that generate invoices without generating accountability. Far fewer offers include running the thing in production and staying on the hook when the number doesn't move. The asymmetry is the point. Build-and-leave is the business model that produces most theater, because the party paid to ship has no stake in whether it still works a month later.

The unglamorous root cause: data, not models

Here is the part the keynotes skip. The reason most agents underperform is not the model. Frontier models are extraordinary and getting cheaper every quarter. The reason is that the agent is reasoning over data that is fragmented, stale, contradictory, or locked in a system it can't reach at the moment it has to act. An agent is only as good as the context you can assemble in front of it at the instant of decision — and that context usually has to be stitched from a CRM, a billing system, a support history, and three exports nobody owns.

This is the work that decides the outcome before a single prompt is written: resolving the same customer across systems, deciding which field is the source of truth when two disagree, and giving the agent a live read of state instead of a nightly snapshot. Theater skips it because data work is invisible in a demo. But a speed-to-lead agent that can't see which leads are real, or a service agent that can't read the customer's actual history, will always look smart and act dumb. That is why our sequence is data first, then the agent — unify what the agent stands on, and the model stops being the bottleneck.

What a real engagement looks like from the buyer's chair

You do not need to understand transformers to govern this well. You need to insist on the shape of a real investment. Concretely:

  • A named metric and a baseline. "Today we convert X of inbound leads; the target is Y" — written down before work starts, not reverse-engineered after.
  • A line of sight to the statement. Revenue, cost, capacity, or risk — pick one and trace the path from the agent's action to the dollar.
  • An owner for the running, not just the building. Someone accountable for the agent's behavior in production after the launch glow fades.
  • A kill criterion. The conditions under which you'd shut it off. Theater has no off switch, because admitting failure was never on the table.
  • Fees that bend toward outcomes. The more a partner's compensation tracks your result instead of their hours, the less theater you'll be sold.

That last point is the strongest filter you have. When you ask a partner to tie part of their fee to the number they promised to move, the theater tends to excuse itself from the room. The ones who stay are the ones who believe the value is real enough to bet on. We run our own engagements that way on purpose — it is the cheapest honesty test in the business.

A small, honest example

Take inbound leads — a place where the math is unforgiving and easy to follow. Say you generate a steady flow of inquiries and convert 10% of them today. The theater version is a chatbot demo that wows the marketing team and never touches the actual lead flow. The real version starts by unifying the lead data so an agent can respond in seconds instead of hours, then runs in production, where slow follow-up was quietly killing deals before anyone got to them.

Notice what makes it real: a baseline (10%), a mechanism (speed-to-lead), a place it lives (production, not the demo), and a number it is accountable to (conversion). Those figures are illustrative — I'm not reporting a result, I'm showing the structure. That structure is the entire difference between theater and value. If your AI program can't be described in that shape, it probably can't move your statement either.

The one question to take into your next AI review

You don't need to audit the architecture. Walk into the next review and ask: "If this works exactly as planned, which number changes, by how much, who owns it in production, and what would make us turn it off?" Four parts. If the team answers all four without flinching, you have an investment. If the answers turn into adjectives, you have theater — and now you know to stop paying for the show. The goal was never to "do AI." It was to move a number you care about and keep it moved after the applause stops. Everything else is staging.

If you want a partner who builds it, runs it, and ties the fee to the number it moves, start with a 30-minute call.