Synopsis: The bill for the experimentation phase of enterprise AI is starting to come due. Most organizations green-lit broad pilots without much friction around cost, on the theory that the technology was moving too fast to put guardrails on. That assumption is breaking. Token consumption is climbing in patterns nobody modeled, vendor pricing is shifting underneath multi-year commitments, and finance teams are starting to ask a question their AI counterparts can't always answer — what did we actually get for this?
Jason Ambrose, CEO of Backstory, sits down with Mike Vizard to walk through what he calls the coming tokenocalypse. His argument is that enterprises are about to hit the same wall cloud spending hit a decade ago, just compressed into a much shorter window. Unchecked consumption, opaque billing units and a long tail of small teams quietly running their own AI workloads add up to a cost base that nobody owns and nobody can predict.
Tokenomics is becoming a real discipline inside engineering and finance. That means routing requests to the right model for the job instead of defaulting to the most expensive one, treating context windows as a budgeted resource, and building the kind of unit-economics view that ties spend to a specific business outcome rather than a vague productivity claim. AI FinOps is showing up in org charts for the same reason cloud FinOps did — because the alternative is a perpetual surprise on the invoice.
The harder conversation is governance. Ambrose makes the case that specialist AI platforms, clear ROI thresholds and a single accountable owner for AI spend have to be in place before enterprises scale further. Otherwise the next phase of adoption isn’t going to look like scaling — it’s going to look like a forced rationalization, with finance pulling the plug on initiatives that never had a defensible business case in the first place.

