Axios C-Suite: Why the token is your future cost-driver

Jim VandeHeiAdd Axios as your preferred source tosee more of our stories on Google.Illustration: Gabriella Turrisi/AxiosYou soon won't just be paying for software — you'll also foot the bill for the AI thinking your AI-powered software does.It's worth 60 seconds to understand what that actually means because it will start showing up in your finance reviews.Why it matters: Per-seat pricing was predictable, but it's about to become a relic. In the AI era, costs will increasingly be based on how many tokens you burn through.Tokens, at their simplest: the chunk of text AI reads or writes. Verbose AI uses more tokens, making it more expensive.Three things to familiarize yourself with ASAP to control verbosity and cost:Routing: Ask easier questions to cheaper AI models (open-source models, lighter versions of Claude and ChatGPT), harder ones and more complex tasks to advanced frontier models (Claude Opus 4.7, GPT-5.5 Thinking or Pro).Caching: Claude and ChatGPT can be configured to store repeated context — like system prompts or documents — server-side so you only pay full price once and a fraction of the cost on every subsequent call.Compressing: Every AI request drags the full conversation history along with it. Strip each request down to its shortest form before sending, paying only for what's essential.The bottom line: Unless you're encouraging "tokenmaxxing," your AI line item might outgrow every other software bill on your P&L if you're not careful.📈 If you're a CEO or on a CEO's team: Ask to join Jim's new weekly Axios C-Suite newsletter.

Axios C-Suite: Why the token is your future cost-driver

Other newsrooms on this story

Related reading

Axios C-Suite: Your agent prep kit

How the currency of the AI economy actually works

AI may never be as cheap as it is today

Why Anthropic Costs Are Unpredictable

Anthropic tightens limits on Claude subscriptions

Amazon’s Andy Jassy bets on $200bn AI spending drive to revive AWS