Skip to Content
  • Bureaux

    Bureaux

    Amérique du Nord et Amérique du Sud
    • Atlanta
    • Austin
    • Bogota
    • Boston
    • Buenos Aires
    • Chicago
    • Dallas
    • Denver
    • Houston
    • Los Angeles
    • Mexico City
    • Minneapolis
    • Monterrey
    • Montreal
    • New York
    • Rio de Janeiro
    • San Francisco
    • Santiago
    • São Paulo
    • Seattle
    • Silicon Valley
    • Toronto
    • Washington, DC
    Europe, Moyen-Orient et Afrique
    • Amsterdam
    • Athens
    • Berlin
    • Brussels
    • Copenhagen
    • Doha
    • Dubai
    • Dusseldorf
    • Frankfurt
    • Helsinki
    • Istanbul
    • Johannesburg
    • Kyiv
    • Lisbon
    • London
    • Madrid
    • Milan
    • Munich
    • Oslo
    • Paris
    • Riyadh
    • Rome
    • Stockholm
    • Vienna
    • Warsaw
    • Zurich
    Asie et Australie
    • Bangkok
    • Beijing
    • Bengaluru
    • Brisbane
    • Ho Chi Minh City
    • Hong Kong
    • Jakarta
    • Kuala Lumpur
    • Manila
    • Melbourne
    • Mumbai
    • New Delhi
    • Perth
    • Shanghai
    • Singapore
    • Sydney
    • Tokyo
    Voir tous les bureaux
  • Alumni
  • Presse
  • S’abonner
  • Contacter
  • Canada | Français

    Sélectionnez votre région et votre langue

    Global
    • Global (English)
    Amérique du Nord et Amérique du Sud
    • Brazil (Português)
    • Argentina (Español)
    • Canada (Français)
    • Chile (Español)
    • Colombia (Español)
    Europe, Moyen-Orient et Afrique
    • France (Français)
    • DACH Region (Deutsch)
    • Italy (Italiano)
    • Spain (Español)
    • Greece (Elliniká)
    Asie et Australie
    • China (中文版)
    • Korea (한국어)
    • Japan (日本語)
  • Saved items (0)
    Saved items (0)

    You have no saved items.

    Bookmark content that interests you and it will be saved here for you to read or share later.

    Explore Bain Insights
  • Expertises Sectorielles
    Menu principal

    Expertises Sectorielles

    • Aerospace et Défense
    • Agroalimentaire
    • Automobile
    • Chimie
    • Infrastructures, BTP et Matériaux de Construction
    • Grande Consommation
    • Services Financiers
    • Santé
    • Engins & Equipements Industriels
    • Media et Divertissement
    • Metals
    • Mining
    • Pétrole & Gaz
    • Papier et Emballage
    • Private Equity
    • Secteur Public
    • Distribution
    • Technologie
    • Télécommunications
    • Transportation
    • Travel & Leisure
    • Utilities & Energies Renouvelables
  • Expertises Fonctionnelles
    Menu principal

    Expertises Fonctionnelles

    • Expérience Client
    • ESG
    • Innovation
    • Fusions et Acquisitions
    • Opérations
    • People & Organization
    • Private Equity
    • Sales & Marketing
    • Stratégie
    • IA, Perspectives et Solutions
    • Technology
    • Transformation
  • Digital
  • Points de Vue
  • About
    Menu principal

    About

    • Notre Activité
    • Nos Valeurs
    • Nos Collaborateurs et Notre Équipe Dirigeante
    • Notre Impact
    • Prix & Récompenses
    • Partenariats Internationaux
    Further: Our global responsibility
    • Sustainability
    • Impact Social
    • World Economic Forum
    Learn more about Further
  • Carrières
    Menu principal

    Carrières

    • Rejoignez-nous
      Carrières
      Rejoignez-nous
      • Trouvez votre place
      • Nos domaines d’expertise
      • Equipes multidisciplinaires
      • Étudiants
      • Stages et programmes
      • Événements de recrutement
    • La vie chez Bain
      Carrières
      La vie chez Bain
      • Blog: Dans les coulisses de Bain
      • Récits de carrière
      • Nos collaborateurs
      • Nos bureaux
      • Soutenir votre évolution professionnelle
      • Groupes d’affinités
      • Avantages chez Bain
    • Histoires d’impact
    • Notre processus de recrutement
      Carrières
      Notre processus de recrutement
      • Ce que vous pouvez attendre
      • Entretiens
    Trouver un poste
  • Bureaux
    Menu principal

    Bureaux

    • Amérique du Nord et Amérique du Sud
      Bureaux
      Amérique du Nord et Amérique du Sud
      • Atlanta
      • Austin
      • Bogota
      • Boston
      • Buenos Aires
      • Chicago
      • Dallas
      • Denver
      • Houston
      • Los Angeles
      • Mexico City
      • Minneapolis
      • Monterrey
      • Montreal
      • New York
      • Rio de Janeiro
      • San Francisco
      • Santiago
      • São Paulo
      • Seattle
      • Silicon Valley
      • Toronto
      • Washington, DC
    • Europe, Moyen-Orient et Afrique
      Bureaux
      Europe, Moyen-Orient et Afrique
      • Amsterdam
      • Athens
      • Berlin
      • Brussels
      • Copenhagen
      • Doha
      • Dubai
      • Dusseldorf
      • Frankfurt
      • Helsinki
      • Istanbul
      • Johannesburg
      • Kyiv
      • Lisbon
      • London
      • Madrid
      • Milan
      • Munich
      • Oslo
      • Paris
      • Riyadh
      • Rome
      • Stockholm
      • Vienna
      • Warsaw
      • Zurich
    • Asie et Australie
      Bureaux
      Asie et Australie
      • Bangkok
      • Beijing
      • Bengaluru
      • Brisbane
      • Ho Chi Minh City
      • Hong Kong
      • Jakarta
      • Kuala Lumpur
      • Manila
      • Melbourne
      • Mumbai
      • New Delhi
      • Perth
      • Shanghai
      • Singapore
      • Sydney
      • Tokyo
    Voir tous les bureaux
  • Alumni
  • Presse
  • S’abonner
  • Contacter
  • Canada | Français
    Menu principal

    Sélectionnez votre région et votre langue

    • Global
      Sélectionnez votre région et votre langue
      Global
      • Global (English)
    • Amérique du Nord et Amérique du Sud
      Sélectionnez votre région et votre langue
      Amérique du Nord et Amérique du Sud
      • Brazil (Português)
      • Argentina (Español)
      • Canada (Français)
      • Chile (Español)
      • Colombia (Español)
    • Europe, Moyen-Orient et Afrique
      Sélectionnez votre région et votre langue
      Europe, Moyen-Orient et Afrique
      • France (Français)
      • DACH Region (Deutsch)
      • Italy (Italiano)
      • Spain (Español)
      • Greece (Elliniká)
    • Asie et Australie
      Sélectionnez votre région et votre langue
      Asie et Australie
      • China (中文版)
      • Korea (한국어)
      • Japan (日本語)
  • Saved items  (0)
    Menu principal
    Saved items (0)

    You have no saved items.

    Bookmark content that interests you and it will be saved here for you to read or share later.

    Explore Bain Insights
  • Expertises Sectorielles
    • Expertises Sectorielles

      • Aerospace et Défense
      • Agroalimentaire
      • Automobile
      • Chimie
      • Infrastructures, BTP et Matériaux de Construction
      • Grande Consommation
      • Services Financiers
      • Santé
      • Engins & Equipements Industriels
      • Media et Divertissement
      • Metals
      • Mining
      • Pétrole & Gaz
      • Papier et Emballage
      • Private Equity
      • Secteur Public
      • Distribution
      • Technologie
      • Télécommunications
      • Transportation
      • Travel & Leisure
      • Utilities & Energies Renouvelables
  • Expertises Fonctionnelles
    • Expertises Fonctionnelles

      • Expérience Client
      • ESG
      • Innovation
      • Fusions et Acquisitions
      • Opérations
      • People & Organization
      • Private Equity
      • Sales & Marketing
      • Stratégie
      • IA, Perspectives et Solutions
      • Technology
      • Transformation
  • Digital
  • Points de Vue
  • Carrières
    Recherches les plus fréquentes
    • Agile
    • Digital
    • Stratégie
    Vos recherches précédentes
      Pages récemment visitées

      Content added to saved items

      Saved items (0)

      Removed from saved items

      Saved items (0)

      Etude

      How Token Economics Will Change Opex

      How Token Economics Will Change Opex

      As the mix shifts from headcount to tokens, leaders must manage the costs of two economic models at once.

      Par Jue Wang, Anne Hoecker, Ann Bosche, Tamara Lewis, et Peter Bowen

      • min
      }

      Etude

      How Token Economics Will Change Opex
      en
      En Bref
      • The opex mix is shifting from headcount to tokens, but organizations face different bottlenecks at each level.
      • Despite falling model prices, total spending remains stubbornly high as usage scales, complexity increases, and enterprises default to premium frontier models.
      • The winners will actively manage token economics, rightsizing models, instrumenting cost per task, and redesigning operating models to move faster than the technology curve.

      In part I, we laid out the provocation that the opex mix is shifting from headcount to tokens and that there is no clear path of transition.

      In software engineering, the domain furthest along in AI enablement, spending on tokens is only about 1% to 2% the cost of headcount. The pattern is the same across sales, support, strategy, and ops: at 1% and talking about getting to 20% to 30% (see Figure 1). But where we are looks radically different depending on which industry you’re in and where you sit within the organization. Three levels, three different problems, one transformation.

      Figure 1
      The cost of agents, tokens, and data could replace 20% to 30% of today’s headcount operating expenses
      visualization
      Source: Bain & Company

      At the CEO level, the destination is clear. Intuit’s CEO has set a vision to become a builder powerhouse by tripling developer productivity. Across our tech-forward client base, the mindset has flipped from “you helping AI” to “AI helping you.” Alpha teams redesign the workflow first, prove the velocity gain, then expand to the broader organization. These aren’t experiments; they’re mandates. And the leaders setting them aren’t debating the returns on investment. They’re impatient with the pace of organizational change beneath them.

      At the general manager (GM) level, it’s a budget-and-speed problem. Where do I find the unbudgeted millions this quarter? Token spending doesn’t fit an existing line item and requires an approval chain that doesn’t exist. The pilot was stellar, but scaling to 10,000 seats means navigating procurement, security, legal, data, and IT cycles that were built for an annual cadence. The GM knows the destination, but the path is littered with quarterly budget reviews, vendor approvals, and organizational redesigns that can’t happen fast enough.

      At the individual level, there’s a chasm between the heroes and everyone else. Early data suggests that the top 5% of users in each company often consume more tokens than the other 95% combined. In some cases, senior staff engineers, chief architects, sales rainmakers, and strategy directors say, “I don’t need a team; they only slow me down.” It’s unclear how we get everyone to cross the chasm or how to manage superusers’ costs without slowing them down.

      The CEO sees the destination. The general manager sees the obstacles. The individual sees a tool that works for some people but not yet for them. Same transformation, three different bottlenecks.

      The cost-per-task paradox

      Here’s what makes this truly uncomfortable: In some domains, agent and token costs are already more expensive than offshore human resources. Not everywhere: These are still sparse domains. But they exist. Agents consume significant tokens on multistep reasoning, error correction, and context loading, which add up fast on complex workflows. The speed and quality gains are real, but the per-task economics don’t always pencil out, particularly when done for the wrong tasks or without the appropriate orchestration. The question is where you’re paying a premium for capability vs. where you should be getting a cost arbitrage.

      So, will the cost come down? The hope is obvious: Model prices are falling roughly 10 times per generation. But here’s the reality of what we’re actually seeing: The effective cost per task is often staying flat. Why? Three forces are working against the headline price declines. First, everyone stays on frontier. When the next Claude or GPT ships, nobody says, “Great, I’ll keep using the old model and pocket the savings.” They upgrade. Last generation gets cheaper; frontier stays expensive (see Figure 2). Second, tokens per query keep climbing as agents take on more complex, multistep work, such as orchestrating tool calls, correcting errors, and loading context, with more advanced models consuming more tokens on more difficult problems. Third, usage expands: Once a team discovers what agents can do, they find 10 more workflows to throw at them.

      Figure 2
      Each performance tier gets less expensive, but users shift to premium frontier models, keeping overall spending levels largely stable

      Note: MMLU-Pro is massive multitask language understanding, a benchmark used to evaluate large language models

      Source: Bain analysis

      The trend is more nuanced than the headline. The cost of tokens (typically measured in millions) fell by half from December 2024 to December 2025 while tokens consumed grew by 4.5 times over the same period (see Figure 3).

      Figure 3
      Average cost per token is falling, but token use is rising faster
      visualization
      visualization
      Sources: Wells Fargo October 2025; Bain analysis

      Net-net: The models get less expensive per token, the usage gets heavier per task, and the bill stays stubbornly high. The hope is that this is like 2G/3G costs circa 2009: expensive now, destined to plummet. And maybe it is. But every six months feels like it should be the inflection point. It never quite is.

      The models get cheaper. The usage gets heavier. The bill stays stubbornly high.

      What would swing it

      The 70/30 hypothesis, reflecting the cost of headcount and cost of tokens, is a scenario, not a forecast. Whether it materializes and how fast both depend on a handful of variables that interact in nonlinear ways. The range of outcomes is enormous. AT&T learned this firsthand: At 8 billion tokens a day, the company said it reorchestrated so that large "super agents" route tasks to smaller, domain-specific worker models instead of pushing everything through frontier. The company reported a 90% cost reduction and three times throughput, not by using less AI but by right-fitting the model to the job. One architectural decision moved the economics by an order of magnitude. That's the kind of nonlinearity we're talking about. Here are some of the variables we watch most closely.

      • Token cost trajectory: This is the most obvious lever. If frontier pricing falls by a factor of 10 per generation cycle, the economics tip fast. But the pattern so far is that the last generation gets cheaper while frontier stays expensive, and tokens per task scale with complexity.
      • On-premise inference: This could be the wildcard, shifting cost from opex to capex, but few enterprises have even evaluated the true cost.
      • Rightsizing models to tasks: The three variables that matter are cost per token (model complexity), number of tokens (task complexity), and return (value of output). A draft email doesn’t need frontier reasoning; a customer-facing financial model does. Organizations that flex model complexity to the task will have fundamentally different economics than those that run everything on the most powerful model available. Most enterprises haven't started this work.
      • Enterprise software-as-a-service (SaaS) lock-in: Salesforce, ServiceNow, Workday, and SAP control the data fabric. How quickly they open agent-friendly APIs (vs. defending the moat) governs how fast the mix can shift. This is the variable that is most outside your control and the one most likely to move suddenly when a major vendor breaks ranks.
      • Organizational clock speed: Annual planning cycles don’t accommodate weekly capability shifts. The companies that move fastest will decouple organizational design cadence from the fiscal calendar. Organizational design and technology-led transformation have to move in lockstep. Annual cycles don't cut it.
      • Regulatory and trust thresholds: An agent that can do the work of a compliance analyst still needs a human to sign the attestation. How regulators adapt and how quickly institutional trust in AI-generated outputs builds will determine the ceiling.
      • Quality at scale: Agents work brilliantly in pilots. At 1,000 concurrent production workflows with cascading errors, the picture changes. Until enterprise-grade reliability is proven, the human backstop stays.

      The 70/30 scenario on headcount and token cost isn’t a forecast; it’s a stress test. Which assumptions are you betting on?

      Navigating the token economics

      Uncertainty is real; it’s not an excuse for inaction. Here are five moves that hold up across scenarios.

      • Create a dedicated AI compute budget. Stop funding tokens from existing line items. Treat it like cloud migration spending in 2015: a protected transformation line with its own governance and far more control and visibility than most companies currently have. If GMs have to find it midquarter, it dies in every budget review. This is table stakes, and most haven’t done it.
      • Portfolio your model spending. Frontier for high-stakes work. Last generation or open weight for volume. Using one model for everything is flying first class for every trip. We see three to five times cost differences between companies that model-match and those that don’t. Pull last month's API invoices. What percentage of calls went to frontier models? For how many of those was frontier actually necessary?
      • Instrument your token economics. Most companies have no idea what they spend per task, per workflow, per outcome. Build the metering now. You need to know: What does it cost to generate a proposal? To resolve a Tier-1 ticket? To draft a contract? Without this data, you’re optimizing blind. Painful to do early. Impossible to retrofit.
      • Plan for the dual-cost transition. There will be a period during which you’re paying for both the legacy workforce and the scaling token bill. Know where that overlap is, and what the ROI commitments are. Build it into the board narrative now, not when the CFO notices.
      • Test on-premise. For highest-volume workflows, run open-weight models on your own silicon. Understand true cost (capex, energy, ops) vs. API pricing. This is potentially the biggest lever in the entire opex equation and the one most enterprises haven’t evaluated. Don’t wait for the bill to force your hand.

      The bottom line

      The opex shift from headcount to tokens isn’t a budget problem; it’s a structural transformation. The economics are unsettled, and the path is nonlinear. That's not a reason to wait. It's the reason to start instrumenting now so that when the curve breaks, you're navigating with data instead of intuition.

      Next Monday: Pull your top 10 SaaS contracts and your token spend to date. Instrument one workflow end to end: What does it actually cost per task, per outcome? That’s the number nobody in your org knows yet. Once you have it, every other decision gets easier.

      Explore this series

      • The Future of Opex in the Agent Economy

      • How Token Economics Will Change Opex

      Auteurs
      • Headshot of Jue Wang
        Jue Wang
        Partner, Silicon Valley
      • Headshot of Anne Hoecker
        Anne Hoecker
        Partner, Silicon Valley
      • Headshot of Ann Bosche
        Ann Bosche
        Partner, San Francisco
      • Headshot of Tamara Lewis
        Tamara Lewis
        Partner, Seattle
      • Headshot of Peter Bowen
        Peter Bowen
        Partner, Austin
      Contactez-nous
      Synergies sectorielles
      • Software
      • Technologie
      Technologie
      How Will AI Change Software Organizations?

      Employees will be capable of doing more with AI agents—as long as the systems and culture support them.

      Voir plus
      Technologie
      Agentic AI and the New Complexity Trap in Enterprise Software

      Value will accrue to those who empower customers, rather than those who build the most complex solutions.

      Voir plus
      Technologie
      The Future of Opex in the Agent Economy

      AI’s real impact isn’t efficiency; it’s a fundamental rewrite of how decisions are made and work gets done.

      Voir plus
      Technologie
      The $100-Billion SaaS Opportunity Hiding in Cross-System Labor

      Agentic AI can automate much of the coordination work among systems, creating a large new market for software companies.

      Voir plus
      Technologie
      Will IT Accelerate or Stall Your AI Transformation?

      How CIOs can fund, scale, and modernize to keep pace with escalating AI demands.

      Voir plus
      First published in juin 2026
      Mots clés
      • Software
      • Technologie

      Comment nous avons aidé nos clients

      A Technology Company’s Transformation Restores Its Growth Trajectory

      Lire l’étude de cas

      Helping a Midsize ERP Player Compete against the Giants

      Lire l’étude de cas

      Aggressively growing an IT service provider with a high-performance culture

      Lire l’étude de cas

      Vous souhaitez continuer cette conversation ?

      Nous aidons des dirigeants du monde entier à matérialiser des impacts et des résultats pérennes et créateurs de valeur dans leurs organisations.

      Les points de vue de Bain : notre perspective sur des problématiques auxquelles sont confrontées les entreprises à travers le monde, envoyés chaque mois dans votre boîte de réception. 

      *J’ai lu la politique de confidentialité et j’accepte les conditions.

      Merci de lire notre politique de confidentialité.
      Bain & Company
      Contactez-nous Sustainability Accessibility Conditions d’utilisation Politique de Confidentialité Cookie Policy Mentions Légales Sitemap Log In

      © 1996-2026 Bain & Company, Inc.

      Contacter Bain

      Comment pouvons-nous vous aider ?

      • Business inquiry
      • Career information
      • Press relations
      • Partnership request
      • Speaker request
      Voir tous les bureaux