      Brief

      DeepSeek: A Game Changer in AI Efficiency?

      Here are some early implications for executives and investors.

      By Peter Hanbury, Jue Wang, Padraic Brick, and Alessandro Cannarsi

      DeepSeek, a Chinese AI start-up founded in 2023, has quickly made waves in the industry. With fewer than 200 employees and backed by the quant fund High-Flyer ($8 billion in assets under management), the company released its open-source model, DeepSeek R1, one day before the announcement of OpenAI’s $500 billion Stargate project.

      What sets DeepSeek apart is the prospect of radical cost efficiency. The company claims to have trained its model for just $6 million using 2,000 Nvidia H800 graphics processing units (GPUs), compared with the $80 million to $100 million cost of GPT-4 and the 16,000 H100 GPUs required for Meta’s Llama 3. While the comparisons are far from apples to apples, the possibilities are valuable to understand.

      DeepSeek’s rapid adoption underscores its potential impact. Within days, it became the top free app in US app stores, spawned more than 700 open-source derivatives (and growing), and was onboarded by Microsoft, AWS, and Nvidia AI platforms​.

      DeepSeek’s performance appears to be based on a series of engineering innovations that significantly reduce inference costs while also lowering training costs. Its mixture-of-experts (MoE) architecture activates only 37 billion out of 671 billion parameters for processing each token, reducing computational overhead without sacrificing performance. The company also has optimized distillation techniques, allowing reasoning capabilities from larger models to be transferred to smaller ones. By using reinforcement learning, DeepSeek enhances performance without requiring extensive supervised fine-tuning. Additionally, its multi-head latent attention (MLA) mechanism reduces memory usage to 5% to 13% of previous methods.
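The sparse-activation idea behind a mixture-of-experts layer can be sketched in a few lines. This is a minimal illustration of generic top-k gating, not DeepSeek's actual routing scheme; the toy experts, the expert count, and the function names (`route_top_k`, `moe_forward`) are assumptions made for the example. Only the 37 billion and 671 billion figures come from the article.

```python
import math
import random


def softmax(scores):
    """Convert raw router scores into probabilities."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(router_scores, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    probs = softmax(router_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


def moe_forward(x, experts, router_scores, k):
    """Run only the k selected experts and blend their outputs."""
    selected = route_top_k(router_scores, k)
    output = sum(weight * experts[i](x) for i, weight in selected)
    return output, selected


# Hypothetical toy setup: 8 experts, 2 active per token.
NUM_EXPERTS, TOP_K = 8, 2
experts = [lambda x, scale=s: scale * x for s in range(1, NUM_EXPERTS + 1)]
rng = random.Random(0)
router_scores = [rng.gauss(0, 1) for _ in range(NUM_EXPERTS)]

output, selected = moe_forward(1.0, experts, router_scores, TOP_K)
print("experts used for this token:", [i for i, _ in selected])

# Share of parameters active per token, using the article's figures.
active_fraction = 37 / 671
print(f"active share of parameters: {active_fraction:.1%}")
```

The remaining experts are never evaluated for that token, which is where the compute saving comes from: per-token work scales with the active parameters rather than the total.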

      Beyond model architecture, DeepSeek has improved how it handles data. Its use of FP8 mixed-precision computation cuts computational costs. An optimized reward function ensures compute power is allocated to high-value training data, avoiding wasted resources on redundant information. The company also has incorporated sparsity techniques, allowing the model to predict which parameters are necessary for specific inputs, improving both speed and efficiency.

      DeepSeek’s hardware and system-level optimizations further enhance performance. The company has developed memory compression and load balancing techniques to maximize efficiency. One novel optimization was programming in PTX instead of CUDA, giving DeepSeek engineers finer control over GPU instruction execution and enabling more efficient GPU usage. DeepSeek also improved communication between GPUs with the DualPipe algorithm, allowing GPUs to communicate and compute more effectively during training.
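To put the low-precision point in rough numbers, the sketch below estimates how much memory the model weights alone would occupy at different numeric formats. It is a back-of-the-envelope calculation under simplifying assumptions: it ignores activations, optimizer state, and the fact that mixed-precision training keeps some tensors in higher precision. The parameter counts come from the article; the helper name `weight_gb` is illustrative.

```python
# Parameter counts quoted in the article.
PARAMS_TOTAL = 671e9
PARAMS_ACTIVE = 37e9

# Storage size per parameter for common numeric formats.
BYTES_PER_PARAM = {"FP32": 4, "BF16": 2, "FP8": 1}


def weight_gb(num_params, bytes_per_param):
    """Weights-only footprint in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


for fmt, nbytes in BYTES_PER_PARAM.items():
    total = weight_gb(PARAMS_TOTAL, nbytes)
    active = weight_gb(PARAMS_ACTIVE, nbytes)
    print(f"{fmt}: {total:,.0f} GB for all weights, {active:,.0f} GB touched per token")
```

Halving the bytes per parameter halves both the storage and the data that must move through memory for each token, which is one reason lower-precision formats reduce cost.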

      So far, these results aren’t surprising; indeed, they track with broader trends in AI efficiency (see Figure 1). What is more surprising is that an open-source Chinese start-up has managed to close or at least significantly narrow the performance gap with leading proprietary models​.

      Figure 1
      AI inference costs have rapidly declined due to innovation, and DeepSeek follows this trend​

      Notes: Massive multitask language understanding (MMLU) measures how well a large language model (LLM) understands language and solves problems, with results reported by model providers or through external evaluations; the scores of 83 and 42 are performance benchmarks, with higher being better

      Sources: a16z; Bain analysis

      Skepticism and market impact

      Despite DeepSeek’s claims, several uncertainties remain. The true cost of training the model remains unverified, and there is speculation about whether the company relied on a mix of high-end and lower-tier GPUs. Questions have also been raised about intellectual property concerns, particularly regarding the sources and methods used for distillation. Some critics argue that DeepSeek has not introduced fundamentally new techniques but has simply refined existing ones. Nevertheless, boardrooms and leadership teams are now paying closer attention to how AI efficiency improvements could impact long-term investment plans and strategy (see Figure 2)​.

      Figure 2
      Several catalysts could feasibly offset efficiency gains and sustain current AI infrastructure investment levels

      Possible AI market scenarios

      DeepSeek’s impact could unfold in several ways.

      In a bullish scenario, ongoing efficiency improvements would lead to cheaper inference, spurring greater AI adoption, a pattern known as the Jevons paradox, in which cost reductions drive increased demand. While inference costs drop, high-end training and advanced AI models would likely continue to justify heavy investment, ensuring that spending on cutting-edge AI capabilities remains strong.
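The Jevons paradox can be illustrated with a simple constant-elasticity demand curve. This is a stylized sketch, not a forecast: the baseline price, baseline quantity, the 90% price drop, and the elasticity values are all hypothetical. The point is that total spending rises after a price cut only when demand is elastic (elasticity greater than 1).

```python
def demand(price, base_price, base_quantity, elasticity):
    """Constant-elasticity demand: quantity grows as price falls."""
    return base_quantity * (price / base_price) ** (-elasticity)


def total_spend(price, base_price, base_quantity, elasticity):
    """Total spending is price times the quantity demanded at that price."""
    return price * demand(price, base_price, base_quantity, elasticity)


# Hypothetical baseline: unit price 1.0, quantity 100, so spending is 100.
BASE_PRICE = 1.0
BASE_QUANTITY = 100.0
NEW_PRICE = 0.1  # hypothetical 90% drop in inference price

for elasticity in (0.5, 1.0, 1.5):
    spend = total_spend(NEW_PRICE, BASE_PRICE, BASE_QUANTITY, elasticity)
    direction = "rises" if spend > 100.0 + 1e-9 else "does not rise"
    print(f"elasticity {elasticity}: spending {direction} ({spend:.1f} vs. 100.0)")
```

In this framing, the bullish scenario assumes AI demand is elastic enough that cheaper inference expands total usage faster than unit prices fall.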

      A moderate scenario suggests that AI training costs remain stable but that spending on AI inference infrastructure decreases by 30% to 50%. In this case, annual capital expenditures per cloud service provider would fall from a range of $80 billion to $100 billion to a range of $65 billion to $85 billion, which, while lower than current projections, would still represent a twofold to threefold increase over 2023 levels.

      In a bearish scenario, AI training budgets shrink, and spending on inference infrastructure declines significantly. Capital expenditures for cloud providers could drop to between $40 billion and $60 billion, which, while lower than the moderate estimates, would still be 1.5 times to 2 times higher than 2023 levels.
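The capex ranges in the moderate and bearish scenarios can be compared against the current projection with simple arithmetic. The sketch below uses only the dollar ranges quoted above and compares low end with low end and high end with high end, which is an assumption about how the ranges line up rather than something the scenarios specify.

```python
# Annual capex ranges per cloud service provider, in billions of US dollars,
# as quoted in the scenarios above.
CURRENT_PROJECTION = (80, 100)
SCENARIOS = {
    "moderate": (65, 85),
    "bearish": (40, 60),
}


def pct_change(new, old):
    """Percentage change from old to new."""
    return (new - old) / old * 100


for name, (low, high) in SCENARIOS.items():
    low_change = pct_change(low, CURRENT_PROJECTION[0])
    high_change = pct_change(high, CURRENT_PROJECTION[1])
    print(f"{name}: {low_change:.1f}% at the low end, {high_change:.1f}% at the high end")
```

Read this way, the moderate case trims projected capex by roughly a sixth, while the bearish case cuts it by 40% to 50%.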

      Cutting through the noise

      Amid the speculation, some observations may help put events into context:

      • Significant leap, not surprising: Inference costs have been steadily declining, and DeepSeek’s innovations accelerate this trend rather than disrupt it entirely.
      • Don’t overreact: AI adoption will continue expanding robustly, though the pace and shape of investment may shift.
      • Inference is only one slice: The largest players are still racing to build next-generation models that unlock frontier applications and a bigger total addressable market.
      • Impact by segment: Expect an intensified arms race in the model layer, with open source vs. proprietary shaping up as a key battleground; short-term volatility but medium-term strength in data center hardware; and benefits for application players.
      • Energy demand: Near-term demand through 2030 is unlikely to change materially given power supply constraints; longer-term implications remain uncertain.

      Overall, demand for AI capabilities remains strong. Data centers, hardware providers, and AI application developers will continue evolving as efficiency improvements unlock new possibilities.

      The CEO playbook: What to do now

      For CEOs, the DeepSeek episode is less about one company and more about what it signals for AI’s future. The lesson is clear: The pace of AI innovation is rapid and iterative, and breakthroughs can come from unexpected places.

      Executives can take three key steps:

      • Avoid overreaction, but prepare for cost disruption. DeepSeek’s model may not be an existential threat to AI incumbents, but it highlights the rapid decline in AI costs. Businesses should plan for a world where AI inference is significantly cheaper, enabling broader adoption and new competitive dynamics.
      • Monitor market signals closely. Keep an eye on capex trends, GPU demand, and AI adoption rates. If infrastructure spending slows, it could indicate that efficiency gains are reshaping AI economics (see Figure 3). As enterprise AI adoption accelerates, businesses must move quickly to integrate AI into their core strategies​.
      • Think beyond productivity: AI as a business model catalyst. The real winners in AI will be those that use it to redefine their core offerings, not just cut costs. CEOs should push their organizations beyond automation and into AI-driven innovation, whether in product development, customer personalization, or entirely new services.
      Figure 3
      Key signposts to monitor in the coming months
      Authors
      • Peter Hanbury, Partner, San Francisco
      • Jue Wang, Partner, Silicon Valley
      • Padraic Brick, Partner, San Francisco
      • Alessandro Cannarsi, Partner, Milan
      Published in February 2025