Brief

The New AI Stack: Speed, Scale, and Real-World ROI
en
Listen to this article

Amazon Web Services’ annual cloud conference this past week provided further evidence that artificial intelligence has moved from experimental tools to core enterprise infrastructure. The sessions at AWS re:Invent 2025 in Las Vegas demonstrated how cloud and AI advances are translating into cost savings and productivity gains at scale.

For companies to capture this value, the path is twofold: strategic investment in the right technologies (agents, custom silicon, tailored models) and redesigning organizational processes around these new capabilities. The early leaders recognize that all of this adds up to a complex business transformation that must be managed effectively in order to deliver AI’s full benefits at scale.

Reflecting on the keynotes, breakouts, and client conversations, four themes stood out.

Agentic AI is making real progress

AWS signaled confidence that fully autonomous AI agents are ready to deliver business value, even creating a vice president of agentic AI role.

In an evolution from consumer AI chatbots, these enterprise agents can securely tap company data and tools, acting like a digital coworker. The new Amazon Quick Suite connects across internal wikis, documents, and applications. Employees can ask natural-language questions and receive comprehensive answers or execute tasks, all with enterprise-grade security. AWS said one customer reduced average service ticket handling time by 80%, saving 24,000 hours annually.

For key use cases, AWS unveiled “frontier agents,” pre-built autonomous AI agents that serve as virtual team members for technical domains such as coding, cybersecurity, and DevOps. These agents can work for hours or days without intervention. By offloading routine coding, code reviews, or incident prevention, such agents can significantly boost developer productivity and consistency.

Business leaders are eager but cautious. To deploy AI agents at scale, they need systems for clear accountability and predictable behavior. The most effective organizations will treat AI agents as a new class of employees that require training, rules, and supervision to truly become strategic assets instead of risks. Unlike traditional IT systems, most AI agents lack a unified “control tower” interface for non-technical managers. Until vendor solutions mature in areas such as agent monitoring and auditability, C-level executives should proactively invest in talent and processes that govern AI. Already, new roles in agent governance are emerging.  

ROI is now front and center

ROI was a common thread across AWS announcements, with new tools to improve customization, flexibility, and cost-effective scaling of AI agents and cloud systems.

Agents. Enterprises no longer need to spend months building their agent stack from scratch. Cloud providers are offering foundations and open-source frameworks to help companies deploy AI agents faster, with flexibility to bring their preferred models or developer tools to avoid vendor lock-in.

Models. Cloud providers are offering a buffet of models and the means to tailor them so that enterprises can get the performance they need at the right price. AWS has added 18 new open-weight, third-party AI models to its Bedrock platform over the past year, all accessible via a unified API. This selection lets enterprises experiment and swap models without rewriting code, accelerating evaluation of which model best fits a use case.

This also makes it easier to create smaller, specialized models that outperform generic large models in specific domains. For example, reinforcement fine-tuning lets developers apply reward-based feedback to train models more simply, with reported 66% average accuracy gains over base models. Serverless model customization allows teams to launch fine-tuning jobs without provisioning infrastructure, cutting iteration cycles from months to days.

This shift—from relying solely on giant pre-trained models to using fine-tuned, domain-specific small models—means models not only run cheaper and faster, but they can also be deployed on private infrastructure for better control and data privacy.

Going forward, tech strategy will likely include continuously scanning new model options and having a pipeline to customize or switch models as better ones emerge.

Silicon and hardware. AWS’ latest Trainium3 chips promise three times higher throughput per chip and four times faster model response times, with some customers reporting up to 50% cost reductions for training and inference. AWS also signaled that its next-generation chip will have greater compatibility with third-party ecosystems, hinting at more heterogeneous cloud infrastructure.

For enterprises, advances in silicon and other hardware translate to more AI workload headroom at lower unit cost. This enables companies to tackle ambitious AI projects (like real-time vision or large-scale simulations) that previously might have been cost-prohibitive. CIOs and chief technology officers (CTOs) should evaluate how new chips and instance types can reduce AI infrastructure spending or enable new use cases.

Beyond raw performance, AWS stressed flexibility in cost-performance trade-offs to align with business goals (whether that means cutting cloud bills, improving model accuracy, or minimizing carbon footprint). In practice, this is enabled by multiple model and chip options, and tools like Nova Forge that let companies blend proprietary data into models.

Bringing cloud AI to you

In a notable shift, AWS acknowledged that not all AI workloads will live in public cloud data centers. AWS’ new AI Factories bring AWS hardware and services into customers’ own data centers, allowing organizations with data sovereignty or latency requirements to access state-of-the-art AI infrastructure on their own turf while still benefiting from AWS management and support. This should appeal particularly to government clients and large enterprises in highly regulated industries, or really any company that needs to keep sensitive data in-house.

Manage the business transformation, not just the tech

AWS’ focus on AI-powered modernization (even citing a $2.4 trillion annual tech debt estimate across industries) demonstrates a refreshing emphasis on practical enterprise challenges. The hardest part won’t be running the tools but driving the business transformation required to benefit from AI at scale.

Many legacy systems underpin critical business processes, so any AI-led refactoring or migration needs rigorous validation. Clients must prepare for substantial testing, and possibly retraining staff, to ensure AI-generated fixes or optimized code don’t introduce new errors or compliance issues. We suggest a cautious but proactive approach.

  • Identify low-risk, high-pain areas (say, a module that’s costly to maintain but not customer-facing) and pilot the AI modernization there to get a feel for its accuracy and limits.
  • Engage your seasoned developers and domain experts to review AI outputs—their expertise is crucial to verify accuracy.
  • Consider the people impact. Your organization might need to invest in upskilling and changing development workflows to capture even a fraction of potential efficiency gains.
  • More broadly, enterprises can only achieve full potential if they consider truly transformative ways of conducting their business using AI. The bottom line is this is a major business transformation and needs to be approached and managed as such.

By pairing tool adoption with training and change management, executives can help their teams avoid the hype trap and instead achieve sustainable improvements. The most successful organizations are using AI to accelerate modernization and innovation while keeping humans in the loop to ensure the transformed systems truly align with business needs and risk thresholds. But the emerging leaders recognize this is only the start; the greatest opportunity lies in using AI to deliver more transformative change across the business. That’s where many leading organizations are now focusing.

Tags

Ready to talk?

We work with ambitious leaders who want to define the future, not hide from it. Together, we achieve extraordinary outcomes.