Brief

Governance, Trust, and the Data Foundation

You can’t scale what you can’t govern—and you can’t govern what you can’t structure. Here’s how trust and data quality become the foundation for agentic AI at scale.

By Eric Sheng, Roger Zhu, Brendan O'Rourke, Dale Pedzinski, and Kevin Zhang

First published on April 14, 2026

At a Glance

Governance is a prerequisite for scaling agentic AI, not a compliance afterthought.
Agentic architectures must be designed for failure, not just performance.
Untapped, unstructured enterprise data can be transformed into governed, agent-ready assets.

This is part three of a four-part series on architecting for agentic AI.

In previous posts, we outlined the case for new architecture and a three-layer platform to support it. However, architecture is only as strong as the governance that protects it and the data that feeds it.

Governance and trust are the foundation of scale

Trust is the limiting factor for scaling AI, especially in regulated industries such as financial services, healthcare, and insurance. No matter how advanced the technology is, AI systems can’t scale without earning the trust of customers, employees, executives, and regulators. That requires governance.

With agentic AI, governance must expand beyond model outputs to the actions agents take. Permissions, tool access, and the decisions or transactions they can execute must fit within predetermined parameters.

Governance also depends on strong security foundations, including protection against new attack vectors such as prompt injection, data poisoning, and denial-of-wallet attacks. It also requires identity and authentication models that extend least-privilege principles to autonomous agents, not just human users.

This is a fundamental shift. Legacy identity and access management (IAM) systems were designed around human sessions and role-based access. Agentic architectures require contextual, runtime policy enforcement, where the enforcement layer evaluates permissions based on agent identity, session context, data sensitivity, and the specific tool being invoked. That evaluation must happen at the moment of execution, not at login.
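To make the idea of execution-time enforcement concrete, here is a minimal sketch in Python. Everything in it is an illustrative assumption rather than a reference to any real IAM product: the `ToolCall` and `AgentPolicy` types, the `POLICIES` table, the sensitivity levels, and the agent and tool names are all hypothetical. The point is only that each tool call is checked against agent identity, session context, data sensitivity, and the tool itself, and that anything not explicitly allowed is denied.

```python
from dataclasses import dataclass

# Hypothetical sensitivity levels, ordered from least to most sensitive.
SENSITIVITY_RANK = {"public": 0, "internal": 1, "confidential": 2}


@dataclass(frozen=True)
class ToolCall:
    agent_id: str          # identity of the calling agent
    tool: str              # tool the agent wants to invoke
    data_sensitivity: str  # classification of the data the call touches
    session_purpose: str   # why the session was opened (session context)


@dataclass(frozen=True)
class AgentPolicy:
    allowed_tools: frozenset
    max_sensitivity: str
    allowed_purposes: frozenset


# Illustrative policy table; a real platform would load this from a policy store.
POLICIES = {
    "claims-triage-agent": AgentPolicy(
        allowed_tools=frozenset({"read_claim", "summarize_document"}),
        max_sensitivity="internal",
        allowed_purposes=frozenset({"claims_intake"}),
    ),
}


def authorize(call: ToolCall) -> tuple[bool, str]:
    """Evaluate one tool call at execution time; deny by default."""
    policy = POLICIES.get(call.agent_id)
    if policy is None:
        return False, "unknown agent"
    if call.tool not in policy.allowed_tools:
        return False, "tool not permitted for this agent"
    if call.session_purpose not in policy.allowed_purposes:
        return False, "session context not permitted"
    if call.data_sensitivity not in SENSITIVITY_RANK:
        return False, "unknown data classification"
    if SENSITIVITY_RANK[call.data_sensitivity] > SENSITIVITY_RANK[policy.max_sensitivity]:
        return False, "data too sensitive for this agent"
    return True, "allowed"
```

In this sketch the check runs on every call, so a change to the policy table takes effect on the next tool invocation rather than at the next login.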

Industry View: Financial services

How should financial services firms build the data foundation and governance to scale agentic AI?

Financial services firms scale agentic AI by treating governance and data quality as the foundation, not a later fix. Trust carries extra weight in a regulated sector: Concerns about regulatory uncertainty and data quality are more prevalent among financial services firms than in less regulated industries. That's why explainability and behavioral observability belong inside agent systems from day one, to manage risk and hold customer trust.

The institutions pulling ahead are treating data as a product rather than a byproduct, assembling structured and unstructured sources into agent-ready assets. They embed observability, security, governance, and controls into the platform from the start, then extend the controls built for thousands of employees to thousands of AI agents.

The payoff is a self-funding path to scale, delivering value in a few domains early to fund the rest. Without those guardrails built in from the start, today's architectures can't safely support AI agents running in the thousands across the enterprise.

Evaluation becomes a first-class requirement for scale. Platforms should provide shared services to capture and govern traces, test agents against golden sets, and measure behavior across multistep operations and corner cases, not just single tasks. Results should be recorded and fed back into ongoing agent engineering processes using a mix of algorithmic scoring, LLM-as-a-judge approaches, and targeted human review.
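As a rough illustration of golden-set testing, the Python sketch below runs an agent against a small set of expected outcomes and records the failures. The golden cases, the `evaluate` function, and the `toy_agent` stand-in are hypothetical, and exact-match scoring is only the simplest form of algorithmic scoring; LLM-as-a-judge and human review would sit alongside it in practice.

```python
from typing import Callable

# Hypothetical golden set: each case pairs an input with the expected outcome.
GOLDEN_SET = [
    {"input": "refund request under $50", "expected": "auto_approve"},
    {"input": "refund request over $5,000", "expected": "escalate"},
    {"input": "address change", "expected": "auto_approve"},
]


def evaluate(agent: Callable[[str], str], golden_set: list) -> dict:
    """Run the agent on every golden case and record exact-match results."""
    failures = []
    for case in golden_set:
        actual = agent(case["input"])
        if actual != case["expected"]:
            failures.append(
                {"input": case["input"], "expected": case["expected"], "actual": actual}
            )
    total = len(golden_set)
    # An empty golden set proves nothing, so report a pass rate of 0.0.
    pass_rate = (total - len(failures)) / total if total else 0.0
    return {"pass_rate": pass_rate, "failures": failures}


def toy_agent(text: str) -> str:
    """Stand-in for a real agent: escalates any request mentioning 'over'."""
    return "escalate" if "over" in text else "auto_approve"


report = evaluate(toy_agent, GOLDEN_SET)
```

The recorded failures are the part that feeds back into agent engineering: each entry names the input, the expected outcome, and what the agent actually did.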

When applications capture human feedback, it should be routed through the same platform so teams can inspect traces, debug issues, and continuously improve. This raises the bar for monitoring, since leaders need clear visibility into how agents behave in production, including where they draw context from and how they arrive at outcomes. This moves governance beyond a compliance checkbox, promoting it to a strategic enabler. It allows AI to move from experimental use cases to essential enterprise applications.

Many organizations still run siloed AI initiatives on fragmented systems, with business units operating on separate platforms, data pipelines, and model-serving endpoints. Oversight is inconsistent, and regulatory exposure is rising. In an agentic environment, this fragmentation becomes exponentially more costly.

Without a unified agent registry, centralized token management, and consistent schema and data contract governance, every new agent deployment compounds integration debt and erodes trust. Visibility gaps between siloed systems mean that no single team can trace an agent’s reasoning from prompt to tool invocation to final output. The result is uneven controls, duplicated compliance efforts, and a path from pilot to production that grows longer with each new use case. That pattern is impractical to sustain, and regulators and boards increasingly expect clear end-to-end traceability.

As AI becomes more autonomous and interconnected, governance must be designed in lockstep with architecture. Embedding policy enforcement, monitoring, and audit controls directly into the platform at inception helps make governance continuous and allows it to keep pace as agents take on more work. Organizations that design governance into their technology foundations will be better positioned to scale agentic AI responsibly and with confidence as adoption grows.

Industry View: Healthcare & Life Sciences

Why does scaling AI in healthcare depend on the data foundation and governance?

Across healthcare and life sciences, a strong data foundation is nonnegotiable for moving AI from pilots to production at scale. A Bain survey shows that 84% of successful AI scalers in pharma and medtech say data quality, pipelines, and governance are critical to building AI foundations.

Yet among payers, providers, and pharma, only 30% of completed proof-of-concept projects reach production, and AI efforts rarely stall on model performance alone. In pharma, nearly half of companies cite data readiness as a top obstacle.

To move from experimentation to AI at scale, health systems can take stock of whether they have the clinical and operational data infrastructure, as well as the governance safeguards, for responsible deployment. Similarly, pharma companies can make data AI-ready by standardizing it across systems, adding missing metadata, and creating a governed path for clean new data. After starting with two or three high-value workflows, they can build orchestrated AI systems with humans in the loop.

Designing for governance also means designing for failure. In production, agents will misbehave—generating noncompliant outputs, exceeding latency thresholds, or producing results that drift from expected quality benchmarks. Mature agentic architectures account for this with several key protocols:

runtime circuit breakers that isolate a failing agent without taking down the broader workflow;
automated rollback triggered by SLO regression on latency, error rates, or output quality scores; and
manual kill switches that give operators the ability to pull an agent offline when it exhibits noncompliant behavior.

Additionally, blast-radius containment is enforced by agent-isolation boundaries within the service mesh to prevent a single agent’s failure from cascading across interconnected workflows.
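The circuit breaker and kill switch described above can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions: the `CircuitBreaker` class, its `failure_threshold` parameter, and the `fallback` callable are hypothetical names, and the sketch omits the timed reset or half-open state that production breakers usually include.

```python
class CircuitBreaker:
    """Isolate a failing agent after too many consecutive failures."""

    def __init__(self, failure_threshold: int = 3):
        self.failure_threshold = failure_threshold
        self.consecutive_failures = 0
        self.open = False    # open circuit: the agent is isolated
        self.killed = False  # manual kill switch, set by an operator

    def kill(self) -> None:
        """Operator action: take the agent offline immediately."""
        self.killed = True

    def call(self, agent, request, fallback):
        """Route a request to the agent, or to the fallback if it is isolated."""
        if self.killed or self.open:
            return fallback(request)
        try:
            result = agent(request)
        except Exception:
            self.consecutive_failures += 1
            if self.consecutive_failures >= self.failure_threshold:
                self.open = True
            return fallback(request)
        # A success resets the failure streak.
        self.consecutive_failures = 0
        return result
```

Because every request still gets an answer from the fallback, the surrounding workflow keeps running while the failing agent is cut off.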

What happens after an incident is equally important. Forensic access to an agent’s reasoning logs—with end-to-end records from prompt to tool invocation to final output—enables teams to diagnose root causes and refine guardrails rather than simply restarting a black box. System logs across the enterprise technology stack also matter; they allow agent events to be correlated with application, data, and infrastructure activity to provide a true end-to-end view.

In scenarios where confidence scores drop below defined thresholds, the architecture should support graceful degradation: falling back to human-in-the-loop review or simpler rule-based automation until the agent can be revalidated. These aren’t edge cases. They are the operational realities that separate enterprises running AI at scale from those still running experiments.
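A minimal Python sketch of that degradation path follows. The threshold value, the `Decision` type, and the `route` function are assumptions made for illustration, as is the ordering in which rule-based automation is tried before human review; an organization might reasonably reverse it.

```python
from dataclasses import dataclass
from typing import Optional

CONFIDENCE_THRESHOLD = 0.8  # hypothetical value; set per use case


@dataclass
class Decision:
    outcome: str
    handled_by: str  # "agent", "rules", or "human"


def route(agent_outcome: str, confidence: float, rule_outcome: Optional[str]) -> Decision:
    """Accept the agent's answer only when its confidence clears the threshold."""
    if confidence >= CONFIDENCE_THRESHOLD:
        return Decision(agent_outcome, "agent")
    if rule_outcome is not None:
        # A simpler deterministic rule covers this case.
        return Decision(rule_outcome, "rules")
    # Nothing automated is trusted here, so queue the case for a person.
    return Decision("pending_review", "human")
```

Recording `handled_by` on every decision also gives operators a simple measure of how often the agent is being bypassed, which is one signal that it needs revalidation.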

Unstructured data: from dark asset to strategic advantage

Most enterprise data is unstructured and largely untapped. Of the documents, emails, transcripts, images, PDFs, and other raw materials of business knowledge, very little is actively accessed by enterprise-level AI. That’s a missed opportunity, and arguably a growing liability, as agentic and generative AI rely on context-rich, high-quality input.

The cost of ignoring unstructured data is rising. In an agentic AI architecture, the data platform must do more than store information. It must make unstructured content usable and trustworthy at runtime. It does this through preprocessing pipelines that handle optical character recognition, metadata extraction, and multimodal content, followed by chunking, embedding, and indexing into vector and graph stores that agents query in real time. These techniques turn human-centric content into agent-ready assets, allowing agents to discover, retrieve, deduplicate, and reason over the right context for the task at hand.
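The chunking step in such a pipeline can be illustrated with a short Python sketch. It assumes text has already been extracted (for example, by OCR) and splits it into overlapping word windows, attaching the metadata an index would need. The `chunk_text` function, its parameters, and the field names are hypothetical; embedding and indexing are left out because they depend on the chosen model and store.

```python
def chunk_text(text: str, doc_id: str, chunk_size: int = 50, overlap: int = 10) -> list:
    """Split text into overlapping word windows, each tagged with its source."""
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        window = words[start:start + chunk_size]
        chunks.append(
            {
                "doc_id": doc_id,             # lets an agent cite the source
                "chunk_index": len(chunks),   # position within the document
                "start_word": start,          # offset for tracing back to the text
                "text": " ".join(window),
            }
        )
        if start + chunk_size >= len(words):
            break  # the last window already reached the end of the document
    return chunks
```

The overlap keeps a sentence that straddles a boundary retrievable from at least one chunk, at the cost of some duplicated text in the index.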

Industry View: Insurance

What does insurance need to scale agentic AI responsibly?

To scale agentic AI responsibly, insurers need governance over what agents are allowed to do and a data foundation that makes their unstructured content usable and trustworthy. Insurers depend on unstructured data, expert knowledge, and complex documentation. Early efforts in the sector focus on data readiness and governance foundations rather than broad transformation.

Agentic AI provides the opportunity to reimagine entire customer journeys—in auto claims, for example, that's submission, damage assessment, settlement, and fulfillment, handled end to end. But trust is the limiting factor for scaling AI, especially in regulated industries such as insurance, and the flood of use-case requests can overwhelm risk and control groups without a clear way to categorize and mitigate the risks.

So the winners are deliberate. They build governance into the platform from the start, with policy enforcement, monitoring, and audit controls, so that as agents take on more of the claims workflow, every decision stays traceable, and the path from pilot to production doesn't stretch with each new use case.

Without the right metadata and structure, even sophisticated AI models fall prey to hallucinations, inaccuracies, and missed insights. The “garbage in, garbage out” problem is alive and well. The solution is intelligent data foundations.

Leading organizations are now building these into the data layer of their enterprise technology stack, using platforms that can:

enrich metadata automatically to tag, classify, and contextualize content;
map relationships across unstructured assets to create knowledge graphs; and
enforce governance such as masking, retention, and access controls.
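As one small example of the governance capability in that list, the Python sketch below masks two kinds of sensitive values before text reaches an agent. The patterns and the `mask` function are illustrative assumptions: they cover only simple email addresses and US Social Security number formats, and a real platform would rely on a maintained classification service rather than two regular expressions.

```python
import re

# Illustrative patterns only; real masking needs far broader coverage.
MASKING_RULES = [
    (re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"), "[EMAIL]"),
    (re.compile(r"\b\d{3}-\d{2}-\d{4}\b"), "[SSN]"),
]


def mask(text: str) -> str:
    """Replace each match of a masking rule with its placeholder."""
    for pattern, placeholder in MASKING_RULES:
        text = pattern.sub(placeholder, text)
    return text


masked = mask("Contact jane.doe@example.com, SSN 123-45-6789, about the claim.")
```

Applying the rules at the data layer, rather than in each agent, means every use case built on the same foundation inherits the same masking.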

These capabilities turn scattered documents into AI-ready assets. They also reduce friction for scaling agentic AI, enabling teams to reuse the same governed, searchable foundation across multiple use cases instead of rebuilding pipelines each time. Knowledge routing directs each agent to the right data source at the right time, improving compliance, delivering better insights, and building a smarter foundation for automation and decision making.

Treating unstructured data as a strategic resource is a core requirement of the modern enterprise technology platform—and essential to unlocking the next wave of enterprise-level AI performance.

With governance and data foundations in place, the question becomes: How do you get there? In the final post of this series, we will lay out a phased roadmap for bringing agentic architecture into production.