Skip to main content

Regulator-Defensible Architecture

Three Pains Every Tier-1 CDO Is Failing Right Now

AI scale, regulator-defensibility, and continuous modernization are colliding into a single architectural problem.

··11 min read

Answer summary

Tier-1 enterprises are failing at enterprise AI scaling, regulator-defensible evidence production, and continuous modernization because current architectures still treat these as separate problems instead of one composed architectural constraint.

Key takeaways

  • Enterprise AI is blocked primarily at the data layer.
  • Regulatory evidence requirements now exceed most legacy architectures.
  • Migration has become the steady state.
  • Portability is an operational discipline, not a feature.
  • The next architectural category must solve all three pressures simultaneously.

TL;DR

  • Most Tier-1 enterprises in 2026 are dealing with three pressures at the same time: enterprise AI scale, regulator-defensible evidence, and continuous modernization.
  • These pressures are increasingly coupled. Treating them as separate programs often creates more architectural friction than it removes.
  • Enterprise AI is usually blocked less by model capability than by fragmented, weakly governed access to enterprise data.
  • Regulators are moving from governance intent toward operationally reproducible evidence.
  • Migration has become a permanent operating condition rather than a temporary transformation phase.
  • Open formats are useful, but they do not create portability by themselves. Portability requires testing, evidence continuity, and operational discipline.
  • The next generation of enterprise data architecture will likely be defined by how well it composes AI consumption, evidence production, and modernization into one coherent operating model.

Triangular diagram showing AI scale, regulatory defensibility, and continuous modernization converging into one composed architectural constraint.

Opening observation

A few months ago, I joined a discussion with architects and platform leaders at a large European financial institution. The topic was enterprise AI, although the conversation did not stay there for long.

The executive question was simple enough: where is our enterprise GenAI?

The first answers were familiar. The organization needed better copilots, stronger retrieval pipelines, more controlled vector infrastructure, clearer model governance, and perhaps additional compute capacity. None of these answers were wrong. They were also not sufficient.

As the discussion moved from models to data access, the architectural problem became more visible. The group could describe several AI initiatives, but it was much harder to answer which datasets were already feeding those systems, under what authority the data was being consumed, and whether lineage and policy enforcement could still be reconstructed after the next platform migration.

That pattern is now common enough that I no longer treat it as an AI problem alone. What initially appears as an enterprise GenAI discussion often turns into a broader architectural conversation about evidence, residency, portability, and runtime governance.

The same issue appears from different directions. The CDO asks how to scale AI safely. The CRO asks whether BCBS 239 evidence is defensible. The CIO asks how to keep modernizing without creating new lock-in. The legal and compliance functions ask whether data access can be explained under AI Act, DORA, or local residency constraints.

These are usually handled as separate workstreams. Operationally, they are becoming one problem.

The real problem

Most Tier-1 enterprises are struggling with three tests at the same time.

First, they cannot scale enterprise AI safely because the data access patterns underneath AI systems are still fragmented, inconsistently governed, and difficult to reconstruct later.

Second, they cannot produce regulator-defensible evidence consistently because lineage, policy decisions, residency assertions, and access records are often scattered across platforms and reconstructed after the fact.

Third, they cannot modernize their data estate fast enough without creating new forms of lock-in, evidence rupture, or governance gaps.

Large enterprises have always dealt with complexity, so the existence of these problems is not surprising. The more important observation is that the three pressures now interact. An AI initiative consumes data that must be governed under regulatory obligations. A modernization program changes the substrate on which evidence depends. A portability exercise becomes credible only if lineage and policy context survive the migration. A regulatory remediation program slows down AI delivery unless governance becomes part of the runtime architecture rather than a separate review process.

The architectures built over the last decade were rarely designed for this composition. Warehouse architectures optimized for centralized reporting and semantic control. Data lakes optimized for scale and flexibility. Lakehouses improved analytical convergence and operational reliability. Data mesh addressed ownership and organizational scalability. Data fabric improved metadata orchestration and visibility.

Each of these patterns solved real problems. The difficulty is that the current enterprise constraint is no longer one of those problems in isolation. It is the combined pressure of governed AI consumption, regulator-defensible evidence, and continuous modernization.

Pain one: enterprise AI is blocked at the data layer

The public discussion around enterprise AI still spends a great deal of attention on models. Inside large enterprises, the model is often not the binding constraint.

The more persistent bottleneck is the data layer: fragmented retrieval pipelines, scattered vector databases, inconsistent identity propagation, weak lineage, unclear lawful basis, and governance models that still assume human consumers using dashboards, APIs, or analytical tools.

A large enterprise can accumulate a surprisingly complex AI estate in less than eighteen months: multiple embedding services, several vector stores, duplicated retrieval pipelines, locally built copilots, and proof-of-concept systems that were never designed to become governed production services. Many of these systems work technically. Far fewer are operationally governable.

Once AI systems move beyond isolated pilots, the enterprise has to answer questions that older data architectures were not designed to answer continuously. Which dataset version was used? Which policy was active at the moment of access? Was the access authorized for the declared purpose? Were downstream restrictions respected? Can the organization reconstruct the chain of consumption later, especially if the data has moved to a different platform?

These questions are becoming less theoretical. The EU AI Act is moving data governance, transparency, and record-keeping obligations into operational reality. Financial regulators are increasingly focused on evidence rather than intent. Internal audit functions are asking how AI access to regulated data can be reconstructed and tested.

The architectural gap is that most enterprises still expose data to AI through patterns originally designed for human analysts and applications. AI agents behave differently. They are autonomous, composable, persistent, and capable of acting at machine speed. Some will operate under delegated human authority. Others will be embedded into workflows where the human oversight boundary is less clear.

That changes the required architecture. Data access for AI cannot remain a collection of project-specific retrieval pipelines. It needs identity propagation, purpose binding, policy enforcement, managed embeddings, and evidence generation at the point of consumption.

Diagram showing disconnected vector databases, RAG pipelines, governance controls, and AI copilots across an enterprise estate.

Pain two: the regulator's bar has moved

Many enterprises still treat compliance as a documentation problem. That framing is becoming increasingly insufficient.

Across BCBS 239, DORA, the EU AI Act, GxP, operational resilience expectations, and sector-specific supervisory guidance, the direction of travel is reasonably clear. Regulators and auditors are less interested in static governance claims and more interested in whether the organization can produce operationally reproducible evidence.

A large amount of enterprise governance still depends on manually reconstructed lineage, spreadsheet-based attestations, PDF evidence packs, quarterly reconciliation exercises, and architecture diagrams that become outdated shortly after they are approved. These mechanisms were already imperfect in slower, more centralized environments. They become structurally weaker under federated cloud platforms, AI consumption, multi-jurisdiction residency, and continuous modernization.

BCBS 239 remains the most visible empirical warning. More than a decade after its publication, many globally systemic banks still struggle to demonstrate full compliance with supervisory expectations. This is not primarily because banks failed to buy enough governance tooling. It is because continuous evidence production is architecturally different from analytical data management.

The regulator’s question is gradually shifting from whether governance exists to whether the enterprise can prove what happened, when it happened, under which policy, against which data version, and under which organizational authority.

That is a different bar.

The difference becomes especially visible under multi-residency pressure. A modern Tier-1 enterprise may need to operate across Swiss bank-client-data requirements, EU data protection obligations, DORA third-party risk and exit planning requirements, AI governance obligations, and regional restrictions on data movement. At that point, governance is no longer mainly a catalog problem. It becomes a topology problem involving jurisdiction, policy, lineage, access, and evidence.

Pain three: migration has become the steady state

One of the more persistent misconceptions in enterprise architecture is that modernization is a temporary phase. In large organizations, it increasingly behaves like a permanent operating condition.

At any given time, a Tier-1 enterprise may be running cloud migration, post-merger integration, mainframe modernization, data platform consolidation, sovereign cloud restructuring, AI platform expansion, regulatory remediation, and vendor rationalization simultaneously.

The examples differ by industry, but the pattern is stable. Banks modernize legacy estates while moving selected workloads to cloud platforms. Industrial companies restructure SAP environments while building AI and digital-thread capabilities. Financial institutions integrate post-merger estates for years after legal consolidation. Sovereign cloud strategies introduce segmentation requirements that older global platforms were never designed to absorb.

A useful open table format such as Apache Iceberg or Delta Lake improves part of the problem, but it does not make an enterprise portable by itself. Real portability involves compute translation, schema migration, lineage continuity, policy portability, evidence preservation, and repeated operational testing.

I have seen enterprises describe themselves as portable while never having exercised a meaningful failover or substrate transition under realistic conditions. In practice, that is not portability. It is an architectural assumption that has not yet been tested.

DORA’s operational logic makes this distinction more important. An exit plan that exists only as documentation is weak evidence of resilience. Once portability becomes a regulatory and operational concern rather than a technical preference, the architecture has to treat migration evidence, equivalence testing, and lineage preservation as first-class capabilities.

Why these three pains compose

The most important point is that these pressures increasingly cannot be solved independently.

If an enterprise accelerates AI adoption without strengthening evidence production, governance gaps widen and regulatory exposure increases. If it centralizes governance too aggressively, modernization slows and AI experimentation often fragments outside the official platform. If it prioritizes rapid modernization without evidence continuity, lineage ruptures and historical policy context becomes difficult to reconstruct.

This is why many transformation programs now feel expensive, incomplete, and operationally exhausting. The organization is trying to solve a compositional architecture problem through isolated programs.

AI governance, regulatory evidence, and modernization now share dependencies:

  • identity,
  • lineage,
  • policy enforcement,
  • data product contracts,
  • residency metadata,
  • portability discipline,
  • and runtime observability.

Treating these capabilities as separate project deliverables creates duplication and inconsistency. Treating them as part of one architectural operating model creates a path toward coherence, although not a cheap one.

The architectural implication

The next architectural category is unlikely to be defined primarily by storage, compute, metadata, or orchestration alone.

Those capabilities remain necessary, but they are increasingly insufficient as the main organizing principle. The harder architectural question is whether the enterprise can simultaneously support governed AI consumption, regulator-defensible evidence production, and continuous modernization.

That implies three capabilities becoming first-class architectural concerns.

The first is evidence production as part of normal operation rather than periodic reconstruction. The second is operational portability that is tested and evidenced rather than assumed from open formats. The third is a dedicated mediation model for autonomous and semi-autonomous AI consumption.

This represents a conceptual shift. Compliance evidence becomes queryable because the architecture produces it continuously. Portability becomes credible because it is exercised rather than declared. AI governance becomes part of runtime policy enforcement rather than a document attached to an approval process.

That shift may ultimately matter more than the next storage abstraction or metadata platform.

Timeline showing the evolution from warehouse, lake, lakehouse, mesh, fabric, and DataOS toward regulator-defensible architecture.

What I would do Monday morning

If I were a Tier-1 CDO assessing this problem today, I would start with a small but concrete diagnostic.

  1. Identify which datasets are already feeding AI systems operationally, including unofficial pilots and local retrieval pipelines.
  2. Mark where lineage still depends on manual reconstruction rather than machine-produced evidence.
  3. Run one portability simulation this quarter on a limited but meaningful data product.
  4. Require each new AI use case to declare purpose, lawful basis, downstream restrictions, and dataset version dependencies explicitly.
  5. Review whether governance, AI delivery, and modernization are being managed as separate programs despite sharing the same architectural dependencies.

Most enterprises already have many of the underlying technologies required to start. What is usually missing is the framing that treats the three pressures as one architectural constraint.

The tradeoff most organizations underestimate

Architectures capable of producing regulator-defensible evidence continuously are operationally heavier than conventional analytical platforms.

Evidence generation, policy enforcement, portability testing, sovereign segmentation, and AI mediation all introduce cost, latency, governance work, and platform engineering complexity. There is no useful way to pretend otherwise.

The question is whether this overhead is greater or smaller than the alternative: permanent remediation cycles, uncontrolled AI proliferation, migration deadlock, inconsistent lineage, and increasing distrust from regulators and audit functions.

The next decade’s most successful architectures may not be the lightest architectures. They may be the ones that make unavoidable complexity governable.

Closing reflection

The enterprise data industry spent much of the last decade optimizing for scale, analytical convergence, and organizational ownership. Those optimizations were necessary, and many of the resulting architectures remain valuable.

The next pressure wave appears to be different. Evidence, sovereignty, governed AI consumption, and reversible modernization are becoming harder to separate operationally.

Large enterprises will continue modernizing. AI consumption will continue expanding. Regulatory expectations will continue moving toward reproducible evidence. The open question is whether the architecture can preserve proof while the estate keeps changing.

That is increasingly the bar that regulators, boards, AI governance functions, and platform leaders are converging toward simultaneously.

Part of the ongoing SDOP series on regulator-defensible data architecture, sovereign data systems, enterprise AI governance, and continuous modernization.

References

  1. BIS: Principles for effective risk data aggregation and risk reporting — BCBS 239 baseline principles for risk data aggregation and reporting.
  2. BIS: Progress in adopting the BCBS 239 principles — Basel Committee progress reporting on implementation gaps.
  3. ECB: Guide on effective risk data aggregation and risk reporting — ECB supervisory expectations for RDARR.
  4. EUR-Lex: Regulation (EU) 2022/2554 on digital operational resilience — DORA regulation text.
  5. EUR-Lex: Regulation (EU) 2024/1689 Artificial Intelligence Act — EU AI Act official journal text.
  6. FINMA Circular 2023/1: Operational risks and resilience - banks — Swiss operational risk and resilience circular.
  7. Apache Iceberg table specification — Open table format specification referenced in the portability discussion.
  8. OpenLineage documentation — Open lineage framework referenced for evidence and lineage architecture.
  9. SPIFFE/SPIRE documentation — Workload identity and attestation documentation.
  10. Linux Foundation: Agentic AI Foundation and Model Context Protocol — Agentic AI Foundation announcement referencing MCP.

Author

Géza Kuti is a senior Data and AI executive based in Bülach (ZH), Switzerland, focused on data strategy, enterprise architecture, AI governance, hybrid cloud, and regulated delivery.

Related articles