The Governance Gap
When Policies Can’t Prevent a Single Decision
In 2023, UnitedHealth Group deployed an algorithm called nH Predict to manage Medicare Advantage claims for elderly patients. The system had everything a governance checklist could ask for: a stated policy that “AI does not make final coverage decisions,” audit logging, explainability features, and human oversight on paper.
Then the numbers came in. When patients appealed the AI’s denials, human reviewers overturned them nine out of ten times1. A STAT investigation, later cited in the class action lawsuit, revealed that UnitedHealth had “pressured employees to use the algorithm to issue payment denials… setting a goal for employees to keep patient rehabilitation stays within 1% of the length of stay predicted by nH Predict.”2
In February 2025, the US District Court for the District of Minnesota ruled that plaintiffs could proceed with their class action — finding that the Medicare Act did not preempt breach of contract claims involving the alleged use of AI “in lieu of physicians” to make coverage determinations3.
The system had policies. It had logging. It had the ability to explain its outputs after the fact. None of it prevented a single denial from executing.
“In regulated industries, AI governance that doesn’t include a runtime authority plane is just compliance cosplay.”
What Compliance Cosplay Looks Like
Compliance cosplay is governance that looks right but can’t technically prevent an unauthorised decision from executing. Walk through what most organisations actually have:
Sources: Pacific AI 2025; Trustmarque 2025 via Knostic
- Policies: AI usage policies, responsible AI principles, governance charters — three-quarters of organisations have established these4
- Dashboards: Monitoring tools, accuracy metrics, bias checks — all reactive, all after-the-fact
- Audit logs: Records of what happened — forensic artefacts, not prevention mechanisms
- Ethics committees: Quarterly reviews of AI deployments — governance by calendar, not by execution
- Principles documents: “We believe in responsible AI” — aspirational, not enforceable
None of these run at decision time. None can technically prevent an action from executing. And when something goes wrong, the questions that matter are never about model accuracy:
Three Questions Governance Should Answer at Decision Time
- Untraceable Authority: “Who accepted this decision? Under what mandate? With whose authority?”
- Incalculable Risk: “Was risk acknowledged before execution, or reconstructed after?”
- Irreversible Decision: “Could we have stopped this before it happened? Did anything try?”
When the answer to all three is “we’ll check the logs” — that’s compliance cosplay.
The Forensic Artefact Problem
There is a fundamental difference between explaining decisions and constraining decisions. This distinction frames the entire argument.
Explaining Decisions
- Generates an account of what happened after execution
- Forensic investigation — reconstructing cause from effect
- Logs, dashboards, post-hoc explainability tools
- Useful for learning, useless for prevention
Constraining Decisions
- Structurally prevents execution unless conditions are met before the action
- Infrastructure — governance as a runtime property of the system
- Decision gates, authority verification, evidence bundles
- Prevents harm by design, not by hope
Logs are forensic artefacts. Explanations are forensic artefacts. Dashboards are forensic artefacts. None of them are governance mechanisms. Governance that runs after the decision is already too late. Governance that runs with the decision changes system behaviour.
When something goes wrong in a regulated AI deployment, the question is never “Was the model accurate?” The question is: Who accepted this decision? Under what mandate? With whose authority? And where is the evidence that this was consciously exercised?
When that moment of acceptance isn’t embedded into the system, accountability still exists — but only retroactively. Evidence has to be reconstructed. Responsibility becomes narrative. Trust erodes under pressure.
“We can’t just have governance reviewing how the project got made. We can’t just review the inputs and outputs and rely on observation — especially in regulated industries.”
The Explanation Trap
Models asked to self-report their reasoning confabulate. Send a complicated prompt with a large context of data and you cannot reliably ask the AI why it made a decision — it may not even be able to tell you which data it used. That’s not governance — that’s vibes-based auditing.
Prompting a model to report its outcome, the data it used, its reasoning, and its confidence does not make those reports reliable. The model may produce post-hoc rationalisations that sound plausible but don’t reflect the actual computation. Major banks now maintain “shadow human decisioning” systems to validate AI outputs and explain individual decisions — an expensive admission that the AI’s own explanations aren’t regulatory-grade5.
The BIS Financial Stability Institute confirmed the scale of the problem: explainability and the “black box nature of certain AI algorithms” is the top issue discussed between regulators and firms6.
The Scale of the Gap: By the Numbers
This is not a policy deficit. It is a structural enforcement deficit.
The Enforcement Gap in Numbers
- 97% of organisations in AI-related breaches lacked proper access controls8 (IBM, 2025 Cost of a Data Breach)
- 7% have fully embedded AI governance despite 93% using AI8 (Trustmarque, 2025 via Knostic)
- 39% lack optimised governance — can’t demonstrate board-level reviews or automated monitoring9 (Acuvity, 2025)
- Only a minority say their infrastructure is fully prepared to support AI at scale8 (Trustmarque, 2025 via Knostic)
The pattern is consistent across every study. Seventy-five percent of organisations have AI usage policies, yet only 36% have adopted a formal governance framework7. Only 25% have fully implemented AI governance programmes — most have drafted policies but “struggle to turn them into daily practice”8. And 39% operate below managed governance levels entirely, relying on ad hoc practices or nothing at all9.
The Regulatory Hint: EU AI Act Article 12
The EU AI Act, with enforcement beginning August 2026, requires “automatic recording of events (logs) over the lifetime of the system” for high-risk AI11. The logging capability must enable recording events relevant for identifying risk situations, facilitating post-market monitoring, and ensuring traceability.
This is a forensic-readiness requirement, not an execution-constraint requirement. It mandates that logs exist and can identify risks — after the fact. It does not require that the system technically prevent execution if governance conditions aren’t met.
Logging without enforcement is better record-keeping, not better governance. It’s the difference between requiring CCTV in a building and requiring locks on the doors. One tells you what happened. The other stops it from happening.
Article 14, which addresses human oversight, gets closer to runtime enforcement — but as we’ll explore in Chapter 3, even it leaves the mechanism to providers.
What Happens Without Runtime Governance
The trigger moment is approaching: a regulatory inquiry, an audit of an AI-assisted decision, a customer complaint about an automated action, a board member who asks “Can you prove who authorised this?”
When the Regulator Calls
Without Runtime Governance (Current State)
- Can’t answer “who accepted this specific decision?” at decision time
- Logs generated after execution are forensic artefacts, not governance mechanisms
- If the system can’t prevent execution, governance is bypassable
- Evidence must be reconstructed; responsibility becomes narrative
Result: Trust erodes. Defence fails. Fines arrive.
With Decision Authority Infrastructure
- Authority verified before execution, not reconstructed after
- Evidence produced by design, available on demand
- System structurally prevents unauthorised execution
- Accountability is architectural, not narrative
Result: Provable governance. Defensible decisions. Structural compliance.
EU AI Act enforcement begins August 2026. Penalties reach €15M or 3% of worldwide turnover12. The first major enforcement action — likely targeting a high-risk system in hiring or credit — is expected in the €10–30M range13.
Organisations that can’t prove decision-time governance will be exposed. Not because their policies were wrong. Because nothing connected those policies to the moment of execution.
Key Takeaways
1. Most AI governance is compliance cosplay: policies, dashboards, and logs that cannot technically prevent an unauthorised decision from executing.
2. The core design flaw: if a system cannot technically prevent a decision from executing, then governance is external to the system and therefore bypassable.
3. The three questions — Is authority traceable? Is risk calculated before execution? Can the decision be stopped? If the answer is “we’ll check the logs,” you have forensic artefacts, not governance.
4. The evidence gap is quantified: 75% have policies, 7% have embedded governance, 97% of breach victims lacked access controls.
5. The distinction that frames this entire ebook: explaining decisions (forensic, after the fact) vs constraining decisions (structural, before execution).
Chapter 2 explains why this gap exists — the physics of “governance gravity” and what changes when AI executes at runtime. Chapter 3 provides the mental model for closing it.
Governance Has Gravity
Why Runtime AI Changes Everything
Two highways. Same speed limit.
Highway A
Guardrails, lane markings, speed cameras, crash barriers. Drivers go fast because the infrastructure constrains the worst outcomes. Governance is built into the road.
Highway B
No infrastructure. A human stands at every intersection flagging cars. Some get waved through. Some get stopped for committee review. Traffic crawls — or people ignore the flaggers.
Most AI governance is Highway B: human review committees, quarterly ethics boards, post-hoc log analysis — governance that lives outside the system and operates on a different clock than execution.
Runtime governance infrastructure — Highway A — enables speed because it constrains. Pre-authorised structure means faster autonomous operation, not slower committee review. This isn’t a trade-off between governance rigour and deployment speed. It’s a design choice between governance that enables and governance that impedes.
The Governance Gravity Principle
“Governance has gravity — it gets pulled to wherever execution happens.”
Traditional IT mostly executes deterministically. Code does what it does. You write it, review it, test it, deploy it — and it runs exactly as reviewed. Governance can live upstream: SDLC controls, design reviews, testing, change approval. The governance model works because execution is predictable.
Where Execution Lives, Governance Must Follow
Traditional IT: Deterministic Execution
- ✓ Code does exactly what it does — same input, same output
- ✓ Governance lives upstream: review before deploy, monitor after
- ✓ SDLC controls, change management, testing — well-understood
- ✓ Execution is what was reviewed
Runtime AI: Probabilistic Execution
- ✗ Same model, same prompt, same data — different outputs
- ✗ Context changes per request: different customer, data, risk profile
- ✗ System can act before governance wakes up
- ✗ Execution is NOT what was reviewed — it’s what the model produces now
If you’re going to make real-time decisions, you’ve got to have governance as infrastructure — not just documentation and inspection after the event. If governance doesn’t follow execution into runtime, it becomes a historical artefact: a record of what the system was when it was reviewed, not what the system is when it decides.
Governance gravity means the further execution moves from design-time, the more governance must move with it.
The Governance Arbitrage
There is a structural reason compliance cosplay persists — and it isn’t that organisations don’t care about governance. It’s that the governance architecture they know doesn’t work for runtime decisions.
| AI Placement | Governance Model | Status |
|---|---|---|
| Design-time (produces code/artefacts) | Standard SDLC — human reviews before deploy | Exists ✓ |
| Runtime (makes live decisions) | Novel infrastructure needed — no review gate | Missing ✗ |
Design-time AI has a governance path: AI writes code → PR review → CI/CD → automated tests → human approval → deploy. Every step has an existing gate.
Runtime AI has no equivalent path: AI receives request → processes context → executes decision → logs what happened. No structural gate between proposal and execution.
Organisations apply design-time governance thinking — review, approve, deploy — to runtime AI decisions. The gap between review and execution is where governance fails. What’s needed is a new governance architecture purpose-built for the runtime gap. That’s what Decision Authority Infrastructure provides, as we’ll formalise in Chapter 4.
Why Post-Hoc Explainability Fails Structurally
Most current AI governance relies on the model explaining itself: “Tell me why you decided X.” This is asking the subject of an investigation to produce the evidence — fundamentally unreliable.
Models confabulate explanations. They generate plausible-sounding reasoning that may not reflect actual computation. Post-hoc explanation techniques have been shown to explain less than 40% of model behaviour for complex decisions14. Financial services’ response has been telling: major banks now maintain “shadow human decisioning” systems to validate AI outputs — an expensive admission that the AI’s own explanations aren’t regulatory-grade15.
Opacity hides problems. When users can’t see the reasoning, they can’t catch errors. The errors persist. Trust erodes slowly as recommendations fail for reasons nobody can diagnose. Research shows 40% of organisations identify explainability as a key AI risk, but only 17% actively work to mitigate it.
The structural fix isn’t better explanations. It’s moving from asking the model why, to forcing it to cite what it used — structured decision objects that can only reference verified evidence IDs. Then explanation is reconstructable by design: a proof trace, not a self-report. We’ll formalise this mechanism in Chapters 4 and 5.
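To make the shift from self-report to proof trace concrete, here is a minimal sketch in Python. The `DecisionObject` shape and the validation rule are illustrative assumptions, not a published schema:

```python
from dataclasses import dataclass

# Hypothetical sketch: a structured decision object that may only cite
# evidence IDs the environment has already verified. All names are
# illustrative, not a normative specification.

@dataclass(frozen=True)
class DecisionObject:
    action: str              # e.g. "approve", "deny", "escalate"
    cited_evidence: tuple    # evidence IDs the model claims it relied on
    confidence: float

def validate_citations(decision: DecisionObject, verified_ids: set) -> None:
    """Reject any decision that cites evidence outside the verified bundle."""
    unknown = set(decision.cited_evidence) - verified_ids
    if unknown:
        raise ValueError(f"decision cites unverified evidence: {sorted(unknown)}")
```

Because the cited IDs resolve to signed artefacts, the explanation becomes reconstructable by design — a proof trace rather than a self-report.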
The Agentic Acceleration
40% of workflows autonomous by 2026 — Gartner projection, via Forbes Feb 2026
Agentic AI doesn’t just widen the governance gap — it makes it structural. Autonomous multi-step execution outpaces human monitoring. Inter-agent communication creates error propagation and attack surfaces. Dynamic tool-calling and data access cannot be pre-audited. Authority and intent are implicit — inherited from a chat session or config — rather than cryptographically verified per action.
In January 2026, Singapore’s Infocomm Media Development Authority published the world’s first agentic AI governance framework, acknowledging that agents’ “access to sensitive data and ability to make changes to their environment” raises an entirely new profile of risks. The framework explicitly calls for “defining action boundaries requiring human approval” for high-stake or irreversible actions16.
UC Berkeley’s Center for Long-Term Cybersecurity followed in February 2026 with an Agentic AI Risk-Management Standards Profile, naming “unintended goal pursuit, unauthorized privilege escalation or resource acquisition, and other behaviors, such as self-replication or resistance to shutdown” as unique challenges that “complicate traditional, model-centric risk-management approaches and demand system-level governance”17.
As AI agents move from experimental tools to autonomous decision makers, they are negotiating contracts, processing transactions, and accessing sensitive data — all without a standardised way to prove who they are or what they’re authorised to do18.
The AI Paradox: Better Tools, Same Governance Trap
“All we’re really doing with AI is avoiding deterministic programming. But the other side of the coin is we’re getting better at deterministic programming using AI.”
We reach for AI because deterministic code is too expensive to write and maintain for complex, unstructured decisions. But AI’s probabilistic nature creates a governance burden that deterministic code never had. Meanwhile, AI is also making us dramatically more productive at writing deterministic code.
The paradox resolves toward architecture: use AI to build better deterministic infrastructure — the governance pipeline, the decision gates, the enforcement mechanisms — and confine runtime AI to the narrower tasks where deterministic code genuinely can’t reach: reading messy documents, extracting facts from unstructured data, classifying intent.
This isn’t a retreat from AI. It’s the mature architecture: deterministic infrastructure (the governance skeleton) with AI perception at specific injection points. The model proposes; the deterministic gate enforces. Chapter 4 will formalise this as the IPA pattern.
Why Business Pressure Makes This Urgent
Business demand for real-time AI is not going away. Real-time AI feels like a magical employee — instant answers, automated decisions, no queue. Business people will push towards it, and that’s where governance of traditional IT and AI diverges completely.
The Cost of the Governance Gap
- 60% of organisations will fail to realise AI value by 2027 due to incohesive governance52 (Gartner, 2024 prediction)
- Most AI pilots fail to reach production — despite 70% succeeding technically53 (AIM Research)
The governance gap isn’t just a risk problem. It’s a value-extraction problem. Governance that blocks deployment is as costly as governance that fails in production.
The answer isn’t “never real-time.” It’s governance infrastructure that runs at the speed of execution:
1. Do the high-value cognition in batch/offline — preparation, retrieval, synthesis, evidence packing
2. Keep the real-time layer bounded — constrained action set, deterministic gating
3. Gate execution through a deterministic enforcement point — the In-Path Authority
Chapter 3 provides the mental model for this architecture. Chapter 4 provides the specification.
Key Takeaways
1. Governance has gravity — it gets pulled to wherever execution happens. If execution is runtime, governance must be runtime.
2. Traditional IT governance works because execution is deterministic. Runtime AI is probabilistic — upstream-only governance becomes theatre.
3. The governance arbitrage: design-time AI has existing governance paths (SDLC); runtime AI has none.
4. Post-hoc explainability fails structurally: less than 40% of behaviour explained for complex decisions.
5. Agentic AI accelerates the problem: 40% of workflows autonomous by 2026, each action an ungoverned execution event.
6. The AI paradox resolves toward architecture: deterministic infrastructure with AI perception at injection points.
Next: Chapter 3 introduces the mental model that unlocks everything — Zero Trust for Decisions.
Zero Trust for Decisions
The Analogy That Unlocks Everything
2010. John Kindervag, a principal analyst at Forrester Research, publishes a simple, radical assertion: “Trust is a vulnerability.”
The prevailing security model at the time was perimeter defence: firewalls, VPNs, DMZs. Once you were inside the network, you were trusted. Kindervag’s insight was that this implicit trust was exactly how breaches propagated. An attacker who got past the perimeter had unrestricted lateral access. His alternative: Zero Trust — “never trust, always verify.” Authenticate and authorise every request, regardless of where it originates19.
The security industry’s reaction followed a predictable arc: scepticism, then gradual adoption, then structural shift. In 2014, Google’s BeyondCorp initiative demonstrated zero trust at enterprise scale — replacing VPNs with identity and device verification at every access point. In 2020, NIST SP 800-207 codified Zero Trust Architecture as a federal standard20. Today, zero trust is the default security architecture for any serious enterprise.
The parallel to AI governance is precise. AI governance today is where network security was in 2008: trusting the perimeter — policy documents, governance frameworks, ethics committees — while the real decisions happen inside the network without verification.
The Mapping: Perimeter to Point-of-Access
Network Security Evolution
| Era | Security Model | Trust Assumption | Failure Mode |
|---|---|---|---|
| Pre-2010 | Perimeter (firewalls, VPNs) | Inside network = trusted | Breach propagation |
| Post-2010 | Zero Trust | Nothing trusted by default | Breach contained |
AI Governance Evolution (Same Shift, Applied to Decisions)
| Era | Governance Model | Trust Assumption | Failure Mode |
|---|---|---|---|
| Current | Policy perimeter | Inside framework = governed | Ungoverned execution |
| Required | Zero Trust for Decisions | No decision trusted without runtime verification | Authority verified at every action |
The Translation
| Network Security | AI Governance |
|---|---|
| “Trust the network perimeter” | “Trust the policy document” |
| “Authenticate at point of access” | “Verify authority at point of decision” |
| “Every request verified, regardless of origin” | “Every decision gated, regardless of model confidence” |
Zero trust didn’t make networks slower. It made them safer AND enabled more distributed, autonomous operation — because each request was verified in real time. The same is true for AI governance.
The Trust Hierarchy
The zero trust analogy maps directly to a three-level trust hierarchy for AI systems. Most AI governance operates at Level 1–2. Compliance cosplay is Level 1 dressed up as Level 3.
Level 1: Vibes — Fragile
Mechanism: Prompting, guardrails, system instructions
AI Governance equivalent: Policies, principles, responsible AI statements
Reality: “Please don’t make unauthorised decisions” → The model does whatever it computes
Level 2: Monitoring — Reactive
Mechanism: Observability, dashboards, alerting
AI Governance equivalent: Audit logs, post-hoc explainability, drift detection
Reality: “We’ll catch it if you do” → The damage is already done when you catch it
Level 3: Architecture — Structural
Mechanism: Containment, scoping, enforcement gates
AI Governance equivalent: Runtime decision gates, authority verification, evidence bundles
Reality: “You literally cannot execute without verified authority” → The system makes it impossible
Zero Trust for Decisions: What It Actually Means
“Models don’t get to ‘do.’ They only get to ‘suggest.’ And every suggestion hits an enforcement boundary that can prove what it relied on.”
Concretely, this means:
- The AI system proposes a decision — approve, deny, escalate, recommend
- Before that proposal executes, it passes through a deterministic enforcement boundary — the In-Path Authority (IPA)
- The IPA verifies: Does this agent have authority for this action? Is the evidence bundle admissible and signed? Is the risk level within the pre-authorised threshold? Is this action reversible, or does it require higher authority?
- Only if all conditions are met: ALLOW. Otherwise: PAUSE (human review) or DENY (blocked)
- The decision trace is produced by the system as a structural artefact, not reconstructed later by investigation
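The flow above can be sketched as a single deterministic gate. This is a minimal illustration, assuming numeric risk/authority tiers and a reversibility flag stand in for the real checks; the thresholds are assumptions, not a specification:

```python
# Hedged sketch of an In-Path Authority (IPA) gate. Tier numbering and the
# reversibility rule are illustrative assumptions.

def ipa_gate(bundle_admissible: bool, authority_tier: int,
             risk_tier: int, reversible: bool) -> str:
    """Return ALLOW, PAUSE, or DENY for a proposed action."""
    if not bundle_admissible:
        return "DENY"    # inadmissible evidence: block and record the reason
    if authority_tier < risk_tier:
        return "PAUSE"   # authority gap: route to human review with the bundle
    if not reversible and risk_tier >= 2:
        return "PAUSE"   # irreversible high-risk action: require human sign-off
    return "ALLOW"       # all conditions met: execute and record
```

The point of the sketch is that every branch is deterministic: the model’s proposal never reaches execution except through `ALLOW`.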
The critical distinction from current “guardrails”: current guardrails are prompt-level instructions telling the model what not to do — Level 1 (Vibes), socially hackable, probabilistic. Zero Trust for Decisions is environment-level enforcement making certain actions structurally impossible unless conditions are met — Level 3 (Architecture), deterministic, verifiable.
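The difference can be made concrete with a minimal sketch of environment-level enforcement, assuming a hypothetical tool registry and allowlist (names are illustrative): out-of-scope actions fail structurally, not because the prompt asked nicely:

```python
# Hedged sketch: the runtime, not the prompt, decides what can execute.
# Tool names and the registry shape are illustrative assumptions.

ALLOWED_TOOLS = {"lookup_policy", "draft_reply"}   # pre-authorised action set

def execute_tool(requested: str, registry: dict, **kwargs):
    """Execute a model-requested tool only if it is inside the authorised scope."""
    if requested not in ALLOWED_TOOLS:
        # Not an instruction the model can talk its way around:
        # out-of-scope calls are structurally impossible.
        raise PermissionError(f"tool '{requested}' is outside authorised scope")
    return registry[requested](**kwargs)
```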
As one implementation demonstrated: “This zero-trust setup ensures our autonomous agents are auditable and accountable. We no longer have to worry, ‘What if the AI goes off and does X without permission?’ Because in our design, the AI literally cannot do X without permission — the identity system won’t let it.”
SR 11-7: Financial Services Already Governs Probabilistic Engines
Source: ArticleSledge, Model Risk Management Guide 2025
The Federal Reserve’s SR 11-7 guidance, issued in April 2011, has governed “probabilistic engines” in financial services for over a decade. It defines “model” broadly as any quantitative method that processes inputs into quantitative estimates — which includes modern AI systems21.
SR 11-7 requires three fundamental safeguards: robust model development with proper documentation, effective independent validation, and sound governance with senior management oversight22. These are the closest existing equivalent to runtime governance for probabilistic systems.
Yet even with mandatory requirements, only 44% of banks properly validate their AI tools23. The Bank Policy Institute acknowledged in October 2025 that “the current rigid application of MRM Guidance to banks’ AI use cases has hindered (and continues to hinder) AI adoption”24.
SR 11-7 proves governing probabilistic systems is not new. The regulatory framework exists and has been mandatory in financial services since 2011. What it reveals is the compliance cosplay pattern in its purest form: the governance requirement exists; the enforcement mechanism doesn’t.
EU AI Act Article 14: The Regulatory Direction
Article 14 is the closest the EU AI Act comes to prescribing runtime enforcement. It requires that high-risk AI systems “be designed and developed in such a way… that they can be effectively overseen by natural persons during the period in which they are in use.” It mandates that humans retain the ability to “decide, in any particular situation, not to use the high-risk AI system or to otherwise disregard, override or reverse the output.” For biometric identification systems, “no action or decision is taken” unless it “has been separately verified and confirmed by at least two natural persons”25.
Article 14 is directionally right but architecturally silent. It requires that oversight be built into the system — not applied after the fact. It requires the structural ability to override or stop. But it leaves the enforcement mechanism to providers. It says what must be possible; it doesn’t prescribe how.
Decision Authority Infrastructure is one answer to Article 14’s “how” — and Chapter 4 provides the specification.
The Convergence: Independent Practitioners, Same Architecture
Independent convergence is the strongest validation signal. Across security, compliance, and AI architecture, practitioners are arriving at the same structural pattern — runtime enforcement gates, not post-hoc review.
Thinking OS (Feb 2026)
Three layers: Propose (intelligence), Commit (authority gate), Remember (audit trail). “Most organisations are pouring effort into #1 and #3. Almost nobody clearly owns #2.”
AWS AgentCore
Deterministic policy enforcement at the gateway — “The AI literally cannot exceed its scope.”
Policy-as-Code (OPA, Sentinel)
Translating governance into executable constraints within MLOps workflows — “models failing these checks get blocked from being deployed.”
Forbes AI Agent Trust Stack (Feb 2026)
Four dimensions: identity layer, operational boundaries, runtime behaviour, governance and compliance — all verifiable at execution time.
The pattern is clear: runtime enforcement gates, not post-hoc review. Propose → Commit (authority gate) → Remember — with the Commit layer almost universally missing. Chapter 4 formalises this as the Four Pillars of Decision Authority Infrastructure.
“Governance that runs after the decision is already too late. Governance that runs with the decision changes system behaviour.”
Key Takeaways
1. The zero trust evolution in network security (perimeter → point-of-access verification) is the precise parallel for AI governance.
2. Zero Trust for Decisions: models don’t get to “do” — they only get to “suggest.” Every suggestion hits an enforcement boundary.
3. The trust hierarchy: Level 1 Vibes (policies) → Level 2 Monitoring (logs) → Level 3 Architecture (enforcement gates). Most governance is Level 1–2.
4. “Can’t beats shouldn’t” — policy says “you shouldn’t”; architecture says “you can’t.”
5. Financial services has governed probabilistic engines since 2011 (SR 11-7) — the playbook exists but only 44% of banks properly validate AI under it.
6. EU AI Act Article 14 prescribes the outcome (human oversight built into the system) but not the mechanism. DAI provides the mechanism.
Next: Chapter 4 specifies what runtime governance actually requires — the Four Pillars of Decision Authority Infrastructure.
The Four Pillars
What Runtime Governance Actually Requires
From Explanation to Constraint. That is the title of the Decision Authority Infrastructure model, and it is the thesis of this entire chapter. Governance stops explaining decisions and starts constraining them.
Four questions. Every decision must answer all four before it executes. If it can’t, it doesn’t execute. That’s the shift.
Decision Authority Infrastructure
- Admissible Knowledge: “What is allowed to matter?”
- Fixed Authority: “Who has the right to decide?”
- Gated Execution: “When can action proceed?”
- Provable Evidence: “How is accountability captured?”
The strategic hierarchy runs: Business Strategy → Authority → AI Fabric → Scalable Architecture → Action. Authority is the keystone between strategy and execution. On the data side: Data Governance → Data Management → Data Operations. On the AI side: AI Governance → AI Cybersecurity → AI Operations. Authority connects them. Without it, strategy and execution are disconnected by governance theatre.
The current state has three weaknesses — Untraceable Authority, Incalculable Risk, Irreversible Decision. The missing layer converts each into its governed counterpart: Verified Authority, Acknowledged Risk, Decision-Time Seal. That missing layer is Decision Authority Infrastructure.
Pillar 1: Admissible Knowledge
“What is allowed to matter?”
Only verified, signed, in-scope artefacts enter the decision. Out-of-scope data is structurally excluded — not merely discouraged, not filtered by prompt instruction, but architecturally prevented from entering the evidence bundle.
Every artefact that enters a decision gets an ID + hash + source + timestamp + signer: user requests (channel-authenticated), retrieved records (API-authenticated), intermediate tool outputs (signed by the tool service or wrapped by a trusted gateway), and policy/version references documenting what rules were in force at decision time.
The agent never “trusts memory.” It consumes an evidence bundle: a small set of signed artefacts the IPA assembles for that specific decision. This prevents data from unauthorised sources entering the decision context, stale policy versions being applied, unverified intermediate results being treated as facts, and context poisoning through injected or fabricated data.
Prompt-level filtering isn’t enough. Instructions like “only use the evidence provided” are probabilistic and socially hackable. Context poisoning, prompt injection, and “ignore previous instructions” attacks bypass prompt-level controls.60 Admissible Knowledge is an environmental control — the model never sees inadmissible data because the environment excludes it before the model runs.
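A minimal sketch of a signed evidence artefact entering a bundle, assuming HMAC as a stand-in for a real signature scheme; the field names and admissibility check are illustrative, not a wire format:

```python
import hashlib
import hmac
import json
import time

SIGNING_KEY = b"demo-key"  # in practice, a per-signer key from a KMS

def make_artefact(artefact_id: str, source: str, payload: dict) -> dict:
    """Produce an artefact with ID + hash + source + timestamp + signature."""
    body = json.dumps(payload, sort_keys=True).encode()
    digest = hashlib.sha256(body).hexdigest()
    sig = hmac.new(SIGNING_KEY, f"{artefact_id}:{digest}".encode(),
                   hashlib.sha256).hexdigest()
    return {"id": artefact_id, "source": source, "hash": digest,
            "timestamp": time.time(), "signature": sig, "payload": payload}

def admissible(artefact: dict) -> bool:
    """An artefact is admissible only if its hash and signature both verify."""
    body = json.dumps(artefact["payload"], sort_keys=True).encode()
    if hashlib.sha256(body).hexdigest() != artefact["hash"]:
        return False  # payload tampered with after signing
    expected = hmac.new(SIGNING_KEY,
                        f"{artefact['id']}:{artefact['hash']}".encode(),
                        hashlib.sha256).hexdigest()
    return hmac.compare_digest(expected, artefact["signature"])
```

Tampered or unsigned data simply never becomes admissible, so the model’s context is constrained by the environment rather than by instruction.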
Pillar 2: Fixed Authority
“Who has the right to decide?”
Authority is verified before execution — not “who reviewed it later” but “whose mandate covers this specific action right now.” Authority is fixed at decision time, not reconstructed from org charts after the fact.
Before any decision node runs, the IPA checks: request channel authentication (who asked), permitted action scope (what they’re allowed to do), authority level vs decision risk tier (does this person’s authority cover this risk level?), and delegation chain validity (if delegated, is it current?).
The check yields one of three outcomes:
- • Authorised: proceed with execution
- • Escalate: higher authority needed
- • Restrict: insufficient authority
In human-only systems, authority is implicit: the person doing the work is the authority. In AI-assisted systems, authority is ambiguous. Who authorised this specific AI decision? The person who deployed the system? The person who approved the model? The person who configured the rules? The end user who triggered the request?
Agents inherit a chat-era trust model where all text arrives equal — no channel carries cryptographic proof of origin. Without fixed authority, the system can’t distinguish between a legitimate user request, an injected instruction, a system message, or an agent forwarding another agent’s request. This is the confused deputy problem at scale.
“The biggest risk in AI isn’t the model. It’s the authority we give it.”
Pillar 3: Gated Execution
“When can action proceed?”
ALLOW / PAUSE / DENY enforcement at decision time. The deterministic gate between AI proposal and execution. This is the pillar that makes governance structural rather than aspirational.
Every action passes through the In-Path Authority before execution. The IPA evaluates: Is the evidence bundle complete and admissible? Is authority verified and sufficient for this risk tier? Is this action reversible or irreversible? Does the risk level exceed the pre-authorised threshold? Have all required verification steps completed?
Gate Outcomes
- • ALLOW: all conditions met → execute and record. Evidence bundle complete, authority verified, risk within threshold.
- • PAUSE: conditions partially met → route to human review with evidence bundle. Authority gap, risk threshold exceeded, or verification incomplete.
- • DENY: conditions not met → block execution, record reason, alert. Insufficient authority, inadmissible evidence, or policy violation.
The Irreversibility Gradient
Not all decisions need the same gate rigour. The principle: earn autonomy through evidence — reversible → irreversible → high-stakes.
- • Reversible actions: drafting a response, generating a report → lighter gates, faster execution
- • Irreversible actions: executing a payment, denying a claim, sending a legal notice → full gate verification, potential human attestation
- • High-stakes irreversible actions: material financial decisions, regulatory filings → dual verification, mandatory human override
The gate must be deterministic, not probabilistic. Enforce policy outside the LLM — deterministic gateways, not probabilistic prompts. A model-based gate is socially hackable; a deterministic gate evaluates conditions against policy rules. It doesn’t “understand” — it enforces.55
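A deterministic gate of this kind can be sketched in a few lines. The action names, reversibility classification, and risk thresholds below are assumptions for illustration — the point is that the gate is a pure function over verified conditions, with stricter thresholds for less reversible actions:

```python
# Illustrative deterministic gate; no model in the loop.
REVERSIBILITY = {
    "draft_response": "reversible",
    "deny_claim": "irreversible",
    "file_regulatory_report": "high_stakes",
}
RISK_THRESHOLD = {"reversible": 0.8, "irreversible": 0.3, "high_stakes": 0.0}

def gate(action: str, evidence_complete: bool, authority_ok: bool,
         risk_score: float, human_attested: bool = False) -> tuple[str, str]:
    """Evaluate conditions against policy rules and return (outcome, reason)."""
    if not evidence_complete:
        return ("DENY", "inadmissible or incomplete evidence bundle")
    if not authority_ok:
        return ("DENY", "insufficient authority for this action")
    tier = REVERSIBILITY.get(action, "high_stakes")  # unknown actions get the strictest tier
    if risk_score > RISK_THRESHOLD[tier] and not human_attested:
        return ("PAUSE", f"risk {risk_score:.2f} exceeds {tier} threshold; human review required")
    return ("ALLOW", "all conditions met; execute and record")
```

Because every branch is a comparison against policy data, the gate cannot be talked out of its decision — the failure modes are configuration errors, not injection attacks.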
Pillar 4: Provable Evidence
“How is accountability captured?”
Decision trace produced by design, not investigation. The audit trail is a structural artefact of the governance infrastructure, not a log file reconstructed after the fact.
Every gated decision produces a compact evidence chain: decision ID and timestamp, evidence bundle references (what artefacts were admissible), authority attestation (who had authority, what scope), gate outcome (ALLOW/PAUSE/DENY and reason), structured reason codes (machine-readable, not free-text explanation), policy version reference, and execution trace.
Receipts, Not Logs
Current: Forensic Investigation
- System logs what happened
- Analyst reconstructs why
- Investigators determine who
- Report produced weeks/months later
DAI: Structural Evidence
- System produces evidence chain at decision time
- Every decision comes with its own proof
- Audit is replay, not investigation
- Available on demand, immediately
When someone asks “Why did the system recommend X?” the answer is built in. Here’s the evidence, here’s the reasoning, here’s who approved it and when. No need to reconstruct after the fact. No forensic archaeology through logs. The explanation exists as a first-class artefact.
Each receipt records what was attempted, what evidence was used, what authority was verified, what the gate decided, and what was executed. Receipts are append-only, immutable, and linked — forming a chain that can be replayed end-to-end.
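One simple way to realise "append-only, immutable, and linked" is a hash chain, where each receipt commits to the hash of its predecessor. This is a sketch under assumed field names, not a prescribed format; a production system would add signatures and durable storage:

```python
import hashlib
import json

class ReceiptChain:
    """Append-only, hash-linked decision receipts; tampering breaks the chain on replay."""

    def __init__(self):
        self._receipts = []

    def append(self, receipt: dict) -> dict:
        prev = self._receipts[-1]["this_hash"] if self._receipts else "genesis"
        entry = {"prev_hash": prev, **receipt}
        entry["this_hash"] = hashlib.sha256(
            json.dumps(entry, sort_keys=True).encode()).hexdigest()
        self._receipts.append(entry)
        return entry

    def replay(self) -> bool:
        """Audit is replay, not investigation: re-derive and verify every link."""
        prev = "genesis"
        for entry in self._receipts:
            if entry["prev_hash"] != prev:
                return False
            body = {k: v for k, v in entry.items() if k != "this_hash"}
            if hashlib.sha256(json.dumps(body, sort_keys=True).encode()).hexdigest() != entry["this_hash"]:
                return False
            prev = entry["this_hash"]
        return True
```

Replaying the chain re-derives every hash, so any after-the-fact edit to any receipt is detectable — the evidence is structural, not reconstructed.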
The IPA Pattern: How the Pillars Work Together
The IPA Is a Firewall for Decisions
Request
↓
Evidence Bundler (Pillar 1: Admissible Knowledge)
↓
LLM Proposes (structured decision object)
↓
IPA Verifies (Pillar 2: Authority + Pillar 3: Gated Execution)
↓
Execute or Block
↓
Append-Only Audit Record (Pillar 4: Provable Evidence)
LLM = proposer / drafter / classifier — the AI does the cognitive work
IPA = policy enforcement + execution gate — deterministic, model-agnostic, authority-aware, irreversibility-sensitive
The IPA’s defining characteristics: model-agnostic (works with any LLM — the enforcement layer doesn’t depend on the intelligence layer), non-autonomous (doesn’t make decisions, evaluates conditions against rules), authority-aware (knows who has authority for what, at what risk tier), and irreversibility-sensitive (different thresholds for different action types).
Models cannot verify their own provenance. A model can parrot that it checked something. It cannot be the trusted verifier. The environment — the IPA — must verify signatures and hashes, mint time-scoped capability tokens, attach evidence bundles to every action, and block execution if provenance or authority is missing. That’s the difference between explaining a decision and constraining a decision.59
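The flow above can be sketched end to end. Here `propose` stands in for the LLM call; everything after it is deterministic. All function names, fields, and rules are illustrative assumptions, not a reference implementation:

```python
# End-to-end sketch of the IPA pattern: propose -> verify -> execute or block -> record.

def propose(request: dict) -> dict:
    """LLM role: return a structured decision object; never execute anything."""
    return {"action": "deny_claim", "confidence": 0.92, "basis": request["evidence_ids"]}

def ipa_verify(proposal: dict, request: dict) -> str:
    """IPA role: deterministic checks; the model's self-reported confidence is not trusted."""
    if not set(proposal["basis"]) <= set(request["evidence_ids"]):
        return "DENY"    # proposal cites evidence outside the admissible bundle
    if request["authority_level"] < request["required_level"]:
        return "DENY"    # mandate does not cover this action
    if proposal["action"] in request["irreversible_actions"]:
        return "PAUSE"   # irreversible: route to human attestation
    return "ALLOW"

def handle(request: dict, audit: list) -> str:
    proposal = propose(request)
    outcome = ipa_verify(proposal, request)
    audit.append({"proposal": proposal, "outcome": outcome})  # append-only record
    return outcome
```

The design choice to note: the model produces a proposal object, and the IPA treats it as untrusted input — the same posture a firewall takes toward packets.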
Mapping to the Agent Provenance Stack
| DAI Pillar | Provenance Layer | Question |
|---|---|---|
| Admissible Knowledge | Layer 3 — Artefact | Can you trust what’s running? |
| Fixed Authority | Layer 1 — Identity + Layer 2 — Intent | Who is asking, and what are they allowed to ask for? |
| Gated Execution | (Enforcement mechanism) | Does execution meet all conditions? |
| Provable Evidence | Layer 4 — Execution | Can you prove what happened? |
The Provenance Stack defines the layers; Decision Authority Infrastructure defines the pillars. They are two views of the same architectural requirement: provable, traceable, pre-execution governance.
Key Takeaways
- 1. Decision Authority Infrastructure has four pillars, each answering a question every AI decision must resolve before execution.
- 2. Admissible Knowledge: only signed, verified, in-scope evidence enters the decision.
- 3. Fixed Authority: authority verified before execution, not reconstructed after.
- 4. Gated Execution: ALLOW / PAUSE / DENY enforcement, irreversibility-sensitive.
- 5. Provable Evidence: decision trace by design, not investigation.
- 6. The IPA is the deterministic enforcement point: model-agnostic, non-autonomous, authority-aware, irreversibility-sensitive.
Next: Chapter 5 explores what changes operationally when governance shifts from explanation to constraint.
From Explanation to Constraint
What Changes When Governance Becomes Infrastructure
“If you send a whole big complicated prompt and a large context of data, you can’t even reliably ask the AI why it made a decision. It might not even be able to reliably tell you what data it used.”
This is the moment the explanation paradigm breaks. The fundamental assumption behind most AI governance — that you can understand what happened by asking the model — collapses under examination.
The entire governance industry is built on explanation: explainability tools, post-hoc analysis, model cards, fairness reports, bias assessments,6 all of which assume you can reconstruct what happened after the fact.
What if the answer isn’t better explanation? What if it’s making explanation unnecessary — because the system constrains decisions instead of explaining them?
The Explanation Paradigm vs The Constraint Paradigm
Two fundamentally different governance architectures. Not a spectrum — a fork. Like the difference between perimeter security and zero trust (see Chapter 3), this isn’t incremental improvement. It’s a different architecture.
Two Governance Architectures
| Dimension | Explanation Paradigm | Constraint Paradigm |
|---|---|---|
| When governance runs | After execution | Before and during execution |
| What it produces | Explanations, logs, reports | Evidence chains, decision traces, gate outcomes |
| How accountability works | Reconstructed from logs by investigators | Produced by the system at decision time |
| What it can prevent | Nothing — it can only describe what happened | Unauthorised execution — structurally prevented |
| Speed relationship | Slows deployment (review bottleneck) | Enables deployment (pre-authorised autonomy) |
| Trust model | “We’ll catch it if something goes wrong” | “The system makes it impossible to go wrong” |
| Regulatory posture | “We can explain what happened” | “We can prove authority existed before action” |
The shift in one sentence: governance stops explaining decisions and starts constraining them.
What Changes Operationally: Three Contradictions Resolved
The explanation paradigm forces three false trade-offs that organisations accept as structural realities. The constraint paradigm dissolves all three.
Contradiction 1: Governance Rigour ↑ vs Deployment Speed ↑
Under the Explanation Paradigm
More governance → slower. Every AI decision needs human review → bottleneck → pressure to skip review → governance erodes → compliance cosplay.
Under the Constraint Paradigm
Runtime governance infrastructure enables faster deployment. The IPA verifies authority in milliseconds — no human queue. Pre-verified evidence bundles mean admissible inputs from the start. Gate outcomes are deterministic — no ambiguity about whether to proceed. Only PAUSE outcomes require human intervention.
The highway analogy from Chapter 2: guardrails enable speed. Removing guardrails doesn’t make the highway faster — it makes it dangerous, which makes everyone drive slower.
Contradiction 2: AI Autonomy ↑ vs Human Accountability ↑
Under the Explanation Paradigm
Push autonomy up, accept accountability gaps. Nobody can answer “who authorised this” because authorisation was implicit. Or keep humans in the loop for everything — defeating the purpose.
Under the Constraint Paradigm
Decision Authority Infrastructure makes accountability a property of the architecture, not the org chart. Every decision has Fixed Authority (Pillar 2). Every decision has Provable Evidence (Pillar 4). Humans are accountable for setting authority boundaries — the system enforces them.
“Organisations don’t fail because they adopt AI — they fail because they delegate authority without clarity.”
Contradiction 3: Real-time Execution ↑ vs Governance Depth ↑
Under the Explanation Paradigm
Real-time decisions → shallow governance (no time for review). Deep governance → batch only (too slow for real-time). Either slow governance via committees, or fast-but-shallow governance via monitoring.
Under the Constraint Paradigm
Governance infrastructure runs WITH the decision. Same latency budget, but authority is checked by the system, not by a human reviewing logs next week. The IPA operates in the execution path — it doesn’t add a separate governance step, it is the execution path.
Policy-as-code enforcement operates at sub-millisecond latency: “Advanced platforms implement governance checks in parallel with inference requests, ensuring sub-millisecond latency impact.” 26
“Governance that runs after the decision is already too late. Governance that runs with the decision changes system behaviour.”
What the Regulatory Landscape Requires vs What It Provides
Every major framework prescribes governance outcomes but not enforcement mechanisms.32 The pattern is consistent: all necessary, all insufficient.
| Framework | What It Requires | What It Provides | What’s Missing |
|---|---|---|---|
| EU AI Act (Art 12, 14, 17) | Logging, human oversight, quality system | Obligations and penalties | Runtime enforcement mechanism |
| NIST AI RMF | Govern/Map/Measure/Manage lifecycle | Voluntary organisational scaffolding | Runtime enforcement mechanism |
| ISO 42001 | AI management system, internal audits | Process and documentation standard | Runtime enforcement mechanism |
| SR 11-7 | Model validation, ongoing monitoring, governance | Mandatory safeguards for probabilistic engines | Runtime enforcement mechanism |
Each defines WHAT governance should look like but doesn’t provide the runtime HOW. Decision Authority Infrastructure is the enforcement complement — not a replacement for these frameworks, but the missing layer that makes their requirements operationally enforceable. Chapter 6 maps this in detail.
The Before and After: How DAI Changes Daily Operations
Before DAI: The Explanation Paradigm
AI Deployment
When a Decision Fails
- Someone notices (could be days or weeks later)
- Investigation begins — pull logs, reconstruct context, interview operators
- Try to determine: what data was used? What model version? Who had oversight?
- Produce report (weeks to months)
- Conclusion: “we’ll update the policy” or “we’ll add monitoring”
- Nothing structurally changes about execution
When the Auditor Calls
- Auditor asks: “Show me governance evidence for this decision”
- Team scrambles to reconstruct: pull logs, correlate timestamps, map to policy versions
- Evidence is circumstantial — “the policy was in place” but “we can’t prove it was enforced at decision time”
- Finding: “governance exists but enforcement is unclear”
After DAI: The Constraint Paradigm
AI Deployment
When a Decision Executes
- Evidence bundle assembled automatically (Pillar 1)
- Authority verified at decision time (Pillar 2)
- Gate evaluates conditions (Pillar 3) → ALLOW / PAUSE / DENY
- Decision trace produced by the system (Pillar 4) — contemporaneous, signed, complete
- If PAUSE: routed to human with full context. If DENY: blocked, logged, alerted
When the Auditor Calls
- Auditor asks: “Show me governance evidence for this decision”
- System produces: evidence bundle, authority attestation, gate outcome, reason codes, policy version, execution trace
- Evidence is structural — “here is the decision-time proof chain”
- Finding: “governance is embedded and provable”
What “Not In Scope” Looks Like — The Honest Boundaries
This article establishes the WHY (Chapters 1–2: the governance gap, governance gravity) and the WHAT (Chapters 3–5: zero trust for decisions, four pillars, the operational shift). Deliberately out of scope:
How to build the decision DAG
Breaking complex decisions into separately governable nodes, proof-carrying proposals, the adversarial verification chain → Article 2
How SDLC becomes governance
The policy compiler pattern, treating policies as source code, CI/CD for decision systems → Article 3
Deep regulatory framework analysis
NIST AI RMF detail, ISO 42001 implementation, EU AI Act article-by-article compliance mapping → Article 4
This is intentional. DAI is the architectural principle. Implementation requires further work specific to your domain, your risk profile, and your regulatory obligations. But the principle must be established first — you can’t build the HOW without understanding the WHY.
The Governance Cost Inversion
Under the explanation paradigm, governance is a cost centre. It slows deployment, requires investigation staff, produces compliance documents nobody reads. “Governance fatigue” sets in — teams treat governance as a tax, not a capability.
Under the constraint paradigm, governance is an enabler. It enables faster autonomous operation through pre-authorised structure rather than committee review. It produces evidence by design, eliminating the need for investigation staff. It makes compliance a system property, turning audits into replay sessions. And it reduces regulatory risk through provable authority chains rather than reconstructed narratives.
The second-order effect is where the real value lies. Evidence produced by design enables faster audit response, faster regulatory response, faster trust-building with customers and partners. This shifts governance from “thing that slows us down” to “thing that makes new things possible.” Organisations with provable governance infrastructure can operate more autonomous AI — because they can demonstrate control.
The Governance Cost Inversion
Explanation Paradigm
Governance = cost centre
- • Slows deployment
- • Requires investigation staff
- • Produces documents nobody reads
- • Teams treat it as a tax
Constraint Paradigm
Governance = enabler
- • Enables faster autonomous operation
- • Evidence produced by design
- • Compliance is a system property
- • Makes new things possible
The SDLC Resolution
“It’s not that we want the smart AI as a magical employee. Maybe we’re pushing it back to SDLC and we’re going to write magical software instead.”
The resolution of the explanation-to-constraint shift crystallises here. The explanation paradigm imagines AI as an autonomous agent that we need to explain and monitor. The constraint paradigm imagines AI as a powerful perception engine embedded in deterministic governance infrastructure.
The Mature Architecture
LLM = perception + translation + proposal
The AI does what it does best: reading messy documents, extracting facts, classifying intent
Deterministic core = truth table over facts
Policy logic, completeness checks, precedence rules — all deterministic, all testable
IPA / governance plane = authority + provenance + execution gate
The four pillars enforced at runtime — model-agnostic, deterministic, verifiable
CI/CD = verifier of everything that changes
Including policy compiles, model updates, evidence source changes — all version-controlled
This is the mature architecture: not “AI makes decisions and we audit them” but “AI perceives, proposes, and classifies — and the infrastructure decides what’s allowed to execute.”
The trick: AI makes the deterministic infrastructure more powerful — a capable developer with AI assistance can achieve 10–50x productivity, building smarter policy compilers, better evidence extractors, more sophisticated decision graphs. And the deterministic infrastructure makes AI governance tractable. Each strengthens the other.
Key Takeaways
- 1. The shift from explanation to constraint is an architectural change, not an incremental improvement — it resolves three contradictions that the explanation paradigm cannot.
- 2. Governance rigour ↑ AND deployment speed ↑ — pre-authorised structure enables speed, like guardrails on a highway.
- 3. AI autonomy ↑ AND human accountability ↑ — accountability is architectural, not organisational.
- 4. Real-time execution ↑ AND governance depth ↑ — governance runs WITH the decision at sub-millisecond latency.
- 5. Every major framework (EU AI Act, NIST, ISO, SR 11-7) prescribes governance outcomes but not enforcement mechanisms — DAI is the missing enforcement complement.
- 6. Operationally: audits become replay sessions (not archaeological expeditions), accountability is contemporaneous (not reconstructed), and governance enables deployment instead of blocking it.
- 7. The mature architecture: AI as perception engine inside deterministic governance infrastructure — not autonomous agent that we monitor and explain.
Next: Chapter 6 maps Decision Authority Infrastructure against the regulatory landscape — how the four pillars satisfy what EU AI Act, NIST, ISO, and SR 11-7 demand but can’t enforce.
The Regulatory Proof
How DAI Satisfies What Frameworks Demand But Can’t Enforce
EU AI Act full enforcement begins on 2 August 2026, with turnover-based fines.41 The first enforcement action is expected to follow.27
When the regulator arrives, they will ask one question: “Can you prove authority existed before this decision executed?”
If your governance produces policies, dashboards, and post-hoc logs — you have compliance cosplay. If your governance produces decision-time evidence chains — you have Decision Authority Infrastructure. The difference between these two answers is the difference between defending your organisation and defending a fiction.
This chapter maps the four pillars of DAI (Chapter 4) to what the major frameworks actually require — showing that DAI isn’t a replacement for NIST, ISO, or the EU AI Act, but the enforcement layer they all assume exists.
EU AI Act: What It Requires, What It Leaves Open
Article 12 — Record-Keeping
“High-risk AI systems shall technically allow for the automatic recording of events (logs) over the lifetime of the system.” — Article 12, EU AI Act (enforcement: 2 August 2026)
Article 12 is a forensic-readiness requirement. Logs must enable identifying risk situations, facilitating post-market monitoring, and monitoring operations. For biometric systems, it requires recording use periods, reference databases, input data leading to matches, and identification of humans involved in verification. 28
What it doesn’t require: that logs prevent execution. Article 12 ensures evidence exists for investigation, not that governance constrains execution.
Article 14 — Human Oversight
“High-risk AI systems shall be designed and developed in such a way… that they can be effectively overseen by natural persons during the period in which they are in use.” — Article 14, EU AI Act (enforcement: 2 August 2026)
Article 14 requires that humans retain the ability to “decide, in any particular situation, not to use the high-risk AI system or to otherwise disregard, override or reverse the output.” The system must include a “stop button or a similar procedure.” For biometric identification, “no action or decision is taken” unless “separately verified and confirmed by at least two natural persons.” 29
Article 14 is directionally right but architecturally silent. It says WHAT oversight must achieve but not HOW to build it.
Article 17 — Quality Management System
Article 17 requires a documented quality management system: strategy for regulatory compliance, techniques for design and development, validation procedures, and an accountability framework. In October 2025, prEN 18286 became the first harmonised standard translating Article 17 into “a structured, auditable framework.” 30
Article 17 is a documentation and process requirement. It doesn’t require that the QMS be machine-enforceable at runtime. The QMS says “we have an accountability framework” — DAI proves accountability exists at decision time. The QMS says “we have validation procedures” — DAI’s gates implement those procedures as runtime checks.
The Three Articles as a Stack
| Article | Governance Layer | DAI Pillar |
|---|---|---|
| Art 17 (QMS) | Structure & documentation | All four pillars defined in policy |
| Art 14 (Oversight) | Human authority & intervention | Pillar 2 (Fixed Authority) + Pillar 3 (Gated Execution) |
| Art 12 (Logging) | Evidence & traceability | Pillar 1 (Admissible Knowledge) + Pillar 4 (Provable Evidence) |
The EU AI Act builds from documentation (17) to oversight (14) to evidence (12). DAI provides the runtime enforcement that connects all three.
NIST AI RMF: Organisational Scaffolding Without Runtime Teeth
The NIST AI Risk Management Framework provides four core functions — Govern, Map, Measure, Manage — with Govern deliberately cross-cutting the rest. It is “intended for voluntary use and to improve the ability to incorporate trustworthiness considerations into the design, development, use, and evaluation of AI products, services, and systems.” 31
NIST itself acknowledges the limits: “Effective risk management is realised through organisational commitment at senior levels and may require cultural change within an organisation or industry. Use of the AI RMF alone will not lead to these changes.”31
How DAI Complements NIST AI RMF
Govern function
DAI provides the enforcement mechanism that makes governance operational — not just documented
Map function
Admissible Knowledge (Pillar 1) operationalises the mapping of what data and evidence is relevant to each decision type
Measure function
Provable Evidence (Pillar 4) produces measurement data — decision traces, gate outcomes — automatically
Manage function
Gated Execution (Pillar 3) is the management mechanism — ALLOW / PAUSE / DENY based on risk thresholds
NIST provides the org structure; DAI provides the runtime enforcement. Complementary, not competitive.
ISO 42001: Management System Without Execution Constraint
ISO/IEC 42001:2023 “defines formal requirements for AI governance, including risk assessment mandates, control implementation, and lifecycle oversight” and “establishes a systematic, repeatable process for AI compliance, ensuring organisations maintain consistent oversight.” Complemented by ISO/IEC 42005:2025 for AI system impact assessments. 34
ISO 42001 is lifecycle-oriented and process-based. It specifies what governance processes should exist, not how governance is enforced at the moment of AI decision. “Lifecycle” still means design-develop-deploy-monitor, not enforce-at-decision-time.
SR 11-7: The Closest Existing Model — And Why Even It Falls Short
As established in Chapter 3, the Federal Reserve’s SR 11-7 guidance has governed “probabilistic engines” since 2011. Its three safeguards — robust development, effective validation, sound governance — remain the closest existing equivalent to runtime governance for probabilistic systems. 36
SR 11-7 is the closest precedent because it governs probabilistic engines, requires ongoing monitoring comparing outputs to actual outcomes, and demands documentation detailed enough that unfamiliar parties can understand the model’s operation. 38
But SR 11-7 was designed for model validation cycles (periodic review), not real-time decision enforcement. The Bank Policy Institute acknowledged in October 2025 that “the current rigid application of MRM Guidance to banks’ AI use cases has hindered (and continues to hinder) AI adoption.” 39
How DAI Operationalises SR 11-7’s Principles
SR 11-7’s “effective challenge”
DAI’s Gated Execution (Pillar 3): every decision is challenged before execution, not periodically
SR 11-7’s “ongoing monitoring”
DAI’s Provable Evidence (Pillar 4): monitoring is a byproduct of enforcement, not a separate activity
SR 11-7’s “documentation for independent review”
DAI’s decision traces: produced at decision time, immediately reviewable, no reconstruction needed
DAI doesn’t replace SR 11-7 — it makes SR 11-7’s principles enforceable at the speed of runtime AI.
The Unified Mapping: Four Pillars × Four Frameworks
| DAI Pillar | EU AI Act | NIST AI RMF | ISO 42001 | SR 11-7 |
|---|---|---|---|---|
| Admissible Knowledge | Art 12: what data was used, traceable | Map: identify relevant AI risks in context | Data governance requirements | Model inputs documented |
| Fixed Authority | Art 14: human oversight, override ability | Govern: roles, responsibilities, authority | Accountability framework | Sound governance (board oversight) |
| Gated Execution | Art 14: stop button, intervention ability | Manage: risk response, action thresholds | Control implementation | Effective challenge (independent validation) |
| Provable Evidence | Art 12 (logging) + Art 17 (QMS) | Measure: metrics, assessment results | Internal audit requirements | Documentation for independent review |
The mapping shows: DAI’s four pillars aren’t new requirements. They’re the enforcement mechanism for requirements that already exist across every major framework. No framework currently mandates HOW governance is enforced at runtime — they all assume it exists. DAI fills that assumption.
Organisations that implement DAI simultaneously satisfy requirements across multiple frameworks — not through separate compliance programmes but through a single enforcement architecture.
The Enforcement Timeline: What’s Coming
- • EU AI Act enters into force
- • Member states required to have notifying and market-surveillance authorities in place
- • Full enforcement of high-risk system requirements (Articles 12, 14, 17)40
“Enforcement actions against AI deployers increased significantly in 2025, with settlements targeting companies across industries.”41
“Cyber insurance now requires AI-specific security controls. Insurance carriers are introducing AI Security Riders that condition coverage on documented security practices.”42
98% of organisations expect AI governance budgets to increase significantly (OneTrust, “The 2025 AI-Ready Governance Report,” via Knostic).43
The question isn’t whether governance investment is coming — it’s whether that investment buys enforcement infrastructure or more theatre.
“Organisations don’t fail because they adopt AI — they fail because they delegate authority without clarity.”
Key Takeaways
- 1. The EU AI Act (Articles 12, 14, 17) collectively require runtime-grade governance but leave the enforcement mechanism to providers. DAI provides the mechanism.
- 2. NIST AI RMF provides organisational scaffolding (Govern/Map/Measure/Manage); DAI provides runtime enforcement. Complementary, not competitive.
- 3. ISO 42001 establishes the management system; DAI provides the technical enforcement layer the auditor actually tests.
- 4. SR 11-7 has governed probabilistic engines since 2011 — the closest existing model, but even in financial services only 44% properly validate AI. DAI operationalises SR 11-7’s principles at runtime speed.
- 5. The four pillars map directly to requirements across all four frameworks — not new requirements, but the enforcement mechanism for requirements that already exist.
- 6. The investment is coming (98% expect budget increases). The choice is infrastructure or theatre.
Next: Chapter 7 examines how autonomous AI agents multiply the governance surface — and why the four pillars become urgent when agents start making decisions on your behalf.
The Agentic Acceleration
Why Autonomous AI Makes This Urgent
Gartner projects that a substantial share of enterprise workflows will involve autonomous agents by 2026 (via Forbes, Feb 2026).44
Consider what this means: agents negotiating contracts, processing transactions, accessing sensitive data, and making tool calls that modify databases, send emails, and execute payments — all without a standardised way to prove who they are or what they’re authorised to do.
Every autonomous tool call is an ungoverned execution event under current governance frameworks.54
If a single AI decision without runtime governance is compliance cosplay (Chapter 1), then thousands of autonomous agent actions per hour without runtime governance is compliance cosplay at industrial scale. This chapter applies the four pillars of DAI (Chapter 4) to the agentic context — not as theory but as architectural necessity.
Why Agents Are Different: Four Compounding Challenges
Agentic AI doesn’t just make the governance gap wider — it makes the gap qualitatively different.
Challenge 1: Autonomous Multi-Step Execution Outpaces Human Monitoring
Traditional AI
User sends request → model produces output → human reviews
Agentic AI
Agent receives goal → plans steps → executes tool calls → modifies data → calls other agents → produces results — all before any human could possibly review
The execution velocity exceeds the monitoring cadence by orders of magnitude. Post-hoc monitoring is already stale by the time a human looks.
“The complex interactions among agents increase substantially the risk of outcomes becoming more unpredictable.” 45
Challenge 2: Inter-Agent Communication Creates Attack Surfaces
When agents communicate, every message is a potential vector:
- • Agent A asks Agent B to perform an action — is Agent A authorised to make that request?
- • Agent B receives instructions — are those from the legitimate orchestrator or from an injection attack?
- • Data passes between agents — has it been modified, and by whom?
This is the confused deputy problem at scale. Agents inherit a chat-era trust model where all text arrives equal — no channel carries cryptographic proof of origin. Without provenance, agent-to-agent communication is an unverified trust chain — each link can be compromised without the next agent knowing.
Challenge 3: Dynamic Tool-Calling Cannot Be Pre-Audited
Agents select and invoke tools at runtime based on context — the specific tool calls cannot be predicted at design time. Each tool call is a new execution context with potentially different data, different risk profiles, different authority requirements.
Pre-deployment review covers the agent’s tool ACCESS (what it CAN call) but not its tool USE (what it WILL call in any specific context). This is the runtime gap amplified: governance review happens before deployment; execution happens at runtime with unpredictable specifics.
Challenge 4: Authority and Intent Are Implicit
In current agent architectures, authority is typically:
- • Inherited from the user session that spawned the agent (but the agent may outlive the session)
- • Configured in a system prompt (but system prompts can be overridden or injected into)
- • Assumed from the deployment context (but deployment may cover multiple authority levels)
None of these provide cryptographic proof of authority per action.
“AI agents are negotiating contracts, processing transactions, and accessing sensitive data — all without a standardised way to prove who they are or what they’re authorised to do.” 46
The Containment vs Provenance Gap
Containment (Advancing)
“What CAN the agent do?”
- • Scoped keys and tokenisation (SiloOS pattern)
- • Container isolation, network segmentation
- • Capability-scoped tokens, policy walls
- • Deterministic policy enforcement at the gateway (AWS AgentCore)55
Provenance (Missing)
“Who AUTHORISED this action?”
- • No platform provides cryptographic proof of origin per action
- • All text arrives equal — no channel differentiates source
- • Authority inherited, not verified
- • The confused deputy problem at scale
You can scope what an agent CAN do — but can you prove that THIS specific action was authorised by someone with the right to authorise it? Containment constrains capability; provenance proves authority. You need both.
Applying the Four Pillars to Agentic AI
The four pillars (Chapter 4) applied to the agentic context — same framework, different execution surface.
Pillar 1: Admissible Knowledge → Agent Evidence Bundles
Challenge: Agents access data from multiple sources, tool calls, and other agents — the evidence surface is vast and dynamic.
Application: Each agent task receives a scoped evidence bundle assembled by the orchestration layer. Only verified, signed data sources enter the bundle. Tool call results are signed by the tool service. Inter-agent data carries provenance metadata. Context from previous steps is either signed or treated as untrusted and re-derived.
Prevents: agents acting on unverified data, poisoned tool outputs, fabricated inter-agent messages, stale or manipulated context.
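The bundling step can be sketched in a few lines. Everything here is illustrative: the source names, field layout, and shared HMAC keys are placeholders for what would be asymmetric signatures and a key registry in production.

```python
import hashlib
import hmac
import json

# Hypothetical shared keys per trusted source (a real system would use
# asymmetric signatures and a key registry, not shared secrets).
SOURCE_KEYS = {"claims-db": b"key-1", "policy-store": b"key-2"}

def sign_entry(source: str, payload: dict) -> dict:
    """A trusted source signs its payload over canonical JSON."""
    body = json.dumps(payload, sort_keys=True).encode()
    sig = hmac.new(SOURCE_KEYS[source], body, hashlib.sha256).hexdigest()
    return {"source": source, "payload": payload, "sig": sig}

def admissible(entry: dict) -> bool:
    """Admissible only if the source is known and the signature verifies."""
    key = SOURCE_KEYS.get(entry.get("source"))
    if key is None:
        return False
    body = json.dumps(entry["payload"], sort_keys=True).encode()
    expected = hmac.new(key, body, hashlib.sha256).hexdigest()
    return hmac.compare_digest(expected, entry["sig"])

def build_bundle(candidates: list) -> list:
    """The orchestration layer assembles the scoped bundle: verified entries only."""
    return [e for e in candidates if admissible(e)]

signed = sign_entry("claims-db", {"claim_id": 12847, "amount": 35000})
forged = {"source": "claims-db", "payload": {"amount": 1}, "sig": "deadbeef"}
bundle = build_bundle([signed, forged])  # the forged entry never enters the bundle
```

The design point: admissibility is enforced by the environment assembling the bundle, not by instructions in a prompt.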
Pillar 2: Fixed Authority → Scoped Capability Tokens
Challenge: Who authorised THIS specific agent action? Authority is currently implicit, inherited, or assumed.
Application: Every agent action is gated by a scoped capability token specifying: who authorised (identity), what’s allowed (action scope), what data can be accessed (evidence scope), when it expires (time-bound), what risk tier this covers (authority level).
“The significance of this shift lies in delegated authority. As software begins to act on behalf of users… security controls anchored solely in users, roles, and static permissions will become less effective. What matters instead is whether each agent deployment is bound to a clearly defined purpose, with explicit boundaries around data access, tool usage, and decision scope.” 47
Prevents: authority inflation, persistence attacks, confused deputy attacks.
Pillar 3: Gated Execution → Per-Action Enforcement Boundaries
Challenge: Every tool call is an execution event — how do you gate thousands of tool calls per hour?
Application: Every tool call hits an enforcement boundary. The IPA evaluates each action against capability token scope, risk tier, irreversibility, and evidence admissibility. ALLOW (routine, within scope) / PAUSE (exceeds pre-authorised risk) / DENY (outside scope, blocked).
Latency: Policy-as-code frameworks demonstrate sub-millisecond enforcement26 (see Chapter 5). The gate doesn’t need to “understand” the action — it evaluates conditions against policy rules. Deterministic checks are fast.
Singapore IMDA explicitly calls for “defining action boundaries requiring human approval, such as high-stake or irreversible actions (editing sensitive data or permanently deleting data), or outlier and atypical behaviours.” 48
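The gate itself can be sketched as a pure function over the action, token, and policy. Field names and reason codes are hypothetical; the point is that there is no model inference on this path, only condition checks, which is why evaluation stays fast and replayable.

```python
from enum import Enum

class Outcome(Enum):
    ALLOW = "allow"   # routine, within scope
    PAUSE = "pause"   # exceeds pre-authorised risk: escalate to a human
    DENY = "deny"     # outside scope: blocked

def gate(action: dict, token: dict, policy: dict) -> tuple:
    """Deterministic per-action enforcement boundary: evaluates
    conditions against the token and policy, returns outcome + reason."""
    if action["type"] not in token["actions"]:
        return Outcome.DENY, "ACTION_OUT_OF_SCOPE"
    if not action["evidence_admissible"]:
        return Outcome.DENY, "INADMISSIBLE_EVIDENCE"
    if action["risk_tier"] > token["risk_ceiling"]:
        return Outcome.PAUSE, "EXCEEDS_PREAUTHORISED_RISK"
    if action["irreversible"] and policy["pause_on_irreversible"]:
        return Outcome.PAUSE, "IRREVERSIBLE_ACTION"
    return Outcome.ALLOW, "WITHIN_SCOPE"

token = {"actions": {"assess_claim", "fetch_record"}, "risk_ceiling": 2}
policy = {"pause_on_irreversible": True}
routine = {"type": "assess_claim", "evidence_admissible": True,
           "risk_tier": 1, "irreversible": False}
```

Because every branch is deterministic, the same inputs always produce the same outcome and reason code, which is what makes the audit trail replayable rather than reconstructed.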
Pillar 4: Provable Evidence → Signed Receipts Per Action
Challenge: Agents execute dozens to hundreds of actions per task — how do you produce provable evidence at this volume?
Application: Every gated action produces a signed receipt containing: action ID, capability token reference, evidence bundle references, gate outcome, reason codes, timestamp, signature. Receipts chain together to form an end-to-end trace for the agent task.
“A full AI agent trust stack is emerging across at least four dimensions: the identity layer… the operational boundaries layer… the runtime behaviour layer… and the governance and compliance layer, which proves that all of it meets regulatory and internal policy standards.” 49
Produces: A complete, replayable, auditable trace of every agent action — produced by the system, not reconstructed by investigators.
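The chaining mechanics can be sketched with a simple hash chain: each receipt commits to its predecessor, so tampering anywhere breaks verification everywhere after it. A deployed system would also sign each receipt; the signature is omitted here and the field names are illustrative.

```python
import hashlib
import json
import time

def _digest(body: dict) -> str:
    return hashlib.sha256(json.dumps(body, sort_keys=True).encode()).hexdigest()

def append_receipt(chain: list, action_id: str, token_ref: str,
                   outcome: str, reason: str) -> dict:
    """Each receipt carries its predecessor's hash, making the task's
    trace tamper-evident end to end."""
    body = {"action_id": action_id, "token_ref": token_ref,
            "outcome": outcome, "reason": reason, "ts": time.time(),
            "prev": chain[-1]["hash"] if chain else "genesis"}
    receipt = {**body, "hash": _digest(body)}
    chain.append(receipt)
    return receipt

def verify_chain(chain: list) -> bool:
    """Audit is replay: recompute every hash and link from the start."""
    prev = "genesis"
    for r in chain:
        body = {k: v for k, v in r.items() if k != "hash"}
        if r["prev"] != prev or _digest(body) != r["hash"]:
            return False
        prev = r["hash"]
    return True

chain = []
append_receipt(chain, "act-001", "tok-42", "ALLOW", "WITHIN_SCOPE")
append_receipt(chain, "act-002", "tok-42", "PAUSE", "EXCEEDS_PREAUTHORISED_RISK")
```

Verification is cheap enough to run continuously, which is what turns the trace from a forensic artefact into live evidence.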
The Governance Frameworks Are Catching Up
Singapore IMDA (January 2026)
World’s first agentic AI governance framework. Five risk categories: erroneous action, unauthorised actions, biased/unfair actions, data breaches, disruption to connected systems.
Key requirement: “define action boundaries requiring human approval” for high-stake or irreversible actions.16
Aligns directly with DAI: action boundaries (Pillar 3), human approval gates (Pillar 2), real-time monitoring producing evidence (Pillar 4).
UC Berkeley (February 2026)
Agentic AI Risk-Management Standards Profile. Names “unintended goal pursuit, unauthorised privilege escalation or resource acquisition, self-replication or resistance to shutdown” as unique challenges.
“These unique challenges complicate traditional, model-centric risk-management approaches and demand system-level governance.”17
“System-level governance” is precisely what DAI provides — governance embedded in the execution system, not applied to models from outside.
NIST Cybersecurity AI Profile (December 2025)
NIST IR 8596: “Because AI can operate autonomously, ensure that all internal AI systems are properly monitored for anomalous activity.” 50
Direction of travel: runtime monitoring and enforcement for autonomous AI — not just design-time assessment.
Worked Example: Agent Processing an Insurance Claim
A concrete scenario to make the four pillars tangible. An insurance claim processing agent receives a new claim and must assess, gather evidence, apply policy, and produce a recommendation.
Insurance Claim #12847: $35K Vehicle Damage
Without DAI
Agent receives claim → accesses all available data → processes against full policy → produces APPROVE/DENY → logs result → human may review later.
- • Who authorised the data access? Inherited from deployment config
- • Who authorised applying this policy version? Assumed from system setup
- • Can we prove what data was used? If the model self-reports accurately
- • Can we prove the recommendation was reasonable? Only through post-hoc investigation
With DAI
Pillar 1: Admissible Knowledge
Evidence bundler assembles scoped bundle: claim form (signed), customer record (signed), policy v3.2 (signed, current), relevant precedents (signed). Agent never sees data outside this bundle.
Pillar 2: Fixed Authority
Agent’s capability token covers “claim assessment up to $50K, standard risk tier.” This claim is $35K — within scope. If it were $75K, gate would PAUSE for senior authority.
Pillar 3: Gated Execution
Agent produces structured recommendation (APPROVE, reason codes, evidence references). IPA evaluates: evidence admissible? authority sufficient? risk within threshold? All conditions met → ALLOW.
Pillar 4: Provable Evidence
Complete evidence chain produced: evidence bundle refs, authority attestation, gate outcome, reason codes, policy version, execution trace. When auditor asks about claim #12847 — the system produces it instantly.
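The scenario's gate step reduces to a few lines. The $50K ceiling, claim amount, and reason codes come from the example above; the function and field names are placeholders, not a production schema.

```python
def assess(recommendation: dict, token_limit: int = 50_000) -> tuple:
    """Evaluate the agent's structured recommendation against its
    capability token's authority ceiling (sketch, not a real schema)."""
    if not recommendation["evidence_refs"]:
        return "DENY", "NO_ADMISSIBLE_EVIDENCE"
    if recommendation["amount"] > token_limit:
        return "PAUSE", "EXCEEDS_AUTHORITY_LIMIT"  # escalate to senior authority
    return "ALLOW", "WITHIN_SCOPE"

claim_12847 = {"amount": 35_000,
               "evidence_refs": ["claim-form", "customer-record", "policy-v3.2"]}
```

Running the $35K claim through this gate yields ALLOW; the same claim at $75K would PAUSE for senior authority, exactly as described in the scenario.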
We’re Not Building Magical Employees
“We’re not building a general-purpose autonomous agent with every tool in the world saying ‘just go and do what you want.’ We can’t build those sorts of tools yet — there are too many problems.”
The mature architecture for agentic AI is NOT: give the agent full access and monitor what it does. It IS: give the agent scoped access, gate every action, produce evidence by design, and earn autonomy through demonstrated reliability.
The “Earn Autonomy Through Evidence” Path
Start
Highly constrained agent with tight evidence bundles, frequent PAUSE gates, low risk ceiling
Demonstrate
Reliability data from Provable Evidence traces — system has proof the agent operates within bounds
Earn
Wider capability scope, higher risk thresholds, more ALLOW vs PAUSE gates — graduated based on evidence, not hope
This is the same “reversible → irreversible → high-stakes” graduation from the Architecture, Not Vibes framework.
Key Takeaways
- 1. Agentic AI creates four compounding governance challenges: autonomous multi-step execution, inter-agent attack surfaces, dynamic tool-calling that can’t be pre-audited, and implicit authority without cryptographic proof.
- 2. Containment (what an agent CAN do) is advancing; provenance (who AUTHORISED the action) is missing everywhere. You need both.
- 3. The four pillars apply directly: scoped evidence bundles (Pillar 1), capability tokens (Pillar 2), per-action enforcement boundaries (Pillar 3), signed receipts per action (Pillar 4).
- 4. Singapore IMDA and UC Berkeley frameworks both explicitly require pre-execution action boundaries and system-level governance — the direction of travel matches DAI.
- 5. We’re not building magical employees — we’re building constrained perception engines inside governed pipelines, earning autonomy through evidence.
Next: Chapter 8 brings it all together — the diagnostic to determine whether your governance is compliance cosplay or infrastructure, and what to do about it.
The Decision
Compliance Cosplay or Governance Infrastructure
“If you keep pushing this line, you end up with something that looks like Zero Trust for decisions: models don’t get to ‘do’, they only get to ‘suggest’, and every suggestion hits an enforcement boundary that can prove what it relied on.”
This ebook has walked from diagnosis — your AI governance is compliance cosplay — through physics — governance has gravity, it follows execution — through analogy — zero trust for decisions — through architecture — the four pillars of Decision Authority Infrastructure — through application — regulatory mapping, agentic AI.
Now the question: what do you do about it?
This chapter is a decision instrument — a way to diagnose your current state and determine your next move. Not theory. Not aspiration. A self-assessment you can run on Monday morning.
The Compliance Cosplay Diagnostic: Three Questions
Three questions that any CTO, CISO, or CIO can ask about their AI governance today. Answer honestly. The answers determine whether you have governance infrastructure or governance theatre.
Can your system technically prevent an unauthorised AI decision from executing right now?
Not “do you have a policy that says only authorised decisions should execute.”
Not “do you have a dashboard that shows what decisions were made.”
Specifically: is there a technical enforcement mechanism that will BLOCK an AI decision if authority, admissibility, or risk conditions are not met?
If Yes
You have some form of runtime governance. The question is whether it covers all four pillars (Chapter 4) or just some.
If No
You have compliance cosplay. Your governance is documentation — it can describe what should happen but cannot prevent what shouldn’t.
Remember nH Predict (Chapter 1): the policy said “AI only guides.” Nothing prevented execution. 90% overturn rate on appeal.
Can you prove — from system-generated evidence, not reconstructed logs — who had authority over this specific decision before it executed?
Not “can you find a log entry that shows when the decision was made.”
Not “can you identify who was on the governance committee that approved the model.”
Specifically: does the system produce, at decision time, a signed evidence chain that proves who had authority, what evidence was admissible, what the gate outcome was, and why?
If Yes
You have Provable Evidence (Pillar 4) and Fixed Authority (Pillar 2). You can answer a regulator’s question in minutes, not months.
If No
You have forensic artefacts, not governance. When the question comes — “who authorised this?” — you’ll be reconstructing, not replaying.
When that moment of authorisation isn't embedded into the system, accountability still exists, but only retroactively. Evidence has to be reconstructed. Responsibility becomes narrative. Trust erodes under pressure.
If a regulator asked “show me the decision-time evidence chain for this action” — could you produce it without investigation?
Not “could you produce a report that describes your governance framework.”
Not “could you show the model’s training data and validation results.”
Specifically: for THIS decision, on THIS date, affecting THIS person — can you produce the evidence chain in minutes?
If Yes
You have governance infrastructure. Audit is replay, not investigation.
If No
You have monitoring, not governance. Your evidence is circumstantial, not structural. Under the EU AI Act (effective August 2026)58, “we’ll investigate and get back to you” may not be sufficient.
The shift from Chapter 5: audits as archaeological expeditions vs replay sessions.
The Cost of Inaction
Compliance cosplay isn’t free. It carries three compounding costs — regulatory, trust, and value — each reinforcing the others.
The Regulatory Cost
EU AI Act enforcement begins August 2026. Penalties reach €15M or 3% of worldwide turnover. The first major enforcement action — likely targeting a high-risk system in hiring or credit — is expected in the €10–30M range.51
Enforcement actions against AI deployers increased significantly in 2025, with settlements targeting companies across industries.52
Cyber insurance carriers are now introducing “AI Security Riders” — coverage conditioned on documented security practices. Governance that can’t demonstrate runtime enforcement may void coverage when it matters most.53
The Trust Cost
When a decision fails and you can’t prove who had authority — trust erodes. Boards, regulators, customers, and courts all ask the same question: “Who authorised this?”
Under compliance cosplay: the answer requires investigation, narrative construction, and hope that the logs are sufficient. Evidence is circumstantial. Accountability is retroactive.
Under Decision Authority Infrastructure: the answer is a system-generated evidence chain, produced at decision time. Evidence is structural. Accountability is contemporaneous.
The difference between these two answers is the difference between defending your organisation and defending a fiction.
The Value Cost
Compliance cosplay doesn’t just create risk — it blocks value. Governance that can’t enable deployment prevents the organisation from extracting AI value.
60% of organisations will fail to realise AI value by 2027 due to incohesive governance54
80% of AI pilots fail to reach production — despite 70% succeeding technically55
OneTrust, “The 2025 AI-Ready Governance Report”56
The investment is coming regardless. Ninety-eight percent of organisations expect AI governance budgets to increase significantly. The question isn’t whether to spend — it’s whether that investment buys enforcement infrastructure or more theatre.
The governance gap isn’t just about policy — it’s about enforcement, role clarity, and technical gatekeeping. Every dollar spent on governance documentation without runtime enforcement is a dollar spent on compliance cosplay.57
What to Do Next
This ebook establishes the WHY (governance gap, governance gravity) and the WHAT (zero trust for decisions, four pillars of Decision Authority Infrastructure). The detailed HOW belongs to future articles — but here is the directional guidance you can act on now.
Step 1 Audit Your Current Governance — Does Any of It Run at Decision Time?
Map every AI governance mechanism you have: policies, committees, dashboards, logs, monitoring tools, model cards, bias assessments.
For each mechanism, ask: does this run BEFORE execution, or AFTER?
Categorise:
Preventive
Runs before execution, can block
Detective
Runs after execution, can detect
Documentary
Exists as documentation, doesn’t run at all
Most organisations will find: nearly everything is Detective or Documentary. Almost nothing is Preventive. Your compliance cosplay surface is the gap between what you document and what you enforce.
Step 2 Map Your AI Decisions by Irreversibility
Not all AI decisions need runtime gates; enforcement is required only where decisions cross legal, operational, or ethical boundaries.
Reversible — drafting content, generating reports, internal suggestions → lighter governance appropriate
Irreversible — executing payments, denying claims, sending legal notices → runtime gates required
High-stakes irreversible — regulatory submissions, medical decisions, credit decisions → full DAI enforcement mandatory
Start with the high-stakes irreversible decisions. That’s where compliance cosplay has the highest cost.
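One way to make the mapping operational is a small lookup from decision type to required controls. The three tiers follow the text; the decision names and control flags are hypothetical placeholders for your own inventory.

```python
# Illustrative only: tiers follow the text, decision names are placeholders.
TIER_CONTROLS = {
    "reversible": {"runtime_gate": False, "evidence_chain": False},
    "irreversible": {"runtime_gate": True, "evidence_chain": True},
    "high_stakes_irreversible": {"runtime_gate": True, "evidence_chain": True,
                                 "human_approval": True},
}

DECISION_TIERS = {
    "draft_content": "reversible",
    "generate_report": "reversible",
    "execute_payment": "irreversible",
    "deny_claim": "irreversible",
    "credit_decision": "high_stakes_irreversible",
    "regulatory_submission": "high_stakes_irreversible",
}

def required_controls(decision: str) -> dict:
    """Unmapped decisions default to the strictest tier: when in doubt, gate it."""
    tier = DECISION_TIERS.get(decision, "high_stakes_irreversible")
    return TIER_CONTROLS[tier]
```

The fail-closed default matters: a decision nobody has classified should get the most enforcement, not the least.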
Step 3 For High-Risk Decisions — Can the System Technically Prevent Execution Without Authority?
For each high-risk decision identified in Step 2, run the three diagnostic questions above.
Where the answer is “no” — that’s your compliance cosplay surface, and your priority for DAI implementation.
The gap between your policy and your runtime is the gap that regulators, auditors, and courts will find.
Step 4 Start Building the Enforcement Layer
The principles are clear from this ebook; the implementation details are next:
- • Article 2 (forthcoming): How to build the decision DAG — breaking complex decisions into separately governable nodes, proof-carrying proposals, adversarial verification chains
- • Article 3 (forthcoming): How SDLC becomes governance — the policy compiler pattern, treating policies like source code, CI/CD for decision systems
- • Article 4 (forthcoming): Regulatory deep dive — detailed compliance mapping for EU AI Act, NIST AI RMF, ISO 42001, SR 11-7
In the meantime: the architecture is the IPA pattern (Chapter 4) — LLM proposes, deterministic gate enforces. Start with the highest-risk decision and build the enforcement boundary.
The Series Roadmap
This ebook is the first in a series on Decision Authority Infrastructure. Each subsequent article builds on the foundation established here.
WHY + WHAT: Compliance Cosplay → DAI
Why is current governance insufficient? What does runtime enforcement look like?
HOW (Decision): Proof-Carrying Proposals
How do you break complex decisions into separately governable nodes?
HOW (SDLC): Policy Compiler Pattern
How does the development lifecycle become governance infrastructure?
WHERE (Regulatory): Compliance Mapping
How does DAI map to specific regulatory requirements in detail?
The open questions set aside earlier are explicitly allocated to these future articles: not deferred, planned.
Magical Software, Not Magical Employees
“It’s not that we want the smart AI as a magical employee. Maybe we’re pushing it back to SDLC and we’re going to write magical software instead.”
The resolution of the compliance cosplay problem isn’t “better policies” or “more monitoring” or “stronger prompts.” It’s an architectural shift: from trusting AI to constraining AI. From explaining decisions to constraining them. From governance as documentation to governance as infrastructure.
The four pillars — Admissible Knowledge, Fixed Authority, Gated Execution, Provable Evidence — are the blueprint for governance that runs with the decision, not after it. Each pillar answers a question that every AI decision must resolve before it executes. Together, they constitute Decision Authority Infrastructure: the missing enforcement layer that every major framework assumes exists.
The zero trust analogy is precise: just as “trust is a vulnerability” in network security, implicit trust in AI governance is the vulnerability that compliance cosplay exposes. Zero trust networking didn’t make networks slower. It made them safer AND enabled more distributed, autonomous operation. Decision Authority Infrastructure does the same for AI governance.
“The biggest risk in AI isn’t the model.
It’s the authority we give it.”
Governance that runs after the decision is already too late.
Governance that runs with the decision changes system behaviour.
What Now?
If you’re working with AI systems where decisions cross legal, operational, or ethical boundaries:
- • Run the three-question diagnostic on your current governance
- • Map your highest-risk AI decisions by irreversibility
- • Ask: “If a regulator asked for the decision-time evidence chain, could we produce it in minutes?”
If the answer is “no” to any of these — you know where your compliance cosplay surface is.
If that describes the systems you work with, I'd be interested in comparing notes on how authority and execution are actually handled today.
Key Takeaways
- 1. The compliance cosplay diagnostic: three questions that reveal whether your governance is infrastructure or theatre. Can the system prevent unauthorised execution? Can you prove authority from system evidence? Can you produce the evidence chain without investigation?
- 2. The cost of inaction: regulatory penalties (€15M / 3% turnover), eroded trust when accountability is retroactive, and blocked AI value when governance can’t enable deployment.
- 3. What to do next: audit current governance for decision-time enforcement, map decisions by irreversibility, identify the compliance cosplay surface, and start with the highest-risk decisions.
- 4. The series continues: Article 2 (decision DAG and proof-carrying proposals), Article 3 (SDLC as governance and policy compiler pattern), Article 4 (detailed regulatory compliance mapping).
- 5. The closing frame: not magical employees, but magical software. Governance as infrastructure, not documentation. The four pillars are the blueprint. The IPA is the enforcement point. And the biggest risk in AI isn’t the model — it’s the authority we give it.
References & Sources
Primary sources, regulatory frameworks, and practitioner research cited throughout this ebook.
This ebook draws on regulatory texts, industry research (2024–2026), financial services governance precedent, and practitioner frameworks developed through enterprise AI transformation consulting. All URLs were verified as of February 2026. Some academic and regulatory resources may require institutional access.
Case Studies
[1, 2] Class action lawsuit against UnitedHealth AI claim denials
90% appeal overturn rate for nH Predict algorithm [1]; STAT investigation into UnitedHealth pressuring employees to follow AI predictions, setting a 1% length-of-stay target [2]. Core evidence for the governance gap thesis.
https://www.healthcarefinancenews.com/news/class-action-lawsuit-against-unitedhealth-ai-claim-denials-advances
[3] Lawsuit over AI usage by Medicare Advantage plans allowed to proceed
DLA Piper analysis of US District Court ruling (Feb 2025) allowing class action involving AI “in lieu of physicians” to proceed.
https://www.dlapiper.com/en/insights/publications/ai-outlook/2025/lawsuit-over-ai-usage-by-medicare-advantage-plans-allowed-to-proceed
Regulatory Frameworks
[11, 28] EU AI Act — Article 12: Record-Keeping
Automatic logging requirements for high-risk AI systems. Enforcement begins 2 August 2026.
https://artificialintelligenceact.eu/article/12/
[25, 29] EU AI Act — Article 14: Human Oversight
Human oversight requirements, stop-button mandate, dual verification for biometric systems. Closest the Act comes to prescribing runtime enforcement.
https://artificialintelligenceact.eu/article/14/
[30] EU AI Act — Article 17: Quality Management System (prEN 18286:2025)
First harmonised standard for AI QMS, translating Art. 17 into a structured, auditable framework.
https://www.vde.com/topics-en/artificial-intelligence/blog/eu-ai-act--quality-management-system
[31] NIST AI Risk Management Framework
Govern/Map/Measure/Manage framework for AI risk management. Voluntary and organisational in focus.
https://www.nist.gov/itl/ai-risk-management-framework
[50] NIST IR 8596 — Cybersecurity Framework Profile for AI (Dec 2025)
Runtime monitoring requirements for autonomous AI systems. Direction of travel toward enforcement.
https://nvlpubs.nist.gov/nistpubs/ir/2025/NIST.IR.8596.iprd.pdf
[34, 35] ISO/IEC 42001:2023 — AI Management System Standard
Formal requirements for AI governance, risk assessment, and lifecycle oversight. Complemented by ISO/IEC 42005:2025 for impact assessments.
https://aws.amazon.com/blogs/security/ai-lifecycle-risk-management-iso-iec-420012023-for-ai-governance/
[21, 36] SR 11-7: Guidance on Model Risk Management (Federal Reserve, 2011)
Mandatory governance framework for probabilistic engines in financial services since 2011. Three safeguards: robust development, effective validation, sound governance.
https://www.federalreserve.gov/supervisionreg/srletters/sr1107.htm
[40, 58] IAPP Global Legislative Predictions 2026
EU AI Act enforcement timeline and regulatory outlook for 2026. EU AI Act becomes fully applicable August 2026; member states required to have notifying and market-surveillance authorities in place since August 2025.
https://iapp.org/resources/article/global-legislative-predictions
Industry Research & Governance Statistics
[4, 7, 10] Pacific AI — 2025 AI Governance Survey
75% have AI policies [4]; only 36% have formal governance framework; 75% vs 36% policy-framework gap [7]; 48% monitor production AI for accuracy, drift, and misuse — dropping to 9% among small companies [10].
https://pacific.ai/2025-ai-governance-survey/
[8] Knostic — The 20 Biggest AI Governance Statistics and Trends of 2025
7% fully embedded governance (Trustmarque); 97% of AI breach victims lacked access controls (IBM); 25% fully implemented governance (AuditBoard); 98% expect budget increases (OneTrust).
https://www.knostic.ai/blog/ai-governance-statistics
[9] Acuvity — 2025: The Year AI Security Became Non-Negotiable
70% lack optimised governance; 39% operate below managed governance levels.
https://acuvity.ai/2025-the-year-ai-security-became-non-negotiable/
[5, 13, 15] RAGN World — The 2025 Responsible AI Governance Landscape
Shadow human decisioning in financial services; SHAP/LIME insufficient for regulatory requirements [5, 15]; first major EU AI Act enforcement action expected in the €10–30M range targeting hiring or credit AI systems [13].
https://www.aigl.blog/content/files/2026/02/THE-2025-RESPONSIBLE-AI-GOVERNANCE-LANDSCAPE-FROM-PRINCIPLES-TO-PRACTICE.pdf
[6] BIS Financial Stability Institute — Managing AI Explainability (2025)
Explainability confirmed as top regulatory concern. Black-box nature of AI algorithms the primary issue.
https://www.bis.org/fsi/fsipapers24.pdf
[23] ArticleSledge — Model Risk Management Guide 2025
Only 44% of banks properly validate AI tools under SR 11-7.
https://www.articsledge.com/post/model-risk-management
[22] ValidMind — SR 11-7 Model Risk Management Compliance (Oct 2025)
SR 11-7 principles now applied to AI, ML, and generative AI systems.
https://validmind.com/blog/sr-11-7-model-risk-management-compliance/
[24, 39] Bank Policy Institute — OSTP AI RFI Response (Oct 2025)
MRM guidance "hindered and continues to hinder" AI adoption in banking. Cited in Chapters 3 and 6 regarding SR 11-7 rigidity constraining AI adoption.
https://bpi.com/wp-content/uploads/2025/10/BPI-OSTP-AI-RFI-Response-10.27.25.pdf
[14, 51] Nitor Infotech — Explainable AI in 2025: Navigating Trust and Agency
Post-hoc explanation techniques (SHAP, LIME) explain less than 40% of model behaviour for complex decisions, citing Meta 2023 research. Explainability identified as key AI risk by 40% of organisations, yet only 17% actively mitigate it.
https://www.nitorinfotech.com/blog/explainable-ai-in-2025-navigating-trust-and-agency-in-a-dynamic-landscape/
[38] HES FinTech — AI in Lending: AI Credit Regulations 2025
SR 11-7’s three fundamental safeguards becoming global standards.
https://hesfintech.com/blog/all-legislative-trends-regulating-ai-in-lending/
[52] Gartner — 2024 AI Governance Failure Prediction
60% of organisations will fail to realise AI value by 2027 due to incohesive governance. Cited as the primary cost-of-inaction statistic for runtime governance gaps.
https://www.rapidionline.com/blog/data-integration-trends-markets
[53] AIM Research Councils — AI Insights in 2025: Scale is the Strategy
70% of AI pilots succeed technically but 80% fail to reach production due to governance gaps. Evidence that the governance gap is a value-extraction problem, not merely a risk problem.
https://councils.aimmediahouse.com/ai-insights-in-2025-shows-scale-is-the-strategy/
[54] Deloitte — State of AI in the Enterprise 2026
85% of enterprises plan agentic AI deployments, but only 21% have governance frameworks ready — a 4:1 ambition-to-readiness gap. Primary evidence that agentic AI expansion is outpacing governance infrastructure.
https://www.deloitte.com/us/en/about/press-room/state-of-ai-report-2026.html
Agentic AI & Security
[16, 17, 45, 48] Giovanni Coletta — Agentic AI Governance Frameworks 2026 (Feb 2026)
Singapore IMDA (world’s first agentic governance framework) and UC Berkeley Agentic AI Risk-Management Standards Profile. Five risk categories. Action boundaries for human approval. UC Berkeley: “system-level governance” required for autonomous AI.
https://medium.com/@giovannicoletta/agentic-ai-governance-frameworks-2026-risks-oversight-and-emerging-standards-c15bf2e6efca
[18, 44, 46, 49] Forbes — 40% Of Workflows Will Run On Agentic AI (Feb 2026)
Gartner projection: 40% of enterprise workflows will involve autonomous agents by 2026. Four-dimension AI agent trust stack (identity, operational boundaries, runtime behaviour, governance). Agent identity crisis: no standardised way to prove who agents are or what they’re authorised to do.
https://www.forbes.com/sites/digital-assets/2026/02/13/40-of-workflows-will-run-on-agentic-ai-wheres-the-identity/
[47] Lasso Security — Enterprise AI Security Predictions 2026
Delegated authority shift. Security controls must move from static permissions to purpose-bound agent deployments.
https://www.lasso.security/blog/enterprise-ai-security-predictions-2026
[41, 12] Kiteworks — AI Regulation 2026: The Complete Survival Guide
Enforcement actions increasing. AI Security Riders for cyber insurance. EU AI Act penalties.
https://www.kiteworks.com/cybersecurity-risk-management/ai-regulation-2026-business-compliance-guide/
Zero Trust & Architecture
[19] IBM — The Evolution of Zero Trust and the Frameworks that Guide It
History from Kindervag (2010) through BeyondCorp (2014) to NIST SP 800-207.
https://www.ibm.com/think/insights/the-evolution-of-zero-trust-and-the-frameworks-that-guide-it
[20] NIST — Zero Trust Cybersecurity: Never Trust, Always Verify
NIST codification of zero trust principles.
https://www.nist.gov/blogs/taking-measure/zero-trust-cybersecurity-never-trust-always-verify
[59] Patrick McFadden / Thinking OS — Decision Intelligence Has 3 Layers (Feb 2026)
Independent derivation of Propose-Commit-Remember architecture. “Most organisations pour effort into #1 and #3. Almost nobody clearly owns #2.” The Commit layer — the pre-execution authority gate — is almost universally missing from enterprise AI stacks.
https://www.thinkingoperatingsystem.com/decision-intelligence-has-3-layers-most-stacks-only-govern-two
[32] Michael Hannecke — Agentic Tool Sovereignty (2025)
NIST AI RMF and ISO 42001 not designed for autonomous runtime decisions.
https://medium.com/@michael.hannecke/agentic-tool-sovereignty-the-eu-ai-act-problem-nobody-saw-coming-869325cd8352
[33] Cloud Security Alliance — AAGATE: NIST AI RMF-aligned Governance (Dec 2025)
Traditional AppSec not designed for self-directed reasoning systems.
https://cloudsecurityalliance.org/blog/2025/12/22/aagate-a-nist-ai-rmf-aligned-governance-platform-for-agentic-ai
[26] Ethyca — Governing Enterprise Data & AI with Policy-as-Code (Sep 2025)
Sub-millisecond governance enforcement through policy-as-code frameworks.
https://www.ethyca.com/news/how-to-govern-data-and-ai-with-a-policy-as-code-approach
Redwood — AI And Automation Trends 2026
Governance must move from policy decks to an operating model.
https://www.redwood.com/article/ai-automation-trends/
Modern Data — 5 Best Practices for AI Governance in 2025
Policy-as-code integration into AI infrastructure and MLOps workflows.
https://www.moderndata101.com/blogs/5-best-practices-for-ai-governance-in-2025
[55] AWS AgentCore — Deterministic Policy Enforcement
Deterministic policy enforcement at the gateway for Bedrock agents. The AI literally cannot exceed its configured scope — architectural constraint rather than policy aspiration.
https://docs.aws.amazon.com/bedrock/latest/userguide/agentcore-policy.html
[60] OWASP — LLM01:2025 Prompt Injection (Top 10 for LLM Applications)
Prompt injection ranked as the #1 LLM security risk. Covers direct and indirect attacks that bypass prompt-level controls, including context poisoning and “ignore previous instructions” exploits. Foundational evidence for why Admissible Knowledge must be an environmental control, not a prompt-level instruction.
https://genai.owasp.org/llmrisk/llm01-prompt-injection/
LeverageAI / Scott Farrell
Practitioner frameworks and interpretive analysis developed through enterprise AI transformation consulting. These frameworks are integrated as author voice throughout the ebook and listed here for transparency.
Architecture, Not Vibes: AI Doesn’t Fear Death
Trust hierarchy (Vibes → Monitoring → Architecture). Five design principles for AI containment. “Can’t beats shouldn’t.”
https://leverageai.com.au/wp-content/media/AI_Doesnt_Fear_Death_You_Need_Architecture_Not_Vibes_For_Trust_ebook.html
Agent Provenance Stack: OpenClaw Has a Provenance Problem
Four-layer provenance model (Identity, Intent, Artefact, Execution). Containment vs provenance axis. Confused deputy problem at scale.
https://leverageai.com.au/openclaw-has-a-provenance-problem-and-so-does-every-agent-platform/
SiloOS: The Agent Operating System for AI You Can’t Trust
Policy vs Architecture distinction. Zero trust for AI agents. Scoped keys, tokenisation, stateless execution.
https://leverageai.com.au/wp-content/media/SiloOS.html
The Simplicity Inversion
Governance arbitrage concept. Design-time AI through existing SDLC vs runtime AI requiring novel infrastructure.
https://leverageai.com.au/wp-content/media/The_Simplicity_Inversion_ebook.html
Look Mum, No Hands: Human-AI Interfaces
Evidence Layer concept. Opacity hiding problems. Decision navigation UI.
https://leverageai.com.au/wp-content/media/Look_Mum_No_Hands.html
The Intelligent RFP: Proposals That Show Their Work
Agent Loop receipts pattern. Govern phase rolling up receipts into audit trail.
https://leverageai.com.au/wp-content/media/The_Intelligent_RFP_ebook.html
Note on Research Methodology
Research for this ebook was conducted between January and February 2026, focusing on regulatory and standards texts (EU AI Act, NIST AI RMF, ISO 42001, SR 11-7), industry surveys and governance statistics (2024–2026), agentic AI governance frameworks (Singapore IMDA, UC Berkeley), and the zero trust security evolution as architectural precedent.
Statistics are drawn from named, published sources with clear provenance. Where multiple studies address the same metric with different figures, the most recent and methodologically transparent source was preferred. All URLs were verified at time of compilation.
The author’s practitioner frameworks (Architecture Not Vibes, Agent Provenance Stack, SiloOS, The Simplicity Inversion) are integrated as interpretive analysis and listed in the LeverageAI section above for transparency. These are not cited inline as external sources but are available for readers who wish to explore the underlying thinking.