Technical Reference — AgenticRail / NIST AI RMF

NIST AI RMF Mapping: Pre-Execution Enforcement for Agentic AI

Mapping AgenticRail's gate architecture to NIST AI Risk Management Framework 1.0 subcategories. Govern 1.2, Measure 2.5, Manage 4.2 — and the structural gap in the framework that agentic AI exposes.

Published 2026-05-23 · agenticrail.nz/spec/nist-ai-rmf/

1. Framework Overview

The NIST AI Risk Management Framework 1.0 (January 2023) organises AI risk management across four functions: Govern, Map, Measure, and Manage. It is voluntary — no regulatory penalties attach to non-conformance — but it has become the de facto US enterprise baseline for AI risk programmes, referenced in federal procurement, sector guidance (financial services, healthcare, critical infrastructure), and international cross-referencing with ISO 42001 and the EU AI Act.

In April 2026, NIST published a concept note for an AI RMF Critical Infrastructure Profile — sector-specific prescriptive guidance for energy, finance, healthcare, and transport. The direction of travel is toward greater specificity, not less. Organisations building to the current framework should anticipate more prescriptive requirements in those sectors.

2. The Agentic AI Gap in NIST AI RMF

NIST AI RMF 1.0 was designed for AI systems whose behaviour is characterisable at deployment time — systems that can be tested, evaluated, and risk-assessed before they go live, and whose behaviour in production can be monitored through aggregate metrics.

The structural gap

Agentic AI systems violate the assumptions the framework was built on. They execute multi-step workflows autonomously, produce different action sequences on every run, and interact with external systems in ways that cannot be fully characterised before deployment. The risk is not static — it is generated at runtime, step by step, action by action. A risk management framework that relies on pre-deployment characterisation and post-hoc aggregate monitoring cannot govern systems whose risk materialises at execution time in discrete, atomic steps.

Three NIST AI RMF subcategories expose this gap most directly. All three require operational evidence — proof that controls ran during deployment, not documentation that they were planned.

3. Subcategory Mapping

Subcategory Requirement Agentic AI gap AgenticRail mechanism
Govern 1.2 Designated personnel with authority to intervene in AI system operation If oversight depends on the model's self-reported output, human authority is nominal — not structural. The model is the only entity evaluating whether a step should proceed. Gate operates at infrastructure level. API keys held by designated personnel. No step executes without gate ALLOW. Model cannot bypass, self-report around, or replay an issued decision.
Measure 2.5 AI system performance monitored at risk-appropriate levels Aggregate session metrics and post-hoc dashboards detect anomalies after sequences have completed and decisions have been made. For consequential agentic decisions, monitoring must be contemporaneous with execution. Every gate decision is a monitoring event. ALLOW, DENY, and HALT recorded per action before execution. HMAC-signed receipt written to immutable storage. Dashboard surfaces statistics in real time. Retrospective analysis at full per-action fidelity.
Manage 4.2 Risk management processes integrated into organisational operations Risk controls implemented as a periodic review layer parallel to the AI system do not satisfy integration. The control must be in the critical path of execution. Gate is a structural dependency. No step N+1 runs without gate ALLOW on step N. Risk management is not adjacent to the operational process — it is the operational process for every step of every sequence.

4. Mechanisms in Detail

Govern 1.2
Structural human authority — gate as the intervention point

The gate sits between the agent's reasoning layer and action execution. An agent cannot advance from one step to the next without clearing the gate. The gate's API keys are held by designated personnel — not the model, not the application layer. Those personnel can revoke keys, modify the declared step-order policy, or issue a HALT that retires the sequence without the model's cooperation or awareness.

HALT is terminal. A HALT-ed sequence is sealed — no further steps can be evaluated, regardless of what the model attempts. The authority to halt is unconditional and does not require the model to stop itself.

Govern 1.2 evidence available: Key issuance log (who holds gate access, when granted/revoked). HALT receipt records (when human authority was exercised, which sequence, which step triggered it). Sequence termination events in KV statistics.
Measure 2.5
Per-action monitoring — contemporaneous with execution

For every step evaluation, the gate assesses: sequence position against the declared step order; function/step identity match; action type permissibility against the policy map; nonce uniqueness (replay protection); timestamp freshness (300-second window). Each assessment produces a decision — ALLOW, DENY, or HALT — and an HMAC-SHA256 signed receipt written to immutable R2 storage before the gate returns the decision to the caller.

The receipt is written before the action executes. This is not a description of what ran — it is a record of what was authorised at the moment of authorisation. For risk-appropriate monitoring of consequential agentic decisions, this is the minimum granularity: per action, per sequence, before execution.

Measure 2.5 evidence available: Per-step receipt chain for any sequence. ALLOW/DENY statistics at sequence and step level. DENY reason codes (SEQUENCE_VIOLATION, REPLAY_NONCE, ACTION_NOT_ALLOWED, STALE_TIMESTAMP). Real-time dashboard with step distribution and refusal log. Retrospective compliance report via report.agenticrail.nz.
Manage 4.2
Critical path integration — gate as execution dependency

The gate is not a monitoring layer layered on top of the agent. It is a structural dependency of the agent's execution path. An agent that calls a tool without first receiving a gate ALLOW for that step will not receive an ALLOW retroactively — the receipt does not exist, the step is not in the chain, and the sequence is invalid.

This means risk management is not adjacent to operations — it is integrated into operations at the level of individual step execution. The integration is not procedural (a policy that requires developers to call the gate) — it is architectural (the agent framework requires gate ALLOWs to proceed, enforced by the SDK and wrapper layer).

Manage 4.2 evidence available: The receipt chain itself. Every step that ran appears in the chain. Steps that did not clear the gate do not appear as ALLOWs — they appear as DENYs or do not appear at all. The chain proves integration — not by assertion, but by structure.

5. Evidence Package for NIST AI RMF Documentation

The following evidence is available from AgenticRail for inclusion in a NIST AI RMF risk programme documentation package:

Available evidence
  • Receipt schema specification — agenticrail.nz/spec/receipt-schema.json — defines all fields, types, and HMAC computation inputs
  • Compliance report — report.agenticrail.nz — full receipt chain for any sequence, chain integrity verification, per-step enforcement log, HMAC verification status
  • Live receipt verification endpoint — POST /verify with sequence_id and receipt pack_id returns cryptographic verification result
  • Dashboard statistics — real-time ALLOW/DENY ratios, step distribution, HALT events, refusal log available at dashboard.agenticrail.nz
  • DENY receipt examples — SEQUENCE_VIOLATION, REPLAY_NONCE, ACTION_NOT_ALLOWED samples demonstrating that blocked actions are recorded
  • Sequence sealing evidence — SEALED_SEQUENCE denial records proving that completed sequences cannot be replayed or extended

6. Cross-Framework Alignment

The same receipt chain produced by AgenticRail maps to three frameworks simultaneously. The underlying requirement across all three is identical: proof that oversight controls ran during deployment, not documentation that they were planned.

Framework Requirement Receipt chain satisfies
NIST AI RMF Govern 1.2, Measure 2.5, Manage 4.2 Structural intervention authority, per-action monitoring evidence, critical path integration proof
EU AI Act Article 12 — automatic logging for reconstruction of sequence of events Pre-execution receipts enable full sequence reconstruction without re-running the system
ISO/IEC 42001 A.6.2.8 — event log recording; A.6.1.6 — operational logging and reconstruction HMAC-signed, chain-linked, schema-published receipts satisfy both controls — pre-execution timing, defined format, cryptographic integrity, chain linkage
References and further reading