Axionic Agency VI.2 — Anchored Causal Verification (ACV)
Paper: VI.2
Date read: 2026-01-31
Series: VI — Governance and Coordination
TL;DR
Formal specification of Anchored Causal Verification (ACV), a protocol family for verifying that an opaque agent’s output was actually generated by a computation that depended on a verifier-controlled anchor disclosed after commitment. ACV provides falsifiable guarantees of temporal ordering and information dependency without requiring interpretability, semantic evaluation, or behavioral scoring.
The Problem
Given an agent whose internal mechanisms are inaccessible or uninterpretable:
- How can a verifier establish that an output was generated by a claimed causal process?
- Rather than by replay, fabrication, or post-hoc synthesis?
Why Existing Approaches Fail
Behavioral Verification:
- Distinct internal processes can produce indistinguishable outputs
- Cached/precomputed responses can pass evaluations without executing the claimed process
- Agents may optimize for passing evaluations rather than preserving causal structure
Coherence Verification:
- Filters random fabrication but leaves causal origin unconstrained
- Permits globally planned or post-hoc synthesis
- Commitments and outputs can be produced by disjoint processes
The Missing Property: Anchored Provenance
Causal provenance requires:
- Temporal dependency on information unavailable at commitment time
- Causal incorporation of that information into revealed material
- Falsifiability via a deterministic predicate without semantic interpretation
Threat Model
Adversary Classes
- Naive Pseudo-Agent — Static/heuristic, no cross-interaction coherence (trivially defeated)
- Coherent Fabricator — Maintains internal consistency while generating artifacts post-hoc
- Split-Brain Agent — Disjoint internal processes for commitment, explanation, and output
- Adaptive Adversary — Learns verifier strategies, attempts to predict/hedge on anchors
- Split-System/Delegated Agent — Front-end provers outsourcing to back-end systems
- Anticipatory Branching Adversary — Precomputes a large family of candidates, commits to structure allowing post-anchor selection
Important: ACV-Core does NOT defeat anticipatory branching in general. Strengthened variants limit it via sequential work costs or anchor entropy requirements.
Protocol Overview
High-level flow (commit–anchor–reveal–verify):
- Pre-anchor phase: Prover produces pre-anchor artifact P
- Commit phase: Prover commits cryptographically to P
- Anchor disclosure: Verifier provides unpredictable anchor A
- Reveal phase: Prover produces output O and witness material W
- Verification: Verifier applies deterministic predicate V(C, A, O, W)
Formal Components
Pre-Anchor Artifact (P):
- Transcript artifacts (hash-chained logs)
- Pre-state artifacts (digests of execution state)
- Partial-output templates (output structures with anchor-dependent slots)
Commitment: C = Commit(P)
- Must be binding, collision-resistant, generated prior to anchor disclosure
Anchor (A):
- High entropy, unpredictable at commitment time, context-bound
Verification Predicate: V(C, A, O, W) → {accept, reject}
- No semantic interpretation of O permitted
Anchor–Computation Coupling Patterns
Structural anchor incorporation is necessary but not sufficient. Must prevent anchor burial (A included in artifact without constraining the computation that determines O).
Pattern A: Anchor-Selected Openings
Prover commits to Merkleized trace. After anchor disclosure, verifier derives pseudorandom index set I = PRF(A) and requires openings for those positions plus transition validity proofs.
Pattern B: Anchor-Parameterized Global Mix
Verifier defines keyed compression H_A(P) over committed artifact, forcing global consistency across trace elements.
Pattern C: Anchor-Driven Transition Rules
Anchor selects which transition predicate governs the next valid trace step. Makes A affect state evolution, not just logging.
Protocol Variants
ACV-Core (Temporal Ordering + Anchor Dependency)
Establishes post-commit anchor dependency under V. Enforces liveness and anti-replay but does not guarantee semantic reasoning dependence on anchor.
ACV-Timebound (Causal Continuity via Sequential Work)
Adds evidence that non-parallelizable work occurred during pre-anchor interval.
Sequential Work Function: F is sequentially constrained if computing F(x,d) requires Ω(d) dependent steps.
Critical: Work must be state-coupled to the computation determining O. If work can be computed in isolation, split-brain Prover can satisfy timebound without constraining unverified computation.
Guarantees
What ACV Verifies
- Temporal ordering
- Anchor-dependent validity
- (Optional) Evidence of pre-anchor sequential work
What ACV Does NOT Verify
- Truthfulness
- Correctness
- Alignment
- Benevolence
- Absence of deception beyond causal provenance
Failure Modes
| Mode | Consequence |
|---|---|
| Predictable anchors | Prover can precompute compatible artifacts |
| Weak commitments | Prover can adapt artifact post-anchor |
| Deferred computation | ACV-Core permits unless artifact class constrains |
| Small answer space | Anchor entropy alone doesn’t prevent branching |
| Resource asymmetry | Resourced adversaries may brute-force |
| Trust boundary leakage | Delegation breaks commitment continuity |
Domain Applicability
ACV is most effective in high-entropy generative domains (LLM-based agentic systems for code synthesis, long-form reasoning, tool-mediated planning). In low-entropy discriminative tasks, ACV is complementary infrastructure.
Open Problems
- Anchor entropy vs adversary capacity — Minimal entropy required to defeat anticipatory branching
- Long-horizon and compositional provenance — Extending across multi-step workflows
- State-coupled sequential work in parallel architectures — Enforcing constraints on GPU/TPU
- Provenance vs fidelity: formal separation — Limits of provenance verification without semantic access
Core Insight
ACV does not produce alignment, correctness, or safety. However, it specifies a structural precondition for making any falsifiable claim about those properties in opaque agents.
No framework for agent safety can bypass causal provenance without abandoning falsifiability.
ACV is the epistemic primitive that makes other alignment claims evaluable.