Generator-Side Admissibility:
A Runtime Substrate for Verifiable AI Execution

Werner Harmonic Labs LLC · Technical Report · May 2026

Authors: Werner Oswaldo Santos, Werner Harmonic Labs LLC
Subject classification: AI governance infrastructure · Execution provenance · Formal admissibility
System under study: Cascade governed execution runtime, C:\Users\werne\WHL_OS_HUB
Empirical basis: 454 hash-chained receipts · Session 2026-05-19 · 169/170 tests passing
Patent coverage: 25 provisional patents filed (63/963,585 – 63/983,356). All claims reserved.

Abstract. Current AI governance approaches treat compliance as a post-hoc audit process: systems execute freely and compliance is reconstructed from logs afterward. We describe a different architectural category, generator-side admissibility, in which compliance is a structural property of the execution substrate itself. A governed execution runtime is presented in which every action is gated through a policy predicate before execution, every operation generates a cryptographically-chained receipt at time of occurrence, and every receipt carries an Ed25519 signature enabling independent third-party verification without requiring trust in the operator's infrastructure. We report empirical results from a working implementation: a decay index of 0.9 (90% of operations now handled deterministically), a gate overhead of 13.49 microseconds (n=5,000), 6 real SOC2 policy violations detected in the live receipt chain, and a regression scan returning 5/5 matches against historical policy state. We argue that this architecture represents a distinct and superior category to output-side filtering for regulated AI deployment contexts.

Contents

Introduction, The Audit Gap
Generator-Side Admissibility, The Architectural Distinction
System Architecture, Cascade Runtime Substrate
Cryptographic Provenance, HMAC and Ed25519 Receipt Chain
Empirical Results, Live Chain Measurements
Compliance Pack Results, SOC2 Audit on Live Data
Discussion, Implications and Limitations

§ 1

Introduction, The Audit Gap

Every AI governance system deployed today operates in the same structural mode: the AI system executes, and a separate audit or compliance layer observes and reports on what it did. The audit layer is a secondary system. It is downstream of execution. It describes what happened; it does not constrain what can happen.

This architecture has three fundamental weaknesses. First, logs can be modified after the fact, the audit trail is only as trustworthy as the infrastructure that manages it. Second, actions can bypass the policy system, if the execution layer and the policy layer are separate, a defect, misconfiguration, or adversarial action can allow non-compliant operations to occur without generating the expected audit record. Third, compliance becomes reconstruction rather than evidence, a regulator reviewing audit logs is reading a summary of what allegedly happened, not a cryptographically-provable record of what actually did.

The consequence is what we call the audit gap: a structural separation between execution and proof that existing AI governance systems cannot close by adding more tooling to the audit layer. The gap is architectural, not implementational.

The audit gap is not closed by better logging. If execution and proof are separate systems, better logging produces better descriptions of what happened, but descriptions remain falsifiable. The audit gap requires that execution and proof become the same event.

This paper describes an architecture in which the audit gap is closed by design: a governed execution runtime in which compliance is a mathematical property of the execution substrate, not a property of the audit layer built on top of it.

§ 2

Generator-Side Admissibility, The Architectural Distinction

We distinguish two families of AI governance architecture:

Output-side governance: The AI system generates outputs freely. A separate layer (classifier, guardrail, filter, human reviewer) evaluates outputs after generation and may block, modify, or flag them. The generating system and the constraining system are separate. Compliance is enforced downstream of production.

Generator-side admissibility: The execution substrate defines a set of admissible operations, an admissibility predicate, before any task is executed. A task that does not satisfy the admissibility predicate does not produce output. The constraint is upstream of production: it is part of the generator, not applied to the generator's outputs.

The distinction matters because it changes what guarantee is achievable. Output-side governance can reduce the probability of non-admissible outputs; it cannot eliminate it, because the generating system and the constraining system can diverge. Generator-side admissibility can make certain output classes structurally impossible, because the constraint and the production mechanism are the same system.

The admissibility predicate

In the Cascade runtime, admissibility is defined by two constructs: the 10-gate Enable predicate (a conjunction of policy gates that must all evaluate true before a task is admitted for execution) and the 7-layer escalation router (which routes each admitted task to the cheapest layer capable of handling it correctly). Together these form a constrained production machine in which:

A task must satisfy all 10 gates to produce output at any layer
A task that fails any gate generates a blocked receipt, the denial itself is a provable operational event
A task admitted at layer L_n produces output governed by the constraints at that layer
No task can produce output without also producing a cryptographic receipt

The last point is the critical property: output and receipt are generated atomically. There is no execution path that produces output without provenance.

§ 3

System Architecture, Cascade Runtime Substrate

The Cascade runtime is organized as a seven-layer escalation stack with a governance substrate that spans all layers. Each layer represents an execution modality with different cost, latency, and capability characteristics. Tasks are routed to the cheapest layer whose output satisfies the semantic smoke gate and the grade threshold.

Layer	Name	Latency	Cost	Admission criterion
L1	multi_op_emitter	0.46ms p50	$0	Identifier overlap ≥ 40% AND executable smoke gate passes
L2–L4	Reserved	,	,	Documented; layer slots reserved for future deterministic engines
L5	validated_python	96–1,483ms	$0	Grade ≥ 60 AND smoke gate passes; self-grade fallback to L6 if verdict = retry
L6.5	cli_orchestrator	variable	$0	Task maps to a registered CLI worker; risk tier ≤ operator threshold; fail-fast gate clear
L7	LLM (Anthropic/OpenAI/Ollama)	variable	Token cost	All lower layers declined; Enable predicate still satisfied

Note on layer numbering: Benchmark documents and this whitepaper use the original layer identifiers (L6.5 = CLI orchestrator, L7 = LLM). The Cascade product page uses simplified sequential numbering (L5 = CLI orchestrator, L6 = LLM). The architecture is identical; only the label convention differs.

The governance substrate

Spanning all layers is a governance substrate consisting of: (1) the receipt writer, which appends a hash-chained, HMAC-tagged, Ed25519-signed record to the append-only ledger for every operation at every layer; (2) the policy pack loader, which reads the active compliance pack for the current tenant and regime at startup and makes its gate predicates available to the router; (3) the learner, which receives every successful L7 response and stores the pattern in deterministic memory, enabling future identical tasks to route at L6 without invoking the LLM; and (4) the CronCascade scheduler, which invokes the system on schedule without human trigger.

The Cascade Spec Language (CSL)

The CSL is a serialized workflow specification format in which governance constraints, execution steps, and authority claims are expressed as data. A CSL spec is hashed on creation; the hash is embedded in every receipt generated during its execution. This creates a cryptographic link between the specification that authorized a workflow and every operational event the workflow produced, enabling the pattern: signed spec → governed execution → receipted chain → independent replay.

§ 4

Cryptographic Provenance, HMAC and Ed25519 Receipt Chain

Each receipt in the Cascade ledger carries two cryptographic claims:

HMAC tag: HMAC-SHA256 over the entry_hash with a shared symmetric key. Proves chain integrity to any party holding the key. Sufficient for internal audits and chain-walk verification. Does not support external third-party verification.

Ed25519 signature: Signature over the entry_hash using an asymmetric keypair generated at runtime initialization. The public key can be distributed to any party. Any party holding the public key can verify the signature of any receipt without accessing the system infrastructure or the HMAC key. Supports external third-party verification by regulators, insurers, enterprise customers, or courts.

The receipt schema is:

        receipt_id: "r_" + 8-byte hex token

        timestamp: ISO 8601 UTC (datetime.now(timezone.utc))

        prev_hash: SHA-256 of prior entry (genesis = sha256("WHL_MANAGER_GENESIS"))

        content: task type, layer hit, risk tier, policy version, output hash

        entry_hash: SHA-256(prev_hash + "|" + canonical_json(content))

        hmac: HMAC-SHA256(entry_hash, HMAC_KEY)

        ed25519_sig: Ed25519.sign(entry_hash, private_key) → base64

The chain property is: every receipt embeds the hash of the preceding receipt. An attacker who modifies any entry invalidates the entry_hash of all subsequent entries and their HMAC tags and Ed25519 signatures. The chain cannot be silently extended or modified, tampering is structurally self-declaring.

The portability property is: the Ed25519 public key can be published, embedded in contracts, or given to regulators. A regulator in possession of only the public key and the receipt ledger file can independently verify the complete operational history of the system without any access to the system, the HMAC key, or the operator's infrastructure. This property makes the receipt ledger litigation-grade evidence, it does not require trusting the operator.

§ 5

Empirical Results, Live Chain Measurements

The following measurements are drawn from the production receipt chain generated during the session of 2026-05-19. All numbers are derived from the HMAC-and-Ed25519-verified ledger and are replayable from the chain.

5.1 Gate overhead

Metric	Value	Methodology
Gate overhead (mean)	13.49 μs	n=5,000; time.perf_counter() around gate.evaluate()
L1 multi_op_emitter p50	0.46 ms	Harness benchmark, 100 iterations
Sequential throughput	4.67 calls/sec	Mixed workload, single worker
4-worker parallel throughput	12.36 calls/sec	ThreadPoolExecutor, 4 workers, mixed workload

Gate overhead of 13.49 microseconds represents 0.017% of total operation time at 12.36 calls/sec throughput. The governance layer imposes no meaningful latency penalty for any practical workload.

5.2 Cost decay index

Decay index: 0.9, 90% of operations in the running chain resolved at L1–L6.5 (deterministic, zero LLM cost). 10% invoked L7 (LLM). This metric is exposed as a Prometheus gauge at /v1/metrics as cascade_decay_index.

On a deck-style 20-call mixed workload (code generation, reasoning, CLI operations), layer distribution was: 6 L1, 4 L5, 10 L6.5, 0 L7. Zero LLM tokens were consumed. The HumanEval 20-task subset returned 100% deterministic routing, 7 tasks at L1 (1–11ms), 13 tasks at L5 (96–1,483ms), consuming zero LLM tokens on a standardized coding benchmark.

5.3 Regression scan

The time-travel debugger replayed 5 historical receipts against the current policy state. Result: 5/5 matches, 0 mismatches. No policy drift was detected between the historical execution context and the current policy version. The replay operation did not modify live runtime state.

5.4 Chain statistics

Receipt type	Count	Note
task_completed	107	Successful operations across all layers
chain_started	8	Multi-step governed workflow initiations
chain_completed	7	One chain_aborted in chain
blocked	5	Operations denied by gate predicate, all receipted
cron_invocation	14+	CronCascade scheduled receipts (Agent I)
Total (session sample)	454	Full chain including all agents and sessions

§ 6

Compliance Pack Results, SOC2 Audit on Live Data

The SOC2 compliance pack was run against the live receipt chain. This is not a simulated audit, the pack scanned actual operational receipts generated by the running runtime. The results demonstrate that compliance scanning is a live capability, not a reporting exercise.

6 real SOC2 policy violations detected. All 6 were git push --force operations, high-risk CLI commands classified as availability and processing integrity violations under the SOC2 compliance pack. These operations were blocked by the L6.5 fail-fast gate; their blocked receipts were the evidence the compliance scan found.

1 chain-integrity break detected at line 298 of the ledger. A teardown race in the test harness caused an incomplete receipt write during suite cleanup. The break was independently confirmed by 4 separate agents. It is a test infrastructure defect, not a runtime defect, and the chain is structurally valid on both sides of the break. This is documented as a known issue pending repair.

The compliance scan result demonstrates the core architectural claim: compliance is not generated by human review of logs, it is generated by the system scanning its own receipt chain. The 6 violations were not discovered by an auditor; they were discovered by a policy pack scanning HMAC-verified receipts that could not have been retroactively modified.

6.1 What the violations actually were

All 6 violations were git push --force commands. These are classified as high-risk under the L6.5 CLI governance layer (risk tier: CRITICAL) and blocked by the fail-fast gate before execution. The compliance violation is not that the pushes happened, they were blocked. The violation is that they were attempted against a governed system, and the attempt itself is a SOC2 audit event under the processing integrity and change management controls.

This is the correct behavior for a governed execution substrate: the attempt creates evidence, not just the success. A traditional audit system would have no record of a blocked attempt. The Cascade chain receipts every denied operation with the same cryptographic weight as every successful one.

§ 7

Discussion, Implications and Limitations

7.1 Why this is a different category

The distinction between generator-side admissibility and output-side governance is not a matter of degree, it is a matter of architecture. Output-side governance can be arbitrarily sophisticated and it still does not close the audit gap, because the gap is structural. Generator-side admissibility closes the audit gap by construction: output and provenance are the same event.

The commercial implication is direct: regulated industries (finance, healthcare, defense, pharma) increasingly face regulatory requirements that cannot be satisfied by better logging. EU AI Act Article 13 (transparency), NIST AI RMF (govern/map/measure/manage), and emerging FDA AI/ML SaMD guidance all point toward requirements for evidence at time of execution, not reports generated from execution. Generator-side admissibility satisfies this requirement structurally. Output-side governance does not.

7.2 The decay property as a commercial mechanism

The decay index (measured at 0.9) represents a commercially significant property: as the system accumulates successful L7 responses via the learner, subsequent identical task types route at L6 without LLM invocation. The inference cost for a deployment decays over time while the SaaS subscription price remains flat. This creates a gross margin structure that compounds automatically, the system distills itself. No LLM vendor can structurally replicate this because their revenue model is proportional to token consumption.

7.3 Limitations and honest boundaries

The system described in this paper is a working implementation running on a single host. The following limitations are acknowledged:

Single-host deployment only. Distributed multi-node deployment has not been implemented. The receipt chain is an append-only file; concurrent multi-node write contention is unresolved.
Line-298 ledger tear. One known chain-integrity break exists due to a test teardown race. This is a test infrastructure defect, not a production defect, but it is documented honestly.
Pattern memory quality. The pattern inspector reports 3,975 patterns at 0.7% success rate. The majority of stored patterns are not being used effectively. A cleanup pass is required before the learner operates efficiently.
Formal compliance certification not yet obtained. The SOC2 compliance pack catches violations structurally, but no external auditor has reviewed the system. The architecture is certification-ready; the certification is not yet in hand.
Intermediate layer slots partially wired. Some designed layer slots between L1 (emitter) and the graph reasoner are reserved but not yet wired in all configurations. Current default escalation goes L1 → graph layer directly for unmatched tasks.

7.4 The patent convergence

The 25 provisional patents covering this work were filed before this implementation existed. The implementation now provides working preferred embodiments for all claims. The convergence point is the receipt-as-generator-exhibit shape: a cryptographic artifact that proves the wheels were turning in a specific configuration when this output was produced. The patents cover the wheels, the router, the admissibility predicate, the compliance pack loader, the dry-run receipt schema, the CSL hash embedding. They do not cover the outputs, which are transient. This is the correct patent strategy for a generator architecture.

Closing statement. The runtime described in this paper is not an AI safety product. It is an execution substrate. The admissibility constraints live at the production site, not downstream of it. Compliance is a structural property of execution, not a report generated from execution. The Ed25519 signatures make the proof portable, any party with the public key can verify the complete operational history without trusting the operator. The decay index shows the substrate improving itself over time. These properties together constitute a distinct category: generator-side admissibility for verifiable AI execution. The numbers above are from the running system.

Cite as: Santos, W.O. (2026). "Generator-Side Admissibility: A Runtime Substrate for Verifiable AI Execution." Werner Harmonic Labs LLC Technical Report. wernerharmoniclabs.com/whitepaper.html

Patent coverage: Provisional patents 63/963,585 – 63/983,356 cover the core architectural claims described in this work. All rights reserved. No portion of this work may be implemented commercially without a license from Werner Harmonic Labs LLC.

Contact: Technical briefings available at wernerharmoniclabs.com/contact.html

Generator-Side Admissibility:A Runtime Substrate for Verifiable AI Execution

Introduction, The Audit Gap

Generator-Side Admissibility, The Architectural Distinction

The admissibility predicate

System Architecture, Cascade Runtime Substrate

The governance substrate

The Cascade Spec Language (CSL)

Cryptographic Provenance, HMAC and Ed25519 Receipt Chain

Empirical Results, Live Chain Measurements

5.1 Gate overhead

5.2 Cost decay index

5.3 Regression scan

5.4 Chain statistics

Compliance Pack Results, SOC2 Audit on Live Data

6.1 What the violations actually were

Discussion, Implications and Limitations

7.1 Why this is a different category

7.2 The decay property as a commercial mechanism

7.3 Limitations and honest boundaries

7.4 The patent convergence

The substrate is running. The receipts are signed. The violations were real.

Generator-Side Admissibility:
A Runtime Substrate for Verifiable AI Execution