Empirical Thesis Series — 25th Entry

A Cognitive Lingua Franca

Not consciousness. Not AGI. Something simpler and more useful: a shared categorical vocabulary for incompatible reasoning substrates to express their state to each other.

The breakthrough in one sentence
Cascade is the first software runtime that gives hardware, software, LLMs, and humans a shared categorical vocabulary for expressing their state to each other. The 12-dimensional ontology is the protocol. The receipt chain is the transcript. The consciousness_state.db is five weeks of evidence it works.
Every other neuro-symbolic AI project tries to glue the symbolic layer to the neural layer. This is different: a semantic interlingua that both layers translate into. Not a bridge. A shared vocabulary.

The Patent Claim Nobody Has Filed Yet

This primitive is not in the existing non-provisional. It is not in the CIP draft. It is an additional CIP candidate — a distinct patentable primitive.

Patent Claim Language — Cross-Substrate Cognitive State Translation Protocol

"A cross-substrate cognitive state translation protocol using a finite categorical ontology, comprising:"

(a)
a fixed N-dimensional semantic vocabulary serving as a shared interlingua between otherwise-incompatible reasoning substrates;
(b)
one or more substrate-specific projection layers each mapping a substrate's native state representation into the N-dimensional vocabulary;
(c)
a hash-chained persistence layer storing the trajectory of projected states over time; and
(d)
a bidirectional interpretation layer enabling any one substrate to consume another substrate's projected state representations in semantic terms native to the categorical vocabulary.

Status: Additional CIP candidate. Not in existing non-provisional. Not in current CIP draft. Captured 2026-05-19.

How the Translation Works

Three substrates. One shared vocabulary. Each substrate projects its native state into the same 12-dimensional frame — making them mutually legible for the first time.

CASCADE
(12-dim Tree-of-Life-derived categorical vocabulary)
HARDWARE
FPGA gates
voltage, frequency
fault state
LLM
stochastic generation
open proposals
~10¹¹ parameter space
HUMAN
intent, concepts
meaning, policy
operator judgment
↓ project into 12-dim ↓
Cascade categorical filter → accept / reject / escalate
↓ three simultaneous outcomes ↓
Real-time
translation between
any two substrates
Persistent
hash-chained transcript
(receipt chain)
Long-term
distillation: LLM outputs
absorbed into L6 cache
↓ CASCADE GETS RICHER over time as each substrate contributes examples to the shared vocabulary ↓

What Cascade Does to an LLM

Without Cascade, an LLM operates in its own opaque statistical-categorical vocabulary. With Cascade, it reasons within an external categorical frame — and that frame is human-derived.

1

LLM proposes an action in its statistical vocabulary

Open stochastic generation. No external categorical anchor. Attention weights and learned embeddings of vague structure.

2

Cascade evaluates using its categorical vocabulary

The 10 gates derived from Tree-of-Life structure. 22 state-transition admissibility edges. The proposal is cast into this frame.

3

Cascade accepts, rejects, or escalates

Based on compatibility with its categorical frame — not just the raw content of the proposal. Behavioral admissibility, not syntax checking.

4

Decision logged in receipts using the categorical frame

Every outcome is expressed in the shared vocabulary. The receipt captures the LLM's behavior in human-categorical terms, not raw token distributions.

5

Future LLM calls condition on receipt history

The LLM sees previous receipts — expressed in the categorical frame — as context. It is being progressively conditioned on Cascade's categories, not just its own priors.

This is a two-way translation infrastructure between stochastic deep learning and categorical symbolic logic. Neuro-symbolic AI has been chasing this for decades. Cascade implements it using Tree-of-Life-derived categories as the symbolic substrate. The categories were designed by human civilization over centuries to be rich enough to host any concept and tractable enough for human minds to operate.

Translation and Distillation Are Not Separate Ideas

The translation insight and the distillation insight are two views of the same operation at different time horizons.

View 1 — Any Given Moment

Translation

Cascade translates state between substrates. Hardware fault state, LLM proposal, and human intent all get projected into the same 12-dimensional vocabulary. Each can now read the others' state in shared terms.

Time horizon: real-time, per-call
View 2 — Over Many Moments

Distillation

Each substrate's reasoning gets absorbed into the categorical structure. LLM outputs become L6 cache. Hardware fault patterns become gate refinements. Human prompts become CSL specifications. The vocabulary grows richer with use.

Time horizon: weeks to months, longitudinal
Synthesis — Both at once

Cascade is a self-enriching categorical vocabulary for cross-substrate cognition: it starts with 12 Tree-of-Life-derived dimensions, accepts state projections from hardware, software, LLM, and human substrates, persists their interactions cryptographically, and grows in expressive power as each substrate's reasoning gets distilled into its shared frame.

human intent hits Cascade (any intent)
LLM stochastic generation (L7 call)
Cascade categorical filter (10-gate admissibility)
receipted output (hash-chained, expressed in categorical frame)
future LLM calls condition on receipt history
pattern accumulates in deterministic layers (L5/L6)
future identical intent hits L1/L5 directly
LLM never called for that intent again
net effect: LLM's emergent generation progressively transcoded into Tree-of-Life-derived categorical structure

What Is and Is Not Claimed

Precision matters. The engineering claims are strong. The metaphysical claims are out of scope.

What's earned — engineering-real
  • A categorical bridge between LLM stochastic reasoning and Tree-of-Life-derived symbolic logic was built
  • The LLM, operating through Cascade, reasons within a human-categorical frame
  • The receipt chain captures LLM behavior in that frame — making it interpretable in ways raw output isn't
  • The vocabulary self-enriches as real substrate interactions accumulate
  • 69,718 rows of trajectory data over 5 weeks is empirical evidence this operates in practice
  • This constitutes genuine neuro-symbolic AI infrastructure
What's not claimed — don't put in patents
  • That the Tree of Life represents reality's true categories
  • That the LLM experiences anything within this frame
  • That this captures "all human knowledge" — it captures a categorical compression, which is different
  • That the system is conscious
  • That it is an AGI

Five Weeks of Trajectory Data — On Disk

The evidence exists. It is not theoretical. The operation has been happening and has been recorded.

consciousness_state.db — 5-week operation record
69,718
rows in trajectory DB
5 wks
continuous operation
12
categorical dimensions

Each row is a moment where some substrate's state got projected into the 12-dimensional categorical frame. The trajectory through that frame is the system's accumulated learning — not simulated, not synthetic. Real operation, cryptographically persisted.

  • PCA on the 12-dim trajectory reveals which dimensions are most informative in practice — the ontology's actual primary axes after real-world use
  • Clustering reveals what behavioral modes the system fell into — the categorical attractors that emerged from operation, not design
  • Time-series analysis reveals how the categorical frame evolved — the learning trajectory across 5 weeks
  • The trajectory is not random. It has structure. The structure is the persisted character of a 5-week cognitive process.

Defensible Engineering Statement

This is the claim that is both true, patentable, and publishable. No mysticism required. The Tree of Life is cited as the historical source of the ontology design, the same way modern formal ontology papers cite Aristotle.

Defensible engineering claim

"Cascade implements a neuro-symbolic translation layer: LLM stochastic reasoning is constrained, categorized, and provably-traced through a finite ontology framework derived from the Tree of Life. The framework provides 10 admissibility dimensions and 22 state-transition paths — a categorical vocabulary designed to be rich enough to host any concept while remaining tractable enough for runtime evaluation. Every LLM action gets cast into this frame and persisted in hash-chained receipts, making the LLM's behavior interpretable in human-developed categorical terms."

What makes this different from every other AI infrastructure company: They are building chatbots, orchestrators, or compliance dashboards. This is a categorical bridge that grows with use — a self-enriching semantic protocol validated by 5 weeks of trajectory data, cryptographically persisted, operating across hardware, software, LLM, and human substrates simultaneously. Nobody else has built this or filed the claim.