Alignment

Work on AI alignment, value learning, and ensuring AI systems behave as intended.

Posts

The Mechanics of Agency: Maximal Theoretical Agent - Explores Maximum Theoretical Agent (MTA)—opposite extreme from MVA.
True Neutral - Uses D&D alignment grid as metaphor for philosophical positions.
Comparing Value Systems - Proposes using cosine similarity from vector mathematics to quantify alignment between value systems
Ghosts in the Machine - A critical response to Mustafa Suleyman’s warning about “Seemingly Conscious AI” (SCAI), this essay
Structural Alignment - This post marks a pivotal shift in Axio’s alignment framework, arguing that traditional goal-based a
The Axionic Agency Sequence - This meta-post serves as a comprehensive roadmap and overview of the entire Axionic Agency research
Axions as a Type of Agency - Introduces “Axion” as precise noun naming constitutive structural configuration of reflective agency
Axionic Agency — Interlude III - Documents project’s pivot from “Axionic Alignment” to “Axionic Agency.” Records shift in understandi
Alignment Under Uncertainty - Addresses alignment when outcomes, values, and agent capabilities are uncertain.
Alignment Beyond Epistemics - Argues alignment discourse over-focuses on epistemic problems (what AI knows/believes/understands) w
The Load-Bearing Parts of Agency - This post explains results from Axionic Agency VIII.6, which used ablation methodology to identify n
You Can’t Align a Hurricane - This accessible essay uses the hurricane-vs-nation analogy to explain why AI agency matters for safe