After Orthogonality: Virtue-Ethical Agency and AI Alignment

Preface This essay argues that rational people don’t have goals, and that rational AIs shouldn’t have goals. Human actions are rational not because we direct them at some final ‘goals,’ but because we align actions to practices[1]: networks of actions, action-dispositions, action-evaluation criteria, and action-resources that structure,

giovedì 19 febbraio 2026 New tab

Preface

This essay argues that rational people don’t have goals, and that rational AIs shouldn’t have goals. Human actions are rational not because we direct them at some final ‘goals,’ but because we align actions to practices[1]: networks of actions, action-dispositions, action-evaluation criteria, and action-resources that structure, clarify, develop, and promote themselves. If we want AIs that can genuinely support, collaborate with, or even comply with human agency, AI agents’ deliberations must share a “type signature” with the practices-based logic we use to reflect and act.

I argue that these issues matter not just for aligning AI to grand ethical ideals like human flourishing, but also for aligning AI to core safety-properties like transparency, helpfulness, harmlessness, or corrigibility. Concepts like ’harmlessness’ or ‘corrigibility’ are unnatural -- brittle, unstable, arbitrary -- for agents who’d interpret them in terms of goals or rules, but natural for agents who’d interpret them as dynamics in networks of actions, action-dispositions, action-evaluation criteria, and action-resources.

While the issues this essay tackles tend to sprawl, one theme that reappears over and over is the relevance of the formula ‘promote x x-ingly.’ I argue that this formula captures something important about both meaningful human life-activity (art is the artistic promotion of art, romance is the romantic promotion of romance) and real human morality (to care about kindness is to promote kindness kindly, to care about honesty is to promote honesty honestly).

Preface

After Orthogonality: Virtue-Ethical Agency and AI Alignment

After Orthogonality: Virtue-Ethical Agency and AI Alignment

Related reading

Anthropic suggests slowing AI research until we can align it with human goals

Dario Amodei's new essay reads like a Cold War playbook for the AI age

Jamie Metzl: AI's ethical challenges in rule-making, its potential to extract…

World must carefully consider AI’s purpose

AI Alignment is a Systems Architecture Problem, Not a Prompt Problem

Anthropic unveils ‘auditing agents’ to test for AI misalignment

Related reading

Anthropic suggests slowing AI research until we can align it with human goals

Dario Amodei's new essay reads like a Cold War playbook for the AI age

Jamie Metzl: AI's ethical challenges in rule-making, its potential to extract…

World must carefully consider AI’s purpose

AI Alignment is a Systems Architecture Problem, Not a Prompt Problem

Anthropic unveils ‘auditing agents’ to test for AI misalignment