Audit-trail-by-construction: a thesis for spec-driven AI coding

TL;DR. Trail is a multi-agent framework for Claude Code that uses Plane work-items as the audit bus. Requirements get stable IDs that thread all the way down into test-code annotations, so every line of AI-generated code can be traced back to a signed-off intent. Built for regulated work and security-critical systems, not for general velocity-first coding.

Most agentic frameworks for coding are built for velocity. They wire up some agents — a planner, an architect, a coder, a reviewer — and let them collaborate on a feature. What comes out is code, often working code, in less time than a human would need.

That is fine, until you have to defend the code.

A regulator asks: who signed off on the threat model that justifies this auth shortcut? A customer asks: which acceptance criterion does this test actually prove? An incident review asks: when this requirement got added, what was the original intent — was the implementation true to it, or did the agent improvise? In a velocity-first framework, the trail goes cold quickly. The agent did it. A dev approved the PR. The "why" lives in a chat transcript that got compacted twice and was partly summarised.

Audit-trail-by-construction: a thesis for spec-driven AI coding

Related reading

Why Your AI Agent Needs an Audit Trail (And How to Build One)

Why AgentTrail Exists: Building Open-Source Audit Trails for AI Agents

Audit Trails Make Systems Easier to Trust

Executable Contracts as Guardrails for AI-Generated Code

Your AI agent's audit trail is not evidence. Here's what makes it one.

Tracepath — I Built an AI Agent Audit Middleware in One Weekend