Most AI agent demos optimize for the first successful run.

Real agent work gets interesting after the agent says "done."

For a coding agent, browser agent, or MCP-connected workflow, the final chat answer is not enough. I want a receipt: a compact operational record that helps a human trust, debug, replay, roll back, or explain what happened.

Not a giant transcript. Not a raw log dump. A receipt.
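
To make "compact operational record" concrete, here is a minimal sketch of what a receipt could look like as a data structure. Every field name below is an assumption for illustration, not a standard and not something any agent framework emits; the fields are simply chosen to map onto the five verbs above (trust, debug, replay, roll back, explain), and the sample values are made up.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class Receipt:
    """Hypothetical receipt shape; all field names are illustrative assumptions."""

    task: str      # explain: what the agent was asked to do
    outcome: str   # trust: e.g. "succeeded", "failed", "partial"
    checks: list[str] = field(default_factory=list)         # trust: what was verified
    actions: list[str] = field(default_factory=list)        # debug / replay: ordered steps
    files_changed: list[str] = field(default_factory=list)  # roll back: what was touched
    rollback_hint: str = ""                                  # roll back: how to undo it

    def to_json(self) -> str:
        # Serialize to a small JSON document a human can skim or a tool can store.
        return json.dumps(asdict(self), indent=2)


# Made-up example values, for illustration only.
receipt = Receipt(
    task="Rename config key 'timeout' to 'timeout_s'",
    outcome="succeeded",
    checks=["unit tests passed"],
    actions=["edited config.py", "ran test suite"],
    files_changed=["config.py"],
    rollback_hint="revert the single commit that touched config.py",
)

print(receipt.to_json())
```

The point of the sketch is the size: a handful of fields rather than a transcript, with each one answering a question a human would actually ask after the run.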

"Done" is not a state