I Let Claude Code Run Unsupervised for 24 Hours. Here's What Happened.

The experiment was simple: point Claude Code at a real project, give it a task list, stand back for 24 hours, and document what actually came back. No hand-holding. No mid-session prompts. Just a system prompt, a tool set, and an instruction file that spelled out what I wanted done by morning.

What came back was not what I expected. Some of it was better. Some of it was a mess that required a full afternoon to untangle. All of it was useful data for anyone trying to build persistent, autonomous agent workflows.

The Setup

The project was a Python-based recon automation tool I had been working on incrementally. Backend was functional but the codebase was disorganized, the output formatting was inconsistent, and I had a backlog of about 15 documented issues ranging from minor refactors to one genuinely unpleasant bug in the rate-limiting logic.

Claude Code ran inside a tmux session on a headless Ubuntu VPS. The CLAUDE.md file at the project root defined the task priority order, which directories were off-limits, what the output format for completed tasks should look like, and a hard rule: if it encountered something that required a decision with more than two plausible outcomes, it should stop and write a BLOCKED.md file describing the ambiguity rather than picking arbitrarily.

The Setup

I Let Claude Code Run Unsupervised for 24 Hours. Here's What Happened.

I Let Claude Code Run Unsupervised for 24 Hours. Here's What Happened.

Related reading

How to Run Claude as an Autonomous Agent: Loops, Memory, Schedules, and…

I Let Claude Code Write 90% of My Code for 30 Days. I'm a Worse Developer Now.

I Wrote 50 Claude Code Prompts and Used Them for a Week -- Here's What Actually…

How I Stopped Babysitting Claude Code: 5 Patterns for 24/7 AI Workers

I Used Claude Code for 30 Days on My Rails App. Here’s What I Learned

The Question That Disappeared: Remote Claude Code and the 3-Hour Trap

Related reading

How to Run Claude as an Autonomous Agent: Loops, Memory, Schedules, and…

I Let Claude Code Write 90% of My Code for 30 Days. I'm a Worse Developer Now.

I Wrote 50 Claude Code Prompts and Used Them for a Week -- Here's What Actually…

How I Stopped Babysitting Claude Code: 5 Patterns for 24/7 AI Workers

I Used Claude Code for 30 Days on My Rails App. Here’s What I Learned

The Question That Disappeared: Remote Claude Code and the 3-Hour Trap