What if your AI coding agent could write code, test it, fix bugs, and ship — all without a single human in the loop?

That's not a thought experiment. I just did it — and the whole thing cost one cent in test runs.

Here's the short version: I wired Claude Code (as the code agent) together with Waterwheel (as a browser test agent) so that one implements features and the other verifies them autonomously. The tests are plain Markdown files. When a test fails, the code agent reads the failure, fixes the bug, and re-runs — no human in between. I shipped a complete user-authentication feature this way without touching the keyboard during the build-test-fix loop.

The rest of this post breaks down how it works and how you can run the whole thing yourself.

The Problem with Vibe Coding