The End of Manual QA: How I Built a Self-Testing App with Claude Code and Waterwheel Agent

What if your AI coding agent could write code, test it, fix bugs, and ship — all without a single human in the loop?

That's not a thought experiment. I just did it — and the whole thing cost one cent in test runs.

Here's the short version: I wired Claude Code (as the code agent) together with Waterwheel (as a browser test agent) so that one implements features and the other verifies them autonomously. The tests are plain Markdown files. When a test fails, the code agent reads the failure, fixes the bug, and re-runs — no human in between. I shipped a complete user-authentication feature this way without touching the keyboard during the build-test-fix loop.

The rest of this post breaks down how it works and how you can run the whole thing yourself.

The Problem with Vibe Coding

What if your AI coding agent could write code, test it, fix bugs, and ship — all without a single human in the loop?

That's not a thought experiment. I just did it — and the whole thing cost one cent in test runs.

The rest of this post breaks down how it works and how you can run the whole thing yourself.

The Problem with Vibe Coding

The End of Manual QA: How I Built a Self-Testing App with Claude Code and Waterwheel Agent

Other newsrooms on this story

The End of Manual QA: How I Built a Self-Testing App with Claude Code and Waterwheel Agent

Other newsrooms on this story

Related reading

AI doesn't write bad code. It writes plausible code — so I tried to break my…

I built a GitHub App that auto-generates adversarial tests for AI-written code…

How I Used Claude to Finish Building an AI That Evaluates AI — and Caught It…

I built an AI agent that runs manual test cases in a real browser

I built an AI QA agent in one week that tests your app like a real user

I built a multi-agent AI workflow with Claude Code + Java/Spring Boot…

Related reading

AI doesn't write bad code. It writes plausible code — so I tried to break my…

I built a GitHub App that auto-generates adversarial tests for AI-written code…

How I Used Claude to Finish Building an AI That Evaluates AI — and Caught It…

I built an AI agent that runs manual test cases in a real browser

I built an AI QA agent in one week that tests your app like a real user

I built a multi-agent AI workflow with Claude Code + Java/Spring Boot…