TL;DRAI

Claude and Gemini test showed model lineage is secondary; repo context and prompt quality catch critical issues diff-only setups miss. For engineering teams: harness configuration and prompt quality outweigh model selection in code review automation ROI.

What I actually found when I set out to test heterogeneous AI code review.

For the last couple of months, I've been running a two-agent code review workflow in my terminal. Left window: Claude Code doing implementation. Right window: a second Claude Code instance prompted to be adversarial, specifically tasked with finding problems in whatever the left window produced. It worked surprisingly well.

A few weeks ago, someone sent me a Lenny's Podcast episode featuring Dan Shipper. One of the things he talked about was using competing frontier models for coding and reviewing, the idea being that different model lineages have different blind spots, and a reviewer trained differently than the author catches things the author misses. That felt like an important gap in the workflow. I set out to fill it.

I went in expecting to write a post about model diversity as a reliability strategy. That's not what happened.

The plan that didn't survive contact with the code

dev.to

I Can't Tell If the Model Matters

What I actually found when I set out to test heterogeneous AI code review. For the last couple of...

venerdì 19 giugno 2026 New tab

TL;DRAI

1,178 words~5 min read

What I actually found when I set out to test heterogeneous AI code review.

I went in expecting to write a post about model diversity as a reliability strategy. That's not what happened.

The plan that didn't survive contact with the code

I Can't Tell If the Model Matters

I Can't Tell If the Model Matters

Other newsrooms on this story

Related reading

AdamsReview: Multi-Agent PR Reviews for Claude Code, Reviewed

I built a multi-agent AI workflow with Claude Code + Java/Spring Boot…

How I Used Claude to Finish Building an AI That Evaluates AI — and Caught It…

LocalityLens: Why Your AI Coding Agent Gets Lost in Your Codebase

Which AI Tool Wins? Wrong Question.

The Real Power of Claude Code Isn’t the Code Generation

Other newsrooms on this story

Related reading

AdamsReview: Multi-Agent PR Reviews for Claude Code, Reviewed

I built a multi-agent AI workflow with Claude Code + Java/Spring Boot…

How I Used Claude to Finish Building an AI That Evaluates AI — and Caught It…

LocalityLens: Why Your AI Coding Agent Gets Lost in Your Codebase

Which AI Tool Wins? Wrong Question.

The Real Power of Claude Code Isn’t the Code Generation