TL;DRAI

Creato Tribunal, skill Claude per code review: agenti specializzati (hater, cross-module, judge) collidono per verità oneste invece di equilibrio interno. Per CTO: pattern strategico — estrarre segnali da collisione tra ruoli, non da balance del modello. Replicabile per audit, quality gate, security review.

Every time I asked Claude to review my branch, I got one of two answers: a cheerful "Looks good! 👍" or a vague list where I couldn't tell a real bug from a matter of taste. The model wants to please you. That's exactly the problem.

So I built Tribunal — a Claude skill that reviews your diff adversarially, in stages, where the honest signal comes from agents fighting each other instead of one polite model.

The idea: don't ask one model to be fair

A single model told to "be critical" still hedges — it's trained to be agreeable. So instead of one balanced reviewer, Tribunal runs one-sided roles that collide:

One agent per file, deliberately biased. It tears the diff apart as if a clueless amateur wrote it — focused only on what changed. But strictly on the merits: correctness, races, leaks, edge cases, security. No style nitpicks.

dev.to

I stopped trusting Claude's code reviews, so I built a skill that puts my code on trial

Every time I asked Claude to review my branch, I got one of two answers: a cheerful "Looks good! 👍"...

sabato 13 giugno 2026 New tab

TL;DRAI

464 words~2 min read

So I built Tribunal — a Claude skill that reviews your diff adversarially, in stages, where the honest signal comes from agents fighting each other instead of one polite model.

The idea: don't ask one model to be fair

A single model told to "be critical" still hedges — it's trained to be agreeable. So instead of one balanced reviewer, Tribunal runs one-sided roles that collide:

I stopped trusting Claude's code reviews, so I built a skill that puts my code on trial

I stopped trusting Claude's code reviews, so I built a skill that puts my code on trial

Related reading

I built a local MCP server that gives Claude Code real PR context — 33s reviews…

Stop Using Claude Like a Rubber Duck: Real Code Review Strategies

I Trained Claude Code to Write Its Own Skills — Then Watched It Spiral Out of…

Two tiny Claude Code skills that fixed my two biggest agent problems

50 Reusable Claude Code Skills That'll Save You Hours Every Week

AdamsReview: Multi-Agent PR Reviews for Claude Code, Reviewed

Related reading

I built a local MCP server that gives Claude Code real PR context — 33s reviews…

Stop Using Claude Like a Rubber Duck: Real Code Review Strategies

I Trained Claude Code to Write Its Own Skills — Then Watched It Spiral Out of…

Two tiny Claude Code skills that fixed my two biggest agent problems

50 Reusable Claude Code Skills That'll Save You Hours Every Week

AdamsReview: Multi-Agent PR Reviews for Claude Code, Reviewed