Sweeping i18n leaks with four parallel AI agents — from 300 candidates down to 60 real bugs

For any app past a certain size that's gone bilingual, the question "how much hardcoded Japanese is still hiding in our repo?" never quite goes away. A naive grep for [ぁ-んァ-ヶ一-龯] returns thousands of hits, and the vast majority are inside translation tables, already-branched code, or comments. The real leaks are buried.

For one cleanup pass we attacked this with four parallel AI investigation agents plus AST-based false-positive filtering. The result: ~300 candidates detected → ~60 real leaks → cleaned up across five rounds. This post walks through the flow and the most interesting bug it uncovered — paying English users had been getting Japanese email from the Stripe webhook for months.

Why a plain grep isn't enough

A repository-wide grep returns thousands of hits, but the contents fall into four bins: translation tables / already branched by lang == 'en' / comments and docstrings / real leaks. The first three are harmless. Only the last shows Japanese to English users. The trouble is that grep can't separate them, and the volume is too high for a human to triage one by one.

Four parallel agents for "wide and shallow" detection

Why a plain grep isn't enough

Four parallel agents for "wide and shallow" detection

Sweeping i18n leaks with four parallel AI agents — from 300 candidates down to 60 real bugs

Sweeping i18n leaks with four parallel AI agents — from 300 candidates down to 60 real bugs

Related reading

What I found when I security-scanned 10 AI-built apps (and how to check yours…

What my leak scanner catches — and the exact line where it stops

What I found scanning 3 AI agent codebases for unguarded tool calls

AI translation: post-editing best practices

Your AI Coding Agent Wastes 80% of Its Context. Fixed That with Graph Theory.

AI Coding Agents Need Runtime Telemetry Before Commit Telemetry

Related reading

What I found when I security-scanned 10 AI-built apps (and how to check yours…

What my leak scanner catches — and the exact line where it stops

What I found scanning 3 AI agent codebases for unguarded tool calls

AI translation: post-editing best practices

Your AI Coding Agent Wastes 80% of Its Context. Fixed That with Graph Theory.

AI Coding Agents Need Runtime Telemetry Before Commit Telemetry