Code Fingerprinting: Detecting Duplicate Submissions Without Losing Your Mind (or Your API Budget)

While building LogicVisor's review system, I was faced with a question I hadn't thought about before: what would happen if users submitted duplicate algorithm solutions?

I decided to test it myself. I submitted a solution to a simple palindrome challenge. The request went through, my solution was reviewed correctly. Normal happy stuff. Then I submitted the exact same solution a second time. And the normal happy stuff happened again. But I was far from happy.

That duplicate submission meant more tokens spent, more database memory consumed, more remote procedure calls, more AI calls, and more waiting time. For a problem that had already been solved.

I shuddered at the thought of a bad actor effectively DOS-ing my site or running up my AI bill just by hammering the same solution repeatedly. Rate limiting controls the number of requests per client — but it has nothing to do with the content of those requests. A user could stay well within their rate limit and still abuse the system by submitting the same code over and over.

This wasn't going to work. I needed a way to identify duplicate submissions and return a cached response instead of processing everything from scratch. Which led me to a more interesting question: how do you actually detect identical code?

While building LogicVisor's review system, I was faced with a question I hadn't thought about before: what would happen if users submitted duplicate algorithm solutions?

That duplicate submission meant more tokens spent, more database memory consumed, more remote procedure calls, more AI calls, and more waiting time. For a problem that had already been solved.

Code Fingerprinting: Detecting Duplicate Submissions Without Losing Your Mind (or Your API Budget)

Code Fingerprinting: Detecting Duplicate Submissions Without Losing Your Mind (or Your API Budget)

Related reading

How I Finished My AI Code Reviewer Using GitHub Copilot

Two AI reviews agreeing is not two reviews: how I learned to test claims before…

Stop Drowning in AI Code Review Suggestions

When Duplicate Code Is the Better Design

I Thought My AI Code Reviewer Was Finished. Then a Single Hallucinated Line…

The 10 Svelte 5 & SvelteKit footguns your AI review bot waves through — and how…

Related reading

How I Finished My AI Code Reviewer Using GitHub Copilot

Two AI reviews agreeing is not two reviews: how I learned to test claims before…

Stop Drowning in AI Code Review Suggestions

When Duplicate Code Is the Better Design

I Thought My AI Code Reviewer Was Finished. Then a Single Hallucinated Line…

The 10 Svelte 5 & SvelteKit footguns your AI review bot waves through — and how…