Inference Theft: Your AI Endpoint Is Someone Else's Free Model

Earlier this year, a bunch of people figured out they could use a customer service chatbot for a popular fast-food chain as a free coding assistant. It went viral. Some customers came looking for burritos and others left with LeetCode solutions. Everyone got what they wanted except the company paying for the inference.

The chatbot was backed by a capable general-purpose model with no way to enforce what it should and shouldn't answer. If you asked it to invent a novel approach to bubble sort, it would try. The model didn't know it was only supposed to be a burrito bot; it just saw a prompt and responded.

If your AI endpoint doesn't restrict who can send it requests and limit what it will and won't answer, any general-purpose model you expose becomes a general-purpose model for everyone, on your dime.

That's an easy way to become the victim of inference theft.

Inference theft occurs when someone repurposes your AI application as a model endpoint that you never intended to expose. They route requests through your application and let you pay the inference bill. Inference theft is one of the fastest ways to create a denial-of-wallet event.
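To make those two controls concrete, here is a minimal sketch of a gate that sits in front of the model call. It assumes three checks: the caller is known, the prompt looks on-topic, and the caller is still under a usage cap. Every name in it (Gate, ALLOWED_KEYS, ALLOWED_TOPICS, the budget figure) is hypothetical and chosen for illustration, and the word count is only a crude stand-in for real token accounting.

```python
from dataclasses import dataclass, field

# Hypothetical values, for illustration only.
ALLOWED_KEYS = {"demo-key-123"}
ALLOWED_TOPICS = ("order", "menu", "burrito", "delivery", "refund")


@dataclass
class Gate:
    """Decides whether a request may be forwarded to the model."""
    budget: int = 2_000                              # assumed per-caller cap
    used: dict[str, int] = field(default_factory=dict)

    def check(self, api_key: str, prompt: str) -> tuple[bool, str]:
        # 1. Restrict who can send requests.
        if api_key not in ALLOWED_KEYS:
            return False, "unauthenticated"

        # 2. Limit what the endpoint will answer (naive keyword match).
        text = prompt.lower()
        if not any(topic in text for topic in ALLOWED_TOPICS):
            return False, "out_of_scope"

        # 3. Cap spend per caller; word count is a rough token proxy.
        cost = len(prompt.split())
        spent = self.used.get(api_key, 0)
        if spent + cost > self.budget:
            return False, "budget_exceeded"

        self.used[api_key] = spent + cost
        return True, "ok"


gate = Gate()
print(gate.check("unknown-key", "Where is my order?"))
print(gate.check("demo-key-123", "Write a bubble sort in Python"))
print(gate.check("demo-key-123", "Where is my order?"))
```

The keyword match is the weakest part of this sketch: a prompt that mentions "order" while asking for code would pass. A real deployment would likely need a stronger scope check, but the shape stays the same, with rejection happening before any inference is paid for.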

