This is a submission for the Google I/O Writing Challenge

On-device AI has spent most of its existence being impressive in demos and limited everywhere else. Google just changed the constraint that mattered most: the model couldn't reach anything outside the app sandbox.

The Problem It's Solving

Local inference is great for privacy and latency. It's lousy for usefulness. A model running entirely on your phone can answer questions from its training data and nothing else — no calendar, no inbox, no live web, no external tools. You get an isolated reasoning engine that can't act on the world around it.

That's the fundamental tension in edge AI: the moment you connect a model to external systems, you typically route the requests through a server. The privacy story falls apart. The latency goes up. The offline capability disappears.