Google just gave its AI model a mouse and keyboard. Gemini 3.5 Flash, announced on May 19, 2026, is the company’s first production model built to actually use computers the way humans do: clicking through interfaces, reading screenshots, and executing multi-step workflows across browsers and desktops.
The model scored 78.4% on OSWorld-Verified, the benchmark that measures how well an AI can navigate real operating systems and applications.
What computer use actually means
Gemini 3.5 Flash can look at what’s on screen via screenshots, identify UI elements like buttons and text fields, and then take actions: clicking, typing, scrolling, switching tabs.
The model ships with a 1 million token context window. It also supports function calling and structured outputs, the technical plumbing that lets developers wire it into existing systems.








