TL;DRComputer use is now a built-in tool in Gemini 3.5 Flash, replacing the standalone Gemini 2.5 computer use model with enterprise safeguards.
Google has made computer use a built-in tool inside Gemini 3.5 Flash, the model it launched at I/O 2026 as its fastest agentic AI model. The capability, which lets AI agents see screens, click, type, and scroll across browsers, mobile devices, and desktops, previously required a separate standalone model and is now available as a native tool through the Gemini API and the Gemini Enterprise Agent Platform, the renamed version of Vertex AI.
The update means developers no longer need to call a dedicated computer use model to build agents that interact with graphical interfaces. Instead, they can activate computer use as one of several tools within Flash, alongside code execution, search, and function calling. Product manager Mateo Quiros described the integration as giving Flash the ability to see, reason about, and take action on screens.
Google first released a standalone Gemini computer use model in October 2025, designed specifically for browser-based agent workflows. That model achieved roughly 70 percent accuracy on the Online-Mind2Web benchmark and was built around a screenshot-action loop where developers fed it a screen capture, received a structured command, executed it, and sent back the updated view. Folding the capability into Flash consolidates what was a two-model workflow into one.








