
Google bakes computer control directly into Gemini 3.5 Flash, letting the model see and operate your screen
Quick Answer
Google has embedded 'Computer Use' functionality in Gemini 3.5 Flash, enabling it to autonomously control devices.
Quick Take
With 'Computer Use' built in, Gemini 3.5 Flash scores 78.4 on the OSWorld benchmark, on par with GPT-5.5. Developers can use the Gemini API to build agents for software testing and office automation.
Key Points
- Gemini 3.5 Flash can operate computers, browsers, and mobile devices autonomously.
- Achieved a score of 78.4 on the OSWorld benchmark, comparable to GPT-5.5.
- Developers can utilize the Gemini API for software testing and office automation.
- Computer Use is built into the model itself, so it can see and operate the screen directly.
- Potential applications include automated testing and streamlined office workflows.
Article Excerpt
From source RSS / original summary: Google has integrated "Computer Use" directly into Gemini 3.5 Flash, letting the model operate computers, browsers, and mobile devices on its own. On the OSWorld benchmark, it scores 78.4, putting it on par with GPT-5.5. Developers can use the Gemini API to build agents for software testing or office automation. The article "Google bakes computer control directly into Gemini 3.5 Flash, letting the model see and operate your screen" appeared first on The Decoder.

