Google DeepMind unveils its first “thinking” robotics AI

Imagine that you want a robot to sort a pile of laundry into whites and colors. Gemini Robotics-ER 1.5 would process the request along with images of the physical environment (a pile of clothing). This AI can also call tools like Google search to gather more data. The ER model then generates natural language instructions, specific steps that the robot should follow to complete the given task.

The two new models work together to “think” about how to complete a task.

Gemini Robotics 1.5 (the action model) takes these instructions from the ER model and generates robot actions while using visual input to guide its movements. But it also goes through its own thinking process to consider how to approach each step. “There are all these kinds of intuitive thoughts that help [a person] guide this task, but robots don’t have this intuition,” said DeepMind’s Kanishka Rao. “One of the major advancements that we’ve made with 1.5 in the VLA is its ability to think before it acts.”

Both of DeepMind’s new robotic AIs are built on the Gemini foundation models but have been fine-tuned with data that adapts them to operating in a physical space. This approach, the team says, gives robots the ability to undertake more complex multi-stage tasks, bringing agentic capabilities to robotics.

The two new models work together to “think” about how to complete a task.

Google DeepMind unveils its first “thinking” robotics AI

Google DeepMind unveils its first “thinking” robotics AI

Other newsrooms on this story

Related reading

Meet a Smarter Spot, from Boston Dynamics and Google DeepMind

Google’s new robotics AI can run without the cloud and still tie your shoes

Google DeepMind’s new AI models can search the web to help robots complete tasks

Google DeepMind svela l'IA che rende i robot autonomi - Future Tech - Ansa.it

Google’s AI Boss Says Gemini's New Abilities Point the Way to AGI

Google DeepMind’s optimized AI model runs directly on robots

Other newsrooms on this story

Related reading

Meet a Smarter Spot, from Boston Dynamics and Google DeepMind

Google’s new robotics AI can run without the cloud and still tie your shoes

Google DeepMind’s new AI models can search the web to help robots complete tasks

Google DeepMind svela l'IA che rende i robot autonomi - Future Tech - Ansa.it

Google’s AI Boss Says Gemini's New Abilities Point the Way to AGI

Google DeepMind’s optimized AI model runs directly on robots