Skip to main content
The Keyword
Take a closer look at our new Gemini models for robotics.
["What does AI mean for retail?", "How did Nano Banana get its name?", "How can AI help me plan travel?"]

Today, Google DeepMind announced a new family of Gemini models designed for robotics. Gemini Robotics is a vision-language-action (VLA) model that takes natural language and images as input and outputs actions, allowing robots to physically move and perform tasks. The second model is Gemini Robotics-ER, a reasoning model that enhances skills like identifying objects and their parts in 3D space.

Take a look at what robots can do using these Gemini models, from folding origami to packing lunches to spelling words with Scrabble tiles.

Related stories

Let’s stay in touch. Get the latest news from Google in your inbox.

Subscribe