Google taught the robot to carry out commands and drive around the office using the Gemini neural network

The Google DeepMind Robotics team demonstrated this week how the RT-2 robot, trained using the Google Gemini 1.5 Pro neural network, can carry out natural language commands and move around an office space.

Image source: Google DeepMind

DeepMind Robotics published a paper titled “Mobility VLA: Multimodal Instructional Navigation Using VLM with Long Context and Topological Graphs” in which a series of videos showed the robot performing various tasks in a 9,000 square meter office space. ft (836 m2).

In one video, a Google employee asks the robot to take him somewhere to draw. “Okay,” he replies, “give me a minute.” We are thinking together with Gemini…” The robot then leads the person to a wall-sized whiteboard.

In the second video, another employee asks the robot to follow directions on a board. He draws a simple map showing how to get to the Blue Zone. Once again, the robot thinks for a moment before following the specified route to a location that turns out to be a robotics testing site. “I have successfully followed the instructions on the board,” the robot reports.

Before recording videos, the robots were familiarized with the space using the Multimodal Instructional Navigation with Demonstration Tours (MINT) solution. Thanks to this, the robot can move around the office in accordance with various landmarks indicated using speech. DeepMind Robotics then used a hierarchical Vision-Language-Action (VLA) system “that combines environmental awareness with the power of common sense.” After combining the processes, the robot gained the ability to respond to written and drawn commands, as well as to gestures and navigate the area.

According to Google, in about 90% of 50 interactions with employees, robots successfully followed the instructions given to them.

admin

Share
Published by
admin

Recent Posts

TSMC CEO Reminds Compatriots That the Company Will Build 11 New Enterprises in Taiwan This Year Alone

The buzz surrounding TSMC's plans to increase its investment in the US by $100 billion…

8 minutes ago

The graphics card market showed growth last quarter, but the long-term outlook is weak

According to a new report from analyst firm Jon Peddie Research, the global market for…

18 minutes ago

Solar film has been printed in rolls like wallpaper

British company Power Roll, together with scientists from the University of Sheffield, reported progress in…

2 hours ago

By 2030, console gaming will leave PC gaming far behind, but mobile games will be in the lead

Apparently, in the near future the eternal dispute about what is more popular - games…

5 hours ago

Defective GPUs May Have Leaked Into GeForce RTX 50 Series Laptops — Now They Won’t Be Released on Time

According to German publication Heise, laptop manufacturers are working hard to thoroughly test new models…

6 hours ago

Robocop Returns in Unfinished Business Story DLC for RoboCop: Rogue City — Details and First Gameplay

Publisher Nacon and developers from the Polish studio Teyon (Terminator: Resistance) presented Unfinished Business -…

6 hours ago