Google’s DeepMind lab has unveiled two new AI models that will help robots “perform a wider range of real-world tasks than ever before.” Gemini Robotics is a vision-language-action model that can understand new situations without prior training. And Gemini Robotics-ER is described by the company as an advanced model that can “understand our complex and dynamic world” and control the robot’s movements.
Image source: Google DeepMind
Gemini Robotics is built on Gemini 2.0, the latest version of Google’s flagship AI model. According to Carolina Parada, head of robotics at Google DeepMind, Gemini Robotics “takes Gemini’s multimodal understanding of the world and brings it into the real world by adding physical actions as a new modality.”
The new model is particularly strong in three key areas that Google DeepMind says are necessary to create truly useful robots: versatility, interactivity, and dexterity. In addition to being able to generalize to new scenarios, Gemini Robotics is better at interacting with people and their environments. The model is able to perform very precise physical tasks, such as folding a piece of paper or opening a bottle.
“While we’ve made progress in each of these areas individually in the past, we’re now delivering [dramatically] increasing performance in all three areas with a single model,” Parada said. “This allows us to create robots that are more capable, more responsive, and more resilient to changes in their environment.”
The Gemini Robotics-ER model is designed specifically for roboticists, who can connect it to the existing low-level controllers that drive a robot’s movements. Parada illustrated this with the example of packing a lunch box: there are objects on the table, and the robot needs to figure out where everything is, how to open the lunch box, how to pick up the objects, and where to put them. That is the chain of reasoning Gemini Robotics-ER follows.
The developers have paid serious attention to safety. Google DeepMind researcher Vikas Sindhwani explained how the lab uses a “layered approach” in which Gemini Robotics-ER models “learn to assess whether it is safe to perform a potential action in a given scenario.”
Google DeepMind has also developed a number of benchmarks and frameworks to help advance safety research in the AI field. Most notably, last year the lab introduced the “Robot Constitution,” a set of rules inspired by Isaac Asimov’s “Three Laws of Robotics” from his 1942 short story “Runaround.”
Google DeepMind is currently working with Apptronik to develop the “next generation of humanoid robots.” The lab has also made its Gemini Robotics-ER model available to “trusted testers,” including Agile Robots, Agility Robotics, Boston Dynamics, and Enchanted Tools.
“We’re completely focused on creating intelligence that can understand the physical world and act in that physical world,” Parada said. “We’re very excited to use that in multiple incarnations and in multiple applications for us.”
Recall that in September 2024, researchers from Google DeepMind demonstrated a learning method that allows a robot to perform actions requiring fine dexterity, such as tying shoelaces, hanging shirts, and even repairing other robots.