Premium

What is Gemini Robotics, Google DeepMind’s new AI for humanoids?

Google DeepMind is bringing Gemini 2.0’s intelligence to general-purpose robotic agents in the physical world.

Gemini Robotics AI models have been equipped with safety measures and have been programmed to comply with ethical considerations. (Image: Google DeepMind)

Google has taken a major leap in robotics. Google DeepMind has launched Gemini Robotics, its suite of AI models meant to equip robots with the ability to perform complex physical tasks with unprecedented accuracy and dexterity.

The AI research lab has also launched Gemini Robotics ER along with Gemini Robotics, a combination of two innovative AI models that will allow robots to do complex tasks, even those physical tasks where it may not have prior training.

Gemini Robotic Models

The AI suit comprises two models—Gemini Robotics and Gemini Robotics ER. The Gemini Robotics is an advanced vision language system (VLS) that has been built upon the Gemini 2.0 framework, essentially adding physical actions to its output modality. Reportedly this model allows robots to process and respond to visual inputs, comprehend language commands, and execute complex physical tasks.

Story continues below this ad

Meanwhile, the Gemini Robotics ER is an AI model that helps robots with spatial understanding and embodied reasoning capabilities. Essentially, it allows roboticists to run their programs with enhanced performance. It allows them to adapt to different types of robots, from bi-arm platforms to humanoids like Apptronik’s Apollo. In the demo shared by the company, both models have displayed remarkable improvements over existing technologies. Gemini Robotics reported a 74.5 per cent success rate in in-distribution task performance compared to 42.6 per cent for multi-task diffusion policies.

With its versatility, the new models allow robots to perform a wide range of tasks with adaptability and precision. These tasks include folding intricate origami models, packing lunch items in Ziploc bags, tying shoelaces, and even figuring out how to slam dunk a basketball, although it has never seen it before.

Safety and specifications

Google DeepMind has implemented measures to ensure the responsible and reliable deployment of AI-powered robots. The Gemini Robotics models come with integrated safety protocols in their core functionality to prevent harmful actions. Besides, the company has introduced the Artificial Social Intelligence for Machines and Oversight Validation (ASIMOV) dataset, which has been designed to evaluate and improve the social intelligence of robots.

Reportedly, the models have been programmed to comply with ethical guidelines, essentially refusing requests. The company is also working with trusted partners to test and refine the safety features of Gemini Robotics under various real-world circumstances. The company continues to invest in research to enhance the safety and reliability of AI-powered robots.

Technology on smartphone reviews, in-depth reports on privacy and security, AI, and more. We aim to simplify the most complex developments and make them succinct and accessible for tech enthusiasts and all readers. Stay updated with our daily news stories, monthly gadget roundups, and special reports and features that explore the vast possibilities of AI, consumer tech, quantum computing, etc.on smartphone reviews, in-depth reports on privacy and security, AI, and more. We aim to simplify the most complex developments and make them succinct and accessible for tech enthusiasts and all readers. Stay updated with our daily news stories, monthly gadget roundups, and special reports and features that explore the vast possibilities of AI, consumer tech, quantum computing, etc.

Tags:
artificial intelligence