Tech

Google DeepMind Is Integrating Gemini 1.5 Pro in Robots That Can Navigate Real-World Environments

ByNews Polite July 12, 2024

Google DeepMind shared new advancements made in the field of robotics and vision language models (VLMs) on Thursday. The artificial intelligence (AI) research division of the tech giant has been working with advanced vision models to develop new capabilities in robots. In a new study, DeepMind highlighted that using Gemini 1.5 Pro and its long context window has now enabled the division to make breakthroughs in navigation and real-world understanding of its robots. Earlier this year, Nvidia also unveiled new AI technology that powers advanced capabilities in humanoid robots.

Google DeepMind Uses Gemini AI to Improve Robots

In a post on X (formerly known as Twitter), Google DeepMind revealed that it has been training its robots using Gemini 1.5 Pro’s 2 million token context window. Context windows can be understood as the window of knowledge visible to an AI model, using which it processes tangential information around the queried topic.

For instance, if a user asks an AI model about “most popular ice cream flavours”, the AI model will check the keyword ice cream and flavours to find information to that question. If this information window is too small, then the AI will only be able to respond with the names of different ice cream flavours. However, if it is larger, the AI will also be able to see the number of articles about each ice cream flavour to find which has been mentioned the most and deduce the “popularity factor”.

DeepMind is taking advantage of this long context window to train its robots in real-world environments. The division aims to see if the robot can remember the details of an environment and assist users when asked about the environment with contextual or vague terms. In a video shared on Instagram, the AI division showcased that a robot was able to guide a user to a whiteboard when he asked it for a place where he could draw.

“Powered with 1.5 Pro’s 1 million token context length, our robots can use human instructions, video tours, and common sense reasoning to successfully find their way around a space,” Google DeepMind stated in a post.

In a study published on arXiv (a non-peer-reviewed online journal), DeepMind explained the technology behind the breakthrough. In addition to Gemini, it is also using its own Robotic Transformer 2 (RT-2) model. It is a vision-language-action (VLA) model that learns from both web and robotics data. It utilises computer vision to process real-world environments and use that information to create datasets. This dataset can later be processed by the generative AI to break down contextual commands and produce desired outcomes.

At present, Google DeepMind is using this architecture to train its robots on a broad category known as Multimodal Instruction Navigation (MIN) which includes environment exploration and instruction-guided navigation. If the demonstration shared by the division is legitimate, this technology might further advance robotics.

Check out our Latest News and Follow us at Facebook

Original Source

Tech

Samsung Galaxy S25, Galaxy Watch 7 to Be Equipped With 3nm Exynos Chips: Reports

ByNews Polite May 17, 2024

Samsung Galaxy S25 could be launched with a next-generation mobile processor built on the company’s 3nm technology, according to a report. The company’s upcoming flagship grade Exynos chip is expected to offer better efficiency than its Qualcomm counterpart, which is expected to arrive by the end of the year. Meanwhile,…

Tech

Coyote vs. Acme May Get New Distributor After Warner Bros. Shelves Project, Amazon Said to Be Prime Candidate

ByNews Polite November 14, 2023

Coyote vs. Acme might not get shelved after all. Following reports of Warner Bros. axing the live-action-animation hybrid film as a tax write-off, it appears as though it will now be sold to other distributors. Principle photography on the movie was completed a year ago in New Mexico and upon…

Tech

Elon Musk Says Tesla Could Lower Electric Vehicle Prices if Inflation Slows in Future

ByNews Polite July 15, 2022

Tesla Chief Executive Officer Elon Musk said on Friday the electric automaker could lower prices for cars if inflation calms down. Musk, who has over 100 million followers on Twitter, was replying to a tweet on Friday that asked if the company had any plans to lower prices that it…

Tech

Google Chrome Rolls Out Memory, Energy Saver Modes for Desktops: All Details

ByNews Polite February 20, 2023

Google Chrome is rolling out the Memory Saver and Energy Saver modes, which were announced last year, on Chrome for desktops. The features are now available on Mac, Windows, Linux, and Chromebooks and are turned on by default. Both Memory and Energy saver mode will boost the desktop’s performance, as…

Tech

Ola Electric Roadster Series EV Bikes Unveiled in India: All You Need to Know

ByNews Polite August 16, 2024

Ola Electric took the wraps off its latest range of electric two-wheelers dubbed the Roadster series in India at its Sankalp 2024 event on August 15. The event, held at the company’s new FutureFactory in Tamil Nadu, saw the debut of three new electric vehicle (EV) bikes: Roadster Pro, Roadster…

Tech

Vivo X90 Pro Global Variant With MediaTek Dimensity 9200 SoC,12GB RAM Spotted on Geekbench

ByNews Polite December 30, 2022

Vivo X90 Pro was launched in China last month alongside the vanilla Vivo X90 and Vivo X90 Pro+. The flagship smartphone is expected to debut in other global markets soon. Ahead of its launch outside China, Vivo X90 Pro was spotted on Geekbench benchmarking site, with the MediaTek Dimensity 9200…

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Google DeepMind Is Integrating Gemini 1.5 Pro in Robots That Can Navigate Real-World Environments

Google DeepMind Uses Gemini AI to Improve Robots

Samsung Galaxy S25, Galaxy Watch 7 to Be Equipped With 3nm Exynos Chips: Reports

Coyote vs. Acme May Get New Distributor After Warner Bros. Shelves Project, Amazon Said to Be Prime Candidate

Elon Musk Says Tesla Could Lower Electric Vehicle Prices if Inflation Slows in Future

Google Chrome Rolls Out Memory, Energy Saver Modes for Desktops: All Details

Ola Electric Roadster Series EV Bikes Unveiled in India: All You Need to Know

Vivo X90 Pro Global Variant With MediaTek Dimensity 9200 SoC,12GB RAM Spotted on Geekbench

Leave a Reply Cancel reply

Jenelle Evans Reveals Son Jace Lives With Dad

Rich Paul on Lakers’ Luka Doncic trade: “Names don’t win championships. Rosters do”

College Professors Are Using ChatGPT. Some Students Aren’t Happy.

European Commission wrong to deny release of von der Leyen messages, court says

Oppo Reno 14 Pro Display, Battery Details Revealed Ahead of Debut on May 15

Google DeepMind Uses Gemini AI to Improve Robots

Similar Posts

Leave a Reply Cancel reply