Google Unveils Gemini 1.5, Meta Introduces Predictive Visual Machine Learning Model V-JEPA

Google and Meta made notable artificial intelligence (AI) announcements on Thursday, unveiling new models with significant advancements. The search giant unveiled Gemini 1.5, an updated AI model that comes with long-context understanding across different modalities. Meanwhile, Meta announced the release of its Video Joint Embedding Predictive Architecture (V-JEPA) model, a non-generative teaching method for advanced machine learning (ML) through visual media. Both offer new ways of exploring AI capabilities. Notably, OpenAI also introduced its first text-to-video generation model, Sora, on Thursday.

Google Gemini 1.5 model details

Demis Hassabis, CEO of Google DeepMind, announced the release of Gemini 1.5 via a blog post. The newer model is built on the Transformer and Mixture of Experts (MoE) architecture. While it is expected to have different versions, only the Gemini 1.5 Pro model has been released so far, for early testing. Hassabis said that the mid-size multimodal model can perform tasks at a similar level to Gemini 1.0 Ultra, the company’s largest generative model, which is available through the Gemini Advanced subscription included in the Google One AI Premium plan.

The biggest improvement with Gemini 1.5 is its capability to process long-context information. The standard Pro version comes with a 128,000-token context window. In comparison, Gemini 1.0 had a context window of 32,000 tokens. Tokens can be understood as whole units or subsections of words, images, videos, audio or code, which act as the building blocks a foundation model uses to process information. Hassabis explained that the bigger a model’s context window, the more information it can take in and process in a given prompt, “making its output more consistent, relevant and useful.”

Alongside the standard Pro version, Google is also releasing a special model with a context window of up to 1 million tokens. This is being offered to a limited group of developers and enterprise clients in a private preview. While there is no dedicated platform for it, it can be tried out via Google’s AI Studio, a cloud console tool for testing generative AI models, and Vertex AI. Google says this version can process one hour of video, 11 hours of audio, codebases with over 30,000 lines of code, or over 700,000 words in one go.
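To put the context window figures in perspective, here is a minimal Python sketch that checks which of the quoted windows a prompt of a given size would fit into. The token limits are the ones cited above; the dictionary labels and the helper name `models_that_fit` are made up for illustration and are not part of any Google API. Real token counts depend on the model's own tokeniser, so the numbers passed in here are assumptions.

```python
# Context window sizes as quoted in this article (in tokens).
# The labels are illustrative, not official model identifiers.
CONTEXT_WINDOWS = {
    "Gemini 1.0": 32_000,
    "Gemini 1.5 Pro (standard)": 128_000,
    "Gemini 1.5 Pro (private preview)": 1_000_000,
}


def models_that_fit(prompt_tokens: int) -> list[str]:
    """Return the labels whose context window can hold the whole prompt."""
    return [
        name
        for name, window in CONTEXT_WINDOWS.items()
        if prompt_tokens <= window
    ]


# How much larger the standard 1.5 Pro window is than the 1.0 window.
growth_factor = (
    CONTEXT_WINDOWS["Gemini 1.5 Pro (standard)"] / CONTEXT_WINDOWS["Gemini 1.0"]
)

if __name__ == "__main__":
    print(f"Standard 1.5 Pro window is {growth_factor:.0f}x the 1.0 window")
    # A hypothetical 100,000-token prompt, for example a long report.
    print(models_that_fit(100_000))
```

By these figures, the standard Gemini 1.5 Pro window is four times the size of the Gemini 1.0 window, and a 100,000-token prompt would overflow the older model while fitting in both 1.5 Pro variants.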

Meta V-JEPA model details

Meta announced the public release of V-JEPA in a post on X (formerly known as Twitter). It is not a generative AI model, but a teaching method that enables ML systems to understand and model the physical world by watching videos. The company called it an important step towards advanced machine intelligence (AMI), a vision of Yann LeCun, one of the three ‘Godfathers of AI’.

In essence, it is a predictive analysis model that learns entirely from visual media. It can not only understand what’s going on in a video but also predict what comes next. To train it, the company claims to have used a new masking technique in which parts of the video were masked in both time and space. This means that some frames were removed entirely, while other frames had blacked-out fragments, which forced the model to predict both the current frame and the next one. As per the company, the model was able to do both efficiently. Notably, the model can predict and analyse videos of up to 10 seconds in length.
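The idea of masking a video in both time and space can be illustrated with a small Python sketch. This is not Meta's actual masking strategy, which this article does not detail; it only mirrors the description above, where some frames are hidden entirely and others have fragments blacked out. The function name, the patch grid and the two ratios are hypothetical choices made for the example.

```python
import random


def make_spacetime_mask(num_frames, grid_h, grid_w,
                        drop_frame_ratio=0.25, patch_ratio=0.5, seed=0):
    """Build mask[t][row][col]; True means that patch is hidden from the model.

    Illustrative only: the ratios and the patch grid are assumptions.
    """
    rng = random.Random(seed)

    # Masking in time: pick whole frames to hide completely.
    num_dropped = int(num_frames * drop_frame_ratio)
    dropped_frames = set(rng.sample(range(num_frames), num_dropped))

    patches_per_frame = grid_h * grid_w
    num_hidden = int(patches_per_frame * patch_ratio)

    mask = []
    for t in range(num_frames):
        if t in dropped_frames:
            # Every patch of this frame is hidden.
            frame = [[True] * grid_w for _ in range(grid_h)]
        else:
            # Masking in space: black out a random subset of patches.
            hidden = set(rng.sample(range(patches_per_frame), num_hidden))
            frame = [
                [(row * grid_w + col) in hidden for col in range(grid_w)]
                for row in range(grid_h)
            ]
        mask.append(frame)
    return mask


if __name__ == "__main__":
    mask = make_spacetime_mask(num_frames=8, grid_h=4, grid_w=4)
    for t, frame in enumerate(mask):
        hidden_count = sum(cell for row in frame for cell in row)
        print(f"frame {t}: {hidden_count} of 16 patches hidden")
```

A model trained on such input would only see the patches marked `False` and would be asked to predict what lies behind the rest.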

“For example, if the model needs to be able to distinguish between someone putting down a pen, picking up a pen, and pretending to put down a pen but not actually doing it, V-JEPA is quite good compared to previous methods for that high-grade action recognition task,” Meta said in a blog post.

At present, the V-JEPA model only uses visual data, which means the videos do not contain any audio input. Meta is now planning to incorporate audio alongside video in the ML model. Another goal for the company is to improve its capabilities in longer videos.
