Tech

Alibaba Releases QwQ-32B Reasoning-Focused AI Model in Preview to Take on OpenAI’s GPT-o1

ByNews Polite November 29, 2024

Alibaba released a new artificial intelligence (AI) model on Thursday, which is said to rival OpenAI’s GPT-o1 series models in reasoning capability. Launched in preview, the QwQ-32B large language model (LLM) is said to outperform GPT-o1-preview in several mathematical and logical reasoning-related benchmarks. The new AI model is available to download on Hugging Face, however it is not fully open-sourced. Recently, another Chinese AI firm released an open-source AI model DeepSeek-R1, which was claimed to rival ChatGPT-maker’s reasoning-focused foundation models.

Alibaba QwQ-32B AI Model

In a blog post, Alibaba detailed its new reasoning-focused LLM and highlighted its capabilities and limitations. The QwQ-32B is currently available as a preview. As the name suggests, it is built on 32 billion parameters and has a context window of 32,000 tokens. The model has completed both pre-training and post-training stages.

Coming to its architecture, the Chinese tech giant revealed that the AI model is based on transformer technology. For positional encoding, QwQ-32B uses Rotary Position Embeddings (RoPE), along with Switched Gated Linear Unit (SwiGLU) and Root Mean Square Normalization (RMSNorm) functions, as well as Attention Query-Key-Value Bias (Attention QKV) bias.

Just like the OpenAI GPT-o1, the AI model shows its internal monologue when assessing a user query and trying to find the right response. This internal thought process lets QwQ-32B test various theories and fact-check itself before it presents the final answer. Alibaba claims the LLM scored 90.6 percent in the MATH-500 benchmark and 50 percent in the AI Mathematical Evaluation (AIME) benchmark during internal testing and outperformed the OpenAI’s reasoning-focused models.

Notably, AI models with better reasoning are not proof of models becoming more intelligent or capable. It is simply a new approach, also known as test-time compute, that lets models spend additional processing time to complete a task. As a result, the AI can provide more accurate responses and solve more complex questions. Several industry veterans have pointed out that newer LLMs are not improving at the same rate as their older versions, suggesting the existing architectures are reaching a saturation point.

As QwQ-32B spends additional processing time on queries, it also has several limitations. Alibaba stated that the AI model can sometimes mix languages or switch between them giving rise to issues such as language-mixing and code-switching. It also tends to enter reasoning loops and apart from mathematical and reasoning skills, other areas still require improvements.

Notably, Alibaba has made the AI model available via a Hugging Face listing and both individuals and enterprises can download it for personal, academic, and commercial purposes under the Apache 2.0 licence. However, the company has not made the model weights and data available, which means users cannot replicate the model or understand how the architecture functions.

Check out our Latest News and Follow us at Facebook

Original Source

Tech

Five9 Plans to Expand in Europe With 2 Data Centres, Relocates Russian Staff to Portugal

ByNews Polite May 11, 2022

Five9, the US call centre software firm whose shareholders spurned a merger with Zoom last year, is looking to expand in Europe by setting up two data centres and relocating its employees in Russia to Portugal. The data centres will be in Frankfurt and Amsterdam, and serve customers in Europe,…

Tech

WhatsApp Beta Introduces AI-Generated Group Icons, Meta AI Widget

ByNews Polite March 8, 2025

#news #newstoday #tech #technews #latestnews #techupdates #newsupdates WhatsApp has started testing two new features on recent beta versions of the messaging app on Android. The Meta-owned chat service has added a new widget that makes it easier for users to access its Meta AI chatbot, without having to open the…

Tech

Binance CEO Hits Out at ‘Chinese Company’ Label and His Connections to the Country

ByNews Polite September 2, 2022

Binance CEO Changpeng Zhao aka CZ has hit back at critics who claim that the platform is “Chinese.” In a blog post, the company chief specifies that the executive team is now dominated primarily by Europeans and Americans, while the broader workforce is more globally dispersed. CZ added, “The inference…

Tech

Infinix Hot 30 5G Design, Specifications Officially Confirmed; Could Launch on July 14

ByNews Polite July 4, 2023

Infinix is all set to unveil the Infinix Hot 30 5G smartphone in India soon. The company has revealed the expected launch date along with a few key specifications and price range for the upcoming smartphone. The phone is expected to launch next week. The company has also confirmed the…

Tech

Google Pixel 8, Pixel 8 Pro with Tensor G3, Android 14, and Upgraded Cameras Launched: Price, Specifications

ByNews Polite October 4, 2023

Google on Wednesday launched its new Pixel 8 and Pixel 8 Pro smartphones at the company’s Made by Google 2023 hardware launch event. The latest smartphones from the search giant are powered by a Tensor G3 chip, offer up to 256GB of inbuilt storage, and run on Android 14 out-of-the-box. The standard…

Tech

Samsung Galaxy Watch 6 Pro Design Could Be Similar to Galaxy Watch 4 Classic: All Details

ByNews Polite April 26, 2023

Samsung Galaxy Watch 6 Pro and Galaxy Watch 6 are expected to go official in the coming months alongside the next generation of foldable smartphones. It was rumoured earlier this week that the Galaxy Watch 6 series would be powered by a new Exynos W980 SoC. Now, another leak suggests…

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Alibaba Releases QwQ-32B Reasoning-Focused AI Model in Preview to Take on OpenAI’s GPT-o1

Alibaba QwQ-32B AI Model

Five9 Plans to Expand in Europe With 2 Data Centres, Relocates Russian Staff to Portugal

WhatsApp Beta Introduces AI-Generated Group Icons, Meta AI Widget

Binance CEO Hits Out at ‘Chinese Company’ Label and His Connections to the Country

Infinix Hot 30 5G Design, Specifications Officially Confirmed; Could Launch on July 14

Google Pixel 8, Pixel 8 Pro with Tensor G3, Android 14, and Upgraded Cameras Launched: Price, Specifications

Samsung Galaxy Watch 6 Pro Design Could Be Similar to Galaxy Watch 4 Classic: All Details

Leave a Reply Cancel reply

A Scenic Tour of Red Tape: Tracking the Slowest High-Speed Train in the Country

Best SKIMS Push-Up Bra for Natural-Looking Cleavage Is Back

This Alabama City Faces a Culture War, With Its Public Library at the Center

Musical Chairs Inside the White House Press Room

Israeli army calls up tens of thousands of reservists for Gaza offensive

Alibaba QwQ-32B AI Model

Similar Posts

Leave a Reply Cancel reply