Alibaba has released Qwen2-Math mathematical language models that are better than analogues from OpenAI and Google

Aug 13, 2024

Alibaba Group Holding continues to actively work in the field of artificial intelligence. This week, the e-commerce giant released several large language models (LLMs) under the collective name Qwen2-Math, which are focused on solving complex mathematical problems and, according to the developers, do it better than AI algorithms from other companies.

Image Source: Shutterstock

In total, three large language models were presented, which differ from each other in the number of parameters that affect the accuracy of the algorithm’s answers. The model with the most parameters, Qwen2-Math-72B-Instruct, according to the developers, is superior to many AI algorithms in terms of solving mathematical problems, including GPT-4o from OpenAI, Claude 3.5 Sonnet from Anthropic, Gemini 1.5 Pro from Google and Llama-3.1 -405B from Meta✴ Platforms.

«Over the past year, we have done a lot of work exploring and expanding the logical capabilities of large language models, with a particular focus on their ability to solve arithmetic and mathematical problems. We hope that Qwen2-Math will contribute to the community’s efforts to solve complex mathematical problems.” message from the developers.

Qwen2-Math’s language models were tested against a variety of benchmarks, including GSM8K (8,500 complex and varied high school-level math problems), OlympiadBench (a high-level bilingual multimodal science benchmark), and Gaokao (one of the toughest university-level math entrance exams). It is noted that the new models have some limitations due to “support for English only.” In the future, the developers plan to create bilingual and multilingual LLMs.

Network news Technology and IT market. news

Alibaba has released Qwen2-Math mathematical language models that are better than analogues from OpenAI and Google

Related Post

Threads gets ‘long overdue improvements’ to search and trends

Ubisoft spoke about the capabilities and innovations of stealth mechanics in Assassin’s Creed Shadows – new gameplay

Assembly of the second NASA SLS rocket has started – in a year it will send people on a flight around the Moon

Leave a Reply Cancel reply

You missed

Threads gets ‘long overdue improvements’ to search and trends

Ubisoft spoke about the capabilities and innovations of stealth mechanics in Assassin’s Creed Shadows – new gameplay

Assembly of the second NASA SLS rocket has started – in a year it will send people on a flight around the Moon

The creators of Black Myth: Wukong will surprise players before the end of the year – teaser from the head of Game Science