Alibaba Cloud introduced the QWEN2.5-Max AI-model, which exceeds Deepseek V3 in key tests

Alibaba Cloud, a cloud unit of the Chinese company Alibaba, announced the release of an updated, large-scale language model QWEN2.5-MAX. AI model is based on the architecture of MIXTURE-OF-Experts (MOE) and trained in more than 20 trillions of tokens. The developers emphasize that the tool showed “significant progress in intellectual capabilities” and is already available for use.

Image source: Alibaba Cloud

The new version of the model is characterized by improved performance and accuracy, it is able to better cope with tasks that require a deep understanding of the context, such as the analysis of the text, translation and generation of content. “QWEN2.5-MAX demonstrates significant progress in processing complex requests and providing relevant answers,” the company writes on the pages of its blog.

QWEN2.5-MAX was tested in a number of key benchmarks, including MMLU-PRO, LiveCodebench, Livebench and Arena-Hard. The model showed superiority over DeepSeek V3 in tests such as Arena-Hard, Livebench and Livecodebench, and also demonstrated competitive results in Mmlu-Pro. Compared to other leading models, such as the GPT-4O and Claude-3.5-Sonnet, QWEN2.5-MAX also confirmed its leading positions.

Image source: Alibaba Cloud

Image source: Alibaba Cloud

Alibaba Cloud plans to integrate QWEN2.5-MAX into its cloud services, which will allow customers to use a tool to solve a wide range of problems, including automation of data processing, improving customer interaction through chat bots and optimizing business processes.

AI-model is already available through the QWEN Chat service, in which users can interact with QWEN2.5-MAX, test its capabilities and experiment with various functions. For developers, the API software interface is also open. For access, you need to register in Alibaba Cloud, activate the Model Studio service and create an API key.

admin

Share
Published by
admin

Recent Posts

Stardew Valley Gets Baldur’s Village Mod Featuring Baldur’s Gate 3 Characters – Sven Vincke Approves

A crossover between the fantasy RPG Baldur’s Gate 3 from Larian Studios and the farming…

40 minutes ago

Radeon RX 9070 XT is faster than GeForce RTX 5080 in Cyberpunk 2077 and 3DMark after undervolting

Well-known overclocker and YouTube blogger Der8auer (Roman Hartung) experimented with manual overclocking of the PowerColor…

2 hours ago

Asus Releases VU Air Ionizer Monitors With Built-in Air Ionizer

Asus has released three monitors of the VU Air Ionizer series, the main feature of…

3 hours ago

Millions of computers at risk of being hacked due to critical Python vulnerability

A critical vulnerability has been discovered in the popular Python object-oriented programming language package —…

3 hours ago

Split Fiction sold over a million copies in two days; It Takes Two took nearly a month to reach that milestone

Developers from the Swedish Hazelight Studios (It Takes Two, A Way Out) reported on the…

3 hours ago

Sony Reacts to Rumors of God of War Remasters Announced to Celebrate Series’ 20th Anniversary

Developers from Sony Interactive Entertainment's Santa Monica studio have adjusted fans' expectations for the event…

4 hours ago