Google unveils Gemma 2 2B, a compact language model that outperforms GPT-3.5 Turbo

Google has unveiled Gemma 2 2B, a compact yet capable language model that competes with industry leaders despite its significantly smaller size. With just 2.6 billion parameters, the new model delivers performance on par with much larger peers, including OpenAI's GPT-3.5 and Mistral AI's Mixtral 8x7B.

In the LMSYS Chatbot Arena, a popular online platform for benchmarking and comparing the quality of AI models, Gemma 2 2B scored 1130 points. That puts it slightly ahead of GPT-3.5-Turbo-0613 (1117 points) and Mixtral-8x7B (1114 points), models with ten times as many parameters.

Google says Gemma 2 2B also scored 56.1 on the MMLU (Massive Multitask Language Understanding) benchmark and 36.6 on the MBPP (Mostly Basic Python Programming) benchmark, both significant improvements over the previous version.

Gemma 2 2B challenges the conventional wisdom that larger language models inherently perform better than smaller ones. Its results suggest that sophisticated training methods, architectural efficiency, and high-quality datasets can compensate for a lower parameter count. The model's development also highlights the growing importance of AI model compression and distillation techniques. The ability to efficiently distill knowledge from larger models into smaller ones opens the door to more affordable AI tools without sacrificing performance.
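
To make the idea of distillation more concrete, here is a minimal sketch of a generic, textbook-style distillation loss: the smaller "student" model is penalized when its softened output distribution diverges from that of the larger "teacher". This is an illustration only and not a description of how Google trained Gemma 2 2B; the function names, the temperature value, and the three-token logits are all hypothetical.

```python
import math


def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities, softened by the temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) computed on temperature-softened distributions."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    # Scaling by T^2 is a common convention that keeps gradient magnitudes
    # comparable across temperatures.
    return kl * temperature ** 2


# Hypothetical logits over a three-token vocabulary (assumed values).
teacher = [4.0, 1.0, 0.5]
close_student = [3.8, 1.1, 0.4]  # roughly agrees with the teacher
far_student = [0.5, 1.0, 4.0]    # prefers a different token

print(distillation_loss(teacher, close_student))
print(distillation_loss(teacher, far_student))
```

A student that tracks the teacher closely gets a loss near zero, while one that prefers a different token is penalized more heavily. In practice a term like this is typically combined with the ordinary next-token training loss.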

Google trained Gemma 2 2B on a massive dataset of 2 trillion tokens using systems powered by its proprietary TPU v5e AI accelerators. Support for multiple languages expands its potential for use in global applications. The Gemma 2 2B model is open source, and researchers and developers can access it through the Hugging Face platform. It also supports various frameworks, including PyTorch and TensorFlow.
