The latest version of Google Gemini AI beats GPT-4o and Claude-3 in tests

The latest version of Google's large language model, Gemini 1.5 Pro, has unexpectedly taken first place in the Chatbot Arena rankings, overtaking the established leaders in generative artificial intelligence, OpenAI GPT-4o and Anthropic Claude-3.


The previous champion, OpenAI GPT-4o, lost its lead on August 1, when Google quietly released an experimental build of its latest model. The build quickly drew attention from the AI community on social networks, where the benchmark win was taken as a sign of quality. OpenAI ChatGPT has become almost synonymous with generative AI since its launch back in the GPT-3 era. To date, OpenAI GPT-4o and Anthropic Claude-3 have been considered the established leaders, with almost no competitors in tests over the past year.


One of the most popular tests is LMSYS Chatbot Arena. It ranks models by scores derived from users' blind, head-to-head comparisons of their answers. The current version of GPT-4o scored 1286 points, and Claude-3 scored 1271. The previous Google Gemini 1.5 Pro had a score of 1261, but Gemini 1.5 Pro 0801, released on August 1, scored 1300. This may indicate that Google's new neural network is more capable than its competitors, but benchmarks don't always accurately reflect what an AI model can and can't do.
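To put the point gaps in perspective, here is a minimal sketch that converts a score difference into an expected head-to-head preference rate. It assumes the leaderboard scores behave like classic Elo ratings (base 10, scale 400); that scale and the helper name `expected_win_rate` are assumptions for illustration, not something stated in the article.

```python
def expected_win_rate(rating_a: float, rating_b: float) -> float:
    """Chance that model A is preferred over model B, using the
    standard Elo formula (assumed here: base 10, scale 400)."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


# Scores as quoted in the article.
scores = {
    "Gemini 1.5 Pro 0801": 1300,
    "GPT-4o": 1286,
    "Claude-3": 1271,
    "Gemini 1.5 Pro (previous)": 1261,
}

leader = "Gemini 1.5 Pro 0801"
for name, score in scores.items():
    if name != leader:
        p = expected_win_rate(scores[leader], score)
        print(f"{leader} vs {name}: {p:.1%}")
```

Under that assumption, a 14-point lead over GPT-4o corresponds to being preferred in roughly 52% of direct comparisons, which is a narrow margin rather than a decisive one.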

Today's chatbot market is mature enough to offer consumers multiple options and let them decide for themselves which AI suits them best. It is not yet clear whether the experimental Gemini 1.5 Pro will become the default version in the future. It remains publicly available, but given its experimental status it may be withdrawn or substantially changed for security or other reasons.

admin
