Google Unveils Gemini 2.5 Pro, Its Smartest AI Model Yet, Beating OpenAI o3

Google has announced the Gemini 2.5 Pro AI model, calling it “its smartest model yet.” The neural network is part of the Gemini 2.5 family and outperforms previous versions in data analysis, programming, and solving complex problems, supporting context for up to 1 million tokens.

Image source: Google

The key feature of the Gemini 2.5 Pro, like all models in the Gemini 2.5 family, is the ability to reason, visualizing its thought process before giving the user a more precise and final answer. Unlike the previous generation of models (Gemini 2.0 Flash Thinking), Google no longer uses the Thinking label or displays the reasoning process. However, as 9to5Google points out, users can manually activate the “think out loud” feature to see the bot’s thought process.

Overall, Gemini 2.5 Pro showed a significant jump in performance thanks to an improved base model and post-training tweaks. Google notes that this version topped the LMArena rankings, which evaluates models based on user preferences, and also showed better results in math (AIME 2025) and science (GPQA diamond).

At the same time, in the Humanity’s Last Exam test, which is created by experts to test the limits of artificial intelligence in the field of knowledge and logic, Gemini 2.5 Pro achieved a record 18.8% without using additional tools. The model also received significant improvements in programming, especially in creating web applications and editing code.

In the software development space, Gemini 2.5 Pro scored highly on the SWE-Bench Verified benchmark, scoring 63.8% using a dedicated agent approach. It also has built-in multimodality, handling text, audio, images, video, large data sets, and even full code repositories.

The model’s context window offers a size of 1 million tokens, and in the near future it will increase to 2 million. In the next few weeks, Gemini 2.5 Pro will appear in Vertex AI, and later Google will introduce a pricing policy that allows using the AI ​​model in large-scale projects. For now, the model is available to paid subscribers and developers in test mode.

admin

Share
Published by
admin

Recent Posts

Nissan Leaf EV to Become NACS-Ported Compact Crossover in Third Generation

Nissan Leaf can rightfully be considered a long-liver of the electric car market, since the…

3 days ago

OpenAI expects to more than triple its revenue this year and then double it next year.

OpenAI, the market leader in generative artificial intelligence systems, remains nominally a startup, its financial…

3 days ago

OpenAI Decides to Hold 4o Image Generation Launch for Free Users

OpenAI has been forced to delay the release of ChatGPT's built-in image generator for free…

3 days ago

1440p and 240Hz for just $200: Xiaomi updates the 27-inch Redmi G27Q gaming monitor

Xiaomi continues to update its Redmi G27Q gaming monitor every year. The model was first…

3 days ago

Beware, Android is shutting down: OS development will cease to be public, but there is no reason to panic

Android device makers can significantly customize the look and feel of the operating system, but…

3 days ago

Fake GeForce RTX 4090s with RTX 3090 chips have started popping up in China — craftsmen are even changing the GPU markings

In China, scammers have started selling GeForce RTX 3090 graphics cards, passing them off as…

3 days ago