Artificial Intelligence, Machine Learning, Neural Networks Technology and IT market. news

Openay Deep Research showed a record result in the most difficult “last exam of mankind”

Feb 5, 2025

Image source: scale.com

Benchmark, created by experts from around the world, contains extremely complex questions and tasks on knowledge and reasoning – even some people cannot understand individual questions in it, not to mention the answer to them. Soon after her exit, the list of leaders in the exam was headed by the reasoning model of the Deepseek R1 AI, which gave 9.4 % of the correct answers. Openai O3-Mini models with a result of 10.5 % and O3-Mini-High could overtake it, which scored 13 %-the latter is really more powerful, but it also works slower. But the result was shown by the Aegent Openai Deep Research more impressive-it scored 26.6 %, thereby driving the previous less than 10 days.

IOS Apps Technology and IT market. news

China is preparing an antitrust investigation against Apple because of the commissions in the App Store

Feb 5, 2025 admin

Games Technology and IT market. news

The creators of Deep Rock Galactic shared the entertaining statistics of players in 2024, and sales reached 10 million copies

Feb 5, 2025 admin

Shooter Technology and IT market. news

Electronic Arts is ready to postpone the release of the “largest Battlefield in history” under the threat of a collision with GTA VI

Feb 5, 2025 admin

Openay Deep Research showed a record result in the most difficult “last exam of mankind”

Related Post

China is preparing an antitrust investigation against Apple because of the commissions in the App Store

The creators of Deep Rock Galactic shared the entertaining statistics of players in 2024, and sales reached 10 million copies

Electronic Arts is ready to postpone the release of the “largest Battlefield in history” under the threat of a collision with GTA VI

Leave a Reply Cancel reply

You missed

China is preparing an antitrust investigation against Apple because of the commissions in the App Store

The creators of Deep Rock Galactic shared the entertaining statistics of players in 2024, and sales reached 10 million copies

Electronic Arts is ready to postpone the release of the “largest Battlefield in history” under the threat of a collision with GTA VI

Starliner long -suffering space project brought Boeing $ 523 million last year