Openay Deep Research showed a record result in the most difficult “last exam of mankind”

Image source: scale.com

Benchmark, created by experts from around the world, contains extremely complex questions and tasks on knowledge and reasoning – even some people cannot understand individual questions in it, not to mention the answer to them. Soon after her exit, the list of leaders in the exam was headed by the reasoning model of the Deepseek R1 AI, which gave 9.4 % of the correct answers. Openai O3-Mini models with a result of 10.5 % and O3-Mini-High could overtake it, which scored 13 %-the latter is really more powerful, but it also works slower. But the result was shown by the Aegent Openai Deep Research more impressive-it scored 26.6 %, thereby driving the previous less than 10 days.

admin

Share
Published by
admin

Recent Posts

Samsung loses faith in the feasibility of building new chip factories in the US

According to the South Korean publication The Korea Economic Daily, in January this year, the…

16 minutes ago

‘Wildly Intrigued’: Massive New Trailer Reveals Death Stranding 2: On the Beach Release Date

As expected, publisher Sony Interactive Entertainment and developer Kojima Productions used the SXSW 2025 festival…

17 minutes ago

US authorities have managed to find four buyers for TikTok’s American business

April 5 is approaching inexorably, and although Donald Trump has repeatedly stated that he may…

2 hours ago

Apple delays HomePad release, allows employees to test prototypes at home

Well-informed sources have been regularly reporting for a long time about Apple's attempts to develop…

5 hours ago

In the fantasy strategy Age of Wonders 4, starting April 1, you will be able to literally rob caravans

Developers from the Dutch Triumph Studios, together with the publisher Paradox Interactive, have decided on…

15 hours ago