Google DeepMind AI models solved math Olympiad problems at the level of a silver medalist

Google DeepMind, the London-based artificial intelligence (AI) research subsidiary of Google, has introduced AlphaProof and AlphaGeometry 2 AI models that can solve complex mathematical problems that current AI models cannot handle.

Image source: geralt/Pixabay

For a number of reasons, solving mathematical problems that require advanced reasoning abilities is not yet within the capabilities of most AI systems. The fact is that these types of problems require the formation and use of abstractions. It also requires complex hierarchical planning, setting subgoals, backtracking, and finding new paths, which is a difficult issue for AI.

Both new AI models have the ability to perform advanced mathematical reasoning to solve complex mathematical problems. AlphaProof was created using reinforcement learning, gaining the ability to prove mathematical statements in the formal Lean programming language. To create it, we used a pre-trained language model AlphaZero, a reinforcement learning algorithm that previously taught itself to play chess, shogi and go. In turn, AlphaGeometry 2 is an improved version of the existing AlphaGeometry AI system, introduced in January and designed to solve geometry problems.

While AlphaProof was trained to solve problems on a wide range of math topics, AlphaGeometry 2 is optimized for solving problems involving object movements and equations involving angles, ratios and distances. Because AlphaGeometry 2 was trained on significantly more synthetic data than its predecessor, it can handle much more complex geometry problems.

To test the capabilities of the new AI systems, Google DeepMind researchers tasked them with solving six problems from this year’s International Mathematical Olympiad (IMO) and proving the answers were correct. AlphaProof solved two algebra problems and one number theory problem, one of which was the hardest in the Olympiad, while AlphaGeometry 2 solved a geometry problem. Two problems in combinatorics remained unsolved.

Two renowned mathematicians, Tim Gowers and Joseph Myers, tested the solutions provided by the systems. They awarded each of the four correct answers the maximum number of points (seven out of seven), giving the systems a total of 28 points out of a maximum of 42. An Olympian who scored the same number of points would have been awarded a silver medal and would have fallen just short of gold, which awarded to those who score 29 points or more.

For the first time, an AI system was able to achieve medal-level results in solving IMO mathematical problems. “As a mathematician, I find this very impressive and a significant leap over what was previously possible,” Gowers said during a press conference.

Creating AI systems that can solve complex mathematical problems could pave the way for exciting human-AI collaborations, says Katie Collins, a researcher at the University of Cambridge. This, in turn, can help us learn more about how we humans do math. “There’s still a lot we don’t know about how people solve complex math problems,” she says.

admin

Share
Published by
admin

Recent Posts

Google simplified the management of a smart home-Google Home received a Gemini AI assistant

Smart home control in the Google ecosystem through the Gemini artificial intelligence assistant has become…

36 minutes ago

AMD Releases Optional Driver Supporting Marvel’s Spider-Man 2 and Final Fantasy VII Rebirth

AMD has released an optional update of the Radeon Software Adrenalin 25.1.1 driver. It added…

2 hours ago

Epic Games launches a game distribution program and will help compensate for Apple to IOS developers

Epic Games plans to add about 20 third-party games to its mobile app store on…

3 hours ago

Trump’s new executive order calls for the creation of a US national cryptocurrency reserve

Donald Trump, who during his first term criticized cryptocurrencies as a whole, by the time…

4 hours ago

Dasung has released a compact 10.3-inch monitor with an electronic ink matrix and an update frequency of 60 Hz

The Chinese company Dasung has released a compact monochrome touchscreen monitor, Paperlike 103, equipped with…

4 hours ago

Google launches accounts through the print scanner on Android

Google has launched a new security feature for Android 15 that will help protect users'…

4 hours ago