TL;DR AI

DeepMind's AlphaProof Nexus solved 9 of 353 open Erdős problems, two of them unsolved for 56 years, at an inference cost of a few hundred dollars each. It uses Gemini 3.1 Pro to generate Lean proof steps that the Lean compiler verifies at every iteration. Simple agentic loops grounded by symbolic feedback are converging on specialized trained systems, signaling that LLM + compiler architectures are the next credible pattern for AI agents where logical reliability and auditability are non-negotiable.

AlphaProof Nexus combines LLM-driven proof generation with machine verification to crack open math research problems that have stumped mathematicians for decades.

Google DeepMind's new framework, AlphaProof Nexus, has autonomously solved nine of the 353 open Erdős problems it attempted, including two questions that had gone unanswered for 56 years.

The system also proved 44 out of 492 open conjectures from the On-Line Encyclopedia of Integer Sequences (OEIS), settled a 15-year-old question about Hilbert functions in algebraic geometry, and improved a known bound in convex optimization. Inference costs ran just a few hundred dollars per problem, according to the research paper.

Unlike approaches that may rely purely on natural language, such as OpenAI's recent solution, the underlying language model in AlphaProof Nexus (in this case Gemini 3.1 Pro) doesn't have to carry the entire logical chain on its own.

Instead, it generates proof steps in Lean's formal language, and the compiler checks each one. Error messages feed directly back into the next attempt. That way, the LLM gets grounded by symbolic feedback, a safety net that offsets the well-known weaknesses of language models when it comes to logical reasoning. Humans only step in at the very end to check the results.
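The generate, check, and retry cycle described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not DeepMind's implementation: `refine_until_verified`, `toy_generate`, and `toy_check` are hypothetical names, and the toy functions merely stand in for the language model and the Lean compiler so the sketch runs without either.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class CheckResult:
    ok: bool
    error: str = ""


def refine_until_verified(
    generate: Callable[[str, List[str]], str],
    check: Callable[[str], CheckResult],
    statement: str,
    max_attempts: int = 5,
) -> Optional[str]:
    """Request a candidate proof, verify it, and feed any error
    message back into the next attempt (hypothetical helper)."""
    feedback: List[str] = []
    for _ in range(max_attempts):
        candidate = generate(statement, feedback)
        result = check(candidate)
        if result.ok:
            return candidate           # only verified candidates are returned
        feedback.append(result.error)  # the error becomes context for the retry
    return None                        # attempt budget exhausted, nothing verified


# Toy stand-ins (assumptions): no real LLM or Lean toolchain is called.
def toy_generate(statement: str, feedback: List[str]) -> str:
    # The "model" guesses wrong first and changes its answer once it sees an error.
    if not feedback:
        return "by simp_wrong"
    return "by rfl"


def toy_check(candidate: str) -> CheckResult:
    # The "compiler" accepts exactly one proof script and reports an error otherwise.
    if candidate == "by rfl":
        return CheckResult(ok=True)
    return CheckResult(ok=False, error=f"unknown tactic in: {candidate}")


proof = refine_until_verified(toy_generate, toy_check, "theorem t : 2 + 2 = 4")
print(proof)  # prints "by rfl"
```

The point of the design is that the loop never returns an unverified candidate: a generator that keeps producing rejected proofs simply yields `None` once the attempt budget runs out.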

the-decoder.com

Google Deepmind's AlphaProof Nexus solves decades-old math problems for a few hundred dollars

Google DeepMind's AlphaProof Nexus has autonomously solved nine open Erdős problems, including two that stumped mathematicians for 56 years, for just a few hundred dollars per problem in inference costs. Unlike OpenAI's natural-language approach, the system uses the Lean compiler to verify every proof step automatically. Still, the overall success rate sits at just 2.5 percent.
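The 2.5 percent figure follows directly from the reported Erdős counts. The same arithmetic applied to the OEIS counts is shown for comparison; that second percentage is computed here and is not quoted in the article.

```latex
\[
\frac{9}{353} \approx 2.55\% \quad \text{(Erdős problems, reported as 2.5 percent)},
\qquad
\frac{44}{492} \approx 8.94\% \quad \text{(OEIS conjectures)}
\]
```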

Monday, May 25, 2026
