Google DeepMind’s new AI agent cracks real-world problems better than humans can

AlphaEvolve uses large language models to find new algorithms that outperform the best human-made solutions for data center management, chip design, and more.

Google DeepMind has once again used large language models to discover new solutions to long-standing problems in math and computer science. This time the firm has shown that its approach can not only tackle unsolved theoretical puzzles, but improve a range of important real-world processes as well.

Google DeepMind's new tool, called AlphaEvolve, uses the Gemini 2.0 family of large language models (LLMs) to produce code for a wide range of different tasks. LLMs are known to be hit and miss at coding. The twist here is that AlphaEvolve scores each of Gemini’s suggestions, throwing out the bad and tweaking the good, in an iterative process, until it has produced the best algorithm it can. In many cases, the results are more efficient or more accurate than the best existing (human-written) solutions.

“You can see it as a sort of super coding agent,” says Pushmeet Kohli, a vice president at Google DeepMind who leads its AI for Science teams. “It doesn’t just propose a piece of code or an edit, it actually produces a result that maybe nobody was aware of.”

Google DeepMind’s new AI agent cracks real-world problems better than humans can

Related reading

Google DeepMind’s AI Agent Dreams Up Algorithms Beyond Human Expertise

Google DeepMind claims ‘historic’ AI breakthrough in problem solving

A Deep Learning Alternative Can Help AI Agents Gameplay the Real World

Google DeepMind says its new AI can map the entire planet with unprecedented…

Chinese fintech giant Ant releases powerful AI model to rival DeepSeek, OpenAI

DeepSeek secrets unveiled: engineers reveal science behind Chinese AI model