The Return of Recursion: How 5M-Parameter Models Are Outperforming Frontier LLMs on Reasoning in 2026

The Return of Recursion: How 5M-Parameter Models Are Outperforming Frontier LLMs on...

sabato 23 maggio 2026 New tab

2,694 words~12 min read

TL;DR Summary

Tiny recursive models with 5-7 million parameters are achieving state-of-the-art on deterministic reasoning tasks that frontier LLMs score 0% on — including Sudoku-Extreme, ARC-AGI puzzles, and maze navigation

The key innovation: reasoning in latent space instead of generating "thinking tokens" like Chain-of-Thought — delivering 100x speedups and 75% token reduction

Probabilistic TRM (7M params) achieves 98.75% on Sudoku-Extreme using Gaussian noise to escape local optima, while DeepSeek-R1 scores 0.0%

The Return of Recursion: How 5M-Parameter Models Are Outperforming Frontier LLMs on Reasoning in 2026

The Return of Recursion: How 5M-Parameter Models Are Outperforming Frontier LLMs on Reasoning in 2026

Other newsrooms on this story

Related reading

Thinking to recall: How reasoning unlocks parametric knowledge in LLMs

How test-time scaling unlocks hidden reasoning abilities in small language…

Small Language Models on Edge Devices: How 2.6B Parameters Are Outperforming…

URM shows how small, recurrent models can outperform big LLMs in reasoning…

I read a multi-agent reasoning paper, built the Claude-native version, and…

Recursive Language Models: A new framework for infinite context in LLMs -…

Other newsrooms on this story

Related reading

Thinking to recall: How reasoning unlocks parametric knowledge in LLMs

How test-time scaling unlocks hidden reasoning abilities in small language…

Small Language Models on Edge Devices: How 2.6B Parameters Are Outperforming…

URM shows how small, recurrent models can outperform big LLMs in reasoning…

I read a multi-agent reasoning paper, built the Claude-native version, and…

Recursive Language Models: A new framework for infinite context in LLMs -…