I Built the Claude-Native Version of RecursiveMAS

RecursiveMAS showed that sharing internal reasoning state between agents improves accuracy. But it requires open-weight models. Here is how I built the same system with Claude extended thinking, what the eval found on 200 MATH level 4-5 problems, and why the stat test matters as much as the result.

lunedì 1 giugno 2026 New tab

RecursiveMAS (arXiv 2604.25917) showed that agents sharing internal reasoning state outperform agents that share only final outputs. The average accuracy gain across benchmarks was 8.3 points. The mechanism: each agent passes not just its answer but the latent embeddings from its own reasoning process, and the next agent conditions on both. The paper is a good result.

The catch is access. RecursiveMAS requires open-weight models with hidden states exposed at inference time. That rules out Claude, GPT-4o, and Gemini. I built a Claude-native version using the Anthropic extended thinking API. The core idea transfers: instead of passing latent vectors, pass the full thinking text. The paper calls it internal state sharing; the Claude version calls it thinking-block relay.

The architecture problem

Claude's extended thinking blocks carry an encrypted signature tied to the originating conversation. You cannot pass a signed thinking block into a different agent's messages array. The API rejects it. The workaround: extract the text from the thinking block and inject it as a regular user message.

# Extract thinking text from Agent 1

The architecture problem

# Extract thinking text from Agent 1

I Built the Claude-Native Version of RecursiveMAS

I Built the Claude-Native Version of RecursiveMAS

Other newsrooms on this story

Related reading

I read a multi-agent reasoning paper, built the Claude-native version, and…

DeepMath: A lightweight math reasoning Agent with smolagents

The Return of Recursion: How 5M-Parameter Models Are Outperforming Frontier…

Build Recurrent-Depth Transformers with OpenMythos for MLA, GQA, Sparse MoE,…

Fine-Tuning Small Open-Source LLMs to Outperform Large Closed-Source Models by…

ReasoningBank: Enabling agents to learn from experience

Other newsrooms on this story

Related reading

I read a multi-agent reasoning paper, built the Claude-native version, and…

DeepMath: A lightweight math reasoning Agent with smolagents

The Return of Recursion: How 5M-Parameter Models Are Outperforming Frontier…

Build Recurrent-Depth Transformers with OpenMythos for MLA, GQA, Sparse MoE,…

Fine-Tuning Small Open-Source LLMs to Outperform Large Closed-Source Models by…

ReasoningBank: Enabling agents to learn from experience