Researchers let Claude Code discover AI scaling algorithms that humans probably wouldn't have designed

Researchers from UMD, Google, Meta, and other institutions use AutoTTS to let a coding agent independently discover control algorithms for AI reasoning. The algorithm it found cuts compute by about 70 percent compared to standard self-consistency while matching its accuracy. The whole search cost $40 and took 160 minutes.

domenica 24 maggio 2026 New tab

Instead of writing rules for more efficient AI reasoning themselves, researchers let a coding agent hunt for better control algorithms in a simulated environment. The result beats established methods while burning far less compute.

Test-time scaling (TTS) is meant to make large language models perform better by letting them spend more compute on a response, say, by running several solution paths in parallel or extending chains of thought. Until now, human-written rules almost always dictated when a model kicks off a new solution path, doubles down on a promising one, or kills it.

A research team from UMD, UVA, WUSTL, UNC, Google, and Meta flips that with AutoTTS. Humans don't write the algorithm. Instead, they build the playground where an AI agent figures out algorithms on its own.

The paper argues that many known methods are really just special cases in a shared control space defined by width (how many solution paths run at once) and depth (how far each one goes). So why, the authors ask, do researchers keep plotting paths through this space by hand instead of letting a machine search it?

Simulating the search keeps costs down

Researchers let Claude Code discover AI scaling algorithms that humans probably wouldn't have designed

Researchers let Claude Code discover AI scaling algorithms that humans probably wouldn't have designed

Other newsrooms on this story

Related reading

AutoTTS reduces token usage by 69.5% in LLM reasoning strategies

LLM reasoning, automated: tokens drop 69.5%

[AI] Context Engineering for AI Coding: AGENTS.md, Cursor Rules & RAG

AI optimizer beats Claude Code, Codex by 2.5x

OpenAI cuts inference costs in half with new optimization technique

How Meta Used AI to Map Tribal Knowledge in Large-Scale Data Pipelines

Other newsrooms on this story

Related reading

AutoTTS reduces token usage by 69.5% in LLM reasoning strategies

LLM reasoning, automated: tokens drop 69.5%

[AI] Context Engineering for AI Coding: AGENTS.md, Cursor Rules & RAG

AI optimizer beats Claude Code, Codex by 2.5x

OpenAI cuts inference costs in half with new optimization technique

How Meta Used AI to Map Tribal Knowledge in Large-Scale Data Pipelines