World Action Models give robots the ability to simulate consequences before they move

World Action Models tackle a basic weakness of today's robotics AI: current models learn which movements match which camera images, but they don't understand how the world actually changes as a result. A new survey organizes about a hundred papers into two architectural lines and shows a key edge: these models can learn from everyday videos that contain no robot action labels. That kind of data was nearly useless for traditional robotics AI.

domenica 17 maggio 2026 New tab

Today's robotics AI has a basic weakness: models learn to map camera images directly to movements. But they don't understand how the world actually changes as a result of their actions.

A new survey paper from Fudan University, the Shanghai Innovation Institute, and the National University of Singapore is the first to systematically catalog a class of models designed to close that gap: World Action Models.

The authors map all current World Action Models along two main branches, showing how joint and cascaded architectures have branched out since 2024. | Image: Wang et al.

Robots that simulate their own near future

Existing vision-language-action models mostly learn direct mappings from observations to matching actions. World Action Models go further. They also model how the environment will likely change, then couple that prediction to action generation.

Today's robotics AI has a basic weakness: models learn to map camera images directly to movements. But they don't understand how the world actually changes as a result of their actions.

The authors map all current World Action Models along two main branches, showing how joint and cascaded architectures have branched out since 2024. | Image: Wang et al.

Robots that simulate their own near future

World Action Models give robots the ability to simulate consequences before they move

World Action Models give robots the ability to simulate consequences before they move

Other newsrooms on this story

Related reading

AI's next big leap is models that understand the world.

Tech entrepreneurs seeking the next AI frontier are pivoting from chatbots to…

All the world's a robot-staging ground for tech entrepreneurs building…

Why world models must do more than simulate: Pony.ai CTO

Fei-Fei Li explains world models' roles in robotics and gaming

AI research Archive

Other newsrooms on this story

Related reading

AI's next big leap is models that understand the world.

Tech entrepreneurs seeking the next AI frontier are pivoting from chatbots to…

All the world's a robot-staging ground for tech entrepreneurs building…

Why world models must do more than simulate: Pony.ai CTO

Fei-Fei Li explains world models' roles in robotics and gaming

AI research Archive