Alibaba unveils Qwen-Robot series with three foundation models for embodied AI · TechNode

The Qwen team on Tuesday released a robotics suite featuring three foundation models: Qwen-RobotNav, Qwen-RobotManip, and Qwen-RobotWorld. These three models align language with different types of physical actions.

Qwen-RobotNav extends vision-language capabilities into mobile robotics through controllable observation encoding and tool-based interfaces. The model unifies four key tasks within a single framework: instruction following, goal-directed navigation, target tracking, and autonomous driving.

Qwen-RobotManip standardizes the state-action space and represents end-effector motion as incremental poses in the camera coordinate system. Trained on more than 38,100 hours of fully open-source data, the model supports large-scale learning across multiple robot platforms, enabling a broad range of manipulation capabilities.

Meanwhile, Qwen-RobotWorld serves as a general-purpose world model that connects vision-language understanding with future-state prediction through a natural-language action interface. The model can forecast physically consistent future outcomes across navigation, driving, and manipulation scenarios, allowing a single world model to generalize across diverse embodied AI tasks. [TechNode Reporting]

Alibaba unveils Qwen-Robot series with three foundation models for embodied AI · TechNode

Alibaba unveils Qwen-Robot series with three foundation models for embodied AI · TechNode

Other newsrooms on this story

Related reading

Meet Qwen-RobotSuite: Three Embodied AI Models for VLA Manipulation, Video…

Alibaba builds Qwen-Robot, an operating system for the robot economy

Alibaba eyes physical world with its first suite of AI models for robots

Alibaba's Qwen Team Launches Qwen3.7-Plus, Adding Vision, Deep Reasoning, Tool…

Alibaba unveils Qwen3.5 with visual agentic abilities

Alibaba unveils Qwen3.5 as China’s chatbot race shifts to AI agents

Other newsrooms on this story

Related reading

Meet Qwen-RobotSuite: Three Embodied AI Models for VLA Manipulation, Video…

Alibaba builds Qwen-Robot, an operating system for the robot economy

Alibaba eyes physical world with its first suite of AI models for robots

Alibaba's Qwen Team Launches Qwen3.7-Plus, Adding Vision, Deep Reasoning, Tool…

Alibaba unveils Qwen3.5 with visual agentic abilities

Alibaba unveils Qwen3.5 as China’s chatbot race shifts to AI agents