NVIDIA XR AI is now available in public beta, giving developers a framework for building multimodal AI agents for AR glasses and XR devices.

Your browser does not support the video tag.

AI is moving beyond chatbots and copilots into the physical world. Across laboratories, factories and hospitals, a new generation of AI agents is beginning to work alongside people, helping them understand their environment, access knowledge and take action in real time.

However, building agentic systems that combine models, skills, harnesses, tools and an agentic runtime to help people perform hands-on work is challenging. To operate effectively in dynamic, real-world environments, these agents must do more than generate responses.

Like human workers, they need knowledge, tools and specialized skills to perceive and understand the world through video, audio and sensor data, interpret fast-changing conditions and spatial context, retrieve information from enterprise systems, reason about the next best action and use software tools to complete tasks. All of this must happen with low latency and in a way that supports the user without creating distraction.