The AI industry has spent the past several years learning a critical lesson: better data often matters as much as better models. While advances in large language models have been powered by increasingly sophisticated datasets and data pipelines, robotics has yet to undergo the same transformation. Robotics teams are working with vast quantities of video, sensor data, and demonstrations, but much of the infrastructure needed to process, annotate, and improve that data remains immature.

Macrodata Labs believes that closing that gap could become one of the most important challenges in robotics AI. Macrodata Labs recently emerged from stealth, launching Refiner, an open-source framework and cloud platform for processing robotics datasets.

The company raised $4 million in pre-seed funding in June this year to build infrastructure for the robotics data loop. The round was led by Air Street Capital, with participation from Drysdale Ventures, OPRTRS club, Kima Ventures, YG (Alex Yazdi), >commit, Thomas Wolf, and business angels from some of the world’s leading AI labs and technology companies.

Macrodata Labs builds infrastructure for the robotics data loop. Its first product, Refiner, is an open-source framework and cloud platform for robotics data processing, helping teams turn raw physical-world data into better training datasets.