80TB+ of astronomy for the HDD-poor: crossmatch the Multimodal Universe from your laptop

Back to Articles

TL;DR What exactly is the Multimodal Universe? Why should you care about crossmatching astronomy data? HATS + LSDB ❤️ Hugging Face 🤗 Just give me the code examples! 🗣️ Acknowledgements

TL;DR

The Multimodal Universe (MMU) pools together 80TB1 plus of data from over 30 astronomical surveys into one place. Crossmatching (linking observations of the same object across surveys) is its killer feature, but until now it required downloading hefty chunks of data to local disk. We got tired of needing a cluster just to run a crossmatch, so we gathered in the UniverseTBD and Hugging Science Discord servers to fix that. We've converted the MMU to the parquet-based HATS format so that you can use the LSDB and Hugging Face ecosystems to crossmatch from a laptop.

The datasets are in this Hugging Face collection. No bulk downloads are necessary, and 4GB of RAM is enough even at Gaia scale. Here it is in action:

Back to Articles

TL;DR What exactly is the Multimodal Universe? Why should you care about crossmatching astronomy data? HATS + LSDB ❤️ Hugging Face 🤗 Just give me the code examples! 🗣️ Acknowledgements

TL;DR

The datasets are in this Hugging Face collection. No bulk downloads are necessary, and 4GB of RAM is enough even at Gaia scale. Here it is in action:

80TB+ of astronomy for the HDD-poor: crossmatch the Multimodal Universe from your laptop

80TB+ of astronomy for the HDD-poor: crossmatch the Multimodal Universe from your laptop

Other newsrooms on this story

Related reading

Making Sense of the Early Universe

Astrophysics & AI with Python: Unlocking the Universe with Astroquery

Machine learning for alien climates: Introducing the ThousandWorlds benchmark

‘The greatest cosmic movie ever made’: Historic telescope kicks off an…

Tuesday Telescope: Webb and Hubble team up to reveal spectacular star clusters

Search for Hidden Cosmic Companions in Sun's Backyard - NASA Science

Other newsrooms on this story

Related reading

Making Sense of the Early Universe

Astrophysics & AI with Python: Unlocking the Universe with Astroquery

Machine learning for alien climates: Introducing the ThousandWorlds benchmark

‘The greatest cosmic movie ever made’: Historic telescope kicks off an…

Tuesday Telescope: Webb and Hubble team up to reveal spectacular star clusters

Search for Hidden Cosmic Companions in Sun's Backyard - NASA Science