Mozilla Data Collective seeks to build AI’s data economy around trust

Generative artificial intelligence has a data problem.

For years, the typical approach to building gen AI models has been to gather as much data as possible by scraping vast swaths of the internet, training at an enormous scale and dealing with the consequences later. The result has been increasingly powerful technology, but also growing concerns about bias, consent, ownership and the uneven distribution of value created from the world’s information.

Mozilla Data Collective was created to fill the gaps in this model.

The organization, which launched last November, is attempting to create a different kind of marketplace for AI data built around community ownership, consent and what founder and Chief Executive E.M. Lewis-Jong calls “fair value exchange.”