When I first started learning about Big Data, I used to think it was mainly about storing massive amounts of information.

But after working around real enterprise systems and large-scale pipelines, I realized the real challenge is not simply the size of the data.

It’s everything that comes with it.

As systems grow, data starts arriving from everywhere:

APIs