Hello, I'm Shrijith Venkatramana. I'm building git-lrc, an AI code reviewer that runs on every commit. Star Us to help devs discover the project. Do give it a try and share your feedback for improving the product.

Everyone knows modern AI runs on data. Fewer people realize that today's most capable AI systems increasingly learn from data they created themselves.

For years, the common belief was simple: collect more human-written text, train bigger models, and intelligence would emerge.

That worked—until it didn't.

By 2022, frontier AI labs had consumed a significant fraction of the publicly available high-quality text on the internet. The next leap wasn't going to come from scraping another few billion web pages.