How I built a Retrieval-Augmented Generation system for code review and resume generation — and why avoiding hosted LLMs was the right call.

One of the first decisions I made when designing InternFlow's AI layer was this: I didn't want to depend on OpenAI or any hosted LLM API for the core functionality.

This wasn't about cost alone — though that matters for a student project. It was about understanding how AI products actually work underneath the marketing.

Why not just call GPT?

The core problem with sending code to a hosted LLM without context: it hallucinates. It'll write confident-sounding review comments about patterns it hasn't seen. For a resume generator, it'll invent metrics. None of that is useful for a student trying to land an internship.