Hello, I'm Shrijith Venkatramana. I'm building git-lrc, an AI code reviewer that runs on every commit. Star Us to help devs discover the project. Do give it a try and share your feedback for improving the product.
Large Language Models keep getting smarter.
But there's a problem: users don't experience intelligence directly. They experience latency.
If a model takes 30 seconds to write an answer instead of 3 seconds, most users won't care that it scored higher on some benchmark.
This creates an interesting engineering challenge:






