Hey everyone,

I wanted to share a little side project I cooked up over the last week. Long story short, I only started diving into the LLM world in February, and honestly, it's been a wild ride. I started with LM Studio, but as many of you know, by the time you get comfortable with one tool, a new "insane" feature post drops on r/LocalLLaMA and the software is already playing catch-up. I eventually settled on plain llama.cpp because it seems to be the gold standard, but I kept hitting a wall: the update cycle is so fast, and manually updating it feels a bit clunky since there's no built-in updater, and that goes double for those juicy new beta versions that get released so often.

So, about a week ago, while watching The Wire (ADHD at its finest), I had an idea: why isn't there an nvm, but for llama.cpp?

Coming from the Node.js world, I missed the simplicity of nvm, so I wanted something that would let me install, uninstall, swap and manage versions on the fly without a headache. So, alongside Claude and my local Qwen 35B (mostly Qwen), I decided to "vibe code" it into existence (I can't believe I'm using this term). The models suggested Go (since it's great for CLI tools), and even though I can't write a single line of Go myself, we made it work.