I’ve been playing around with local LLM agents a lot lately.. Mostly smaller models, MCP tools, Cline/Roo-style workflows, and home lab setups.
Not the “infinite context, infinite budget” world.
More like:
“Can this 4B/9B model actually use the web without getting buried alive by garbage context?”
That was the problem that kept annoying me.







