I’ve been playing around with local LLM agents a lot lately. Mostly smaller models, MCP tools, Cline/Roo-style workflows, and home lab setups.

Not the “infinite context, infinite budget” world.

More like:

“Can this 4B/9B model actually use the web without getting buried alive by garbage context?”

That was the problem that kept annoying me.