Stop chasing parameter counts. Build the toolbelt instead. — What I learned building Tlamatini (Open Source Destktop App).

For the last few months I've been building Tlamatini, an open-source local-first AI developer assistant. Along the way I kept bumping into the same assumption — both in articles and in my own head — that to build something useful, you need the biggest model you can afford. GPT-4. Claude Opus. Llama 70B at minimum.

Then I started actually shipping with smaller local models, and I learned something that flipped my thinking.

The real lesson

A 20B-parameter LLM, given the right tools, the right agents, and skills fine-tuned to your operating procedures, is good enough to power most of your company's real workflows.

Parameter count is not the bottleneck. The bottleneck is whether the model can act — and that's a tools problem, not a parameters problem.

Then I started actually shipping with smaller local models, and I learned something that flipped my thinking.

The real lesson

A 20B-parameter LLM, given the right tools, the right agents, and skills fine-tuned to your operating procedures, is good enough to power most of your company's real workflows.

Parameter count is not the bottleneck. The bottleneck is whether the model can act — and that's a tools problem, not a parameters problem.

Stop chasing parameter counts. Build the toolbelt instead. — What I learned building Tlamatini (Open Source Destktop App).

Stop chasing parameter counts. Build the toolbelt instead. — What I learned building Tlamatini (Open Source Destktop App).

Related reading

Anti Refusal LLM Service

Open source vs closed AI: real-world tradeoffs

Why I chose Tauri over Electron for my local AI dev tool (80MB vs 600MB RAM)

Build a Socratic Study Buddy with Gemma 4: A Beginner’s Guide to Running AI…

Stop Chasing Shiny Tools: A Minimalist AI Stack That Actually Makes You Money

I built an MCP server that gives AI persistent memory of your SQL database

Related reading

Anti Refusal LLM Service

Open source vs closed AI: real-world tradeoffs

Why I chose Tauri over Electron for my local AI dev tool (80MB vs 600MB RAM)

Build a Socratic Study Buddy with Gemma 4: A Beginner’s Guide to Running AI…

Stop Chasing Shiny Tools: A Minimalist AI Stack That Actually Makes You Money

I built an MCP server that gives AI persistent memory of your SQL database