You have seen this loop before.
An agent starts a “simple” task, say scrape listings, refactor a repo, research a market, or whatever. It fails, it retries, it re-reads context, it apologizes and tries all over again. Twenty minutes in and the dashboard shows six figures of tokens and zero useful outputs or deliverables.
The model did not misbehave on purpose. The orchestrator never had a hard budget gate with an ROI in mind.
Skillware v0.4.0 ships a new skill for exactly that gap: monitoring/token_limiter. It lets you monitor and limit any agent’s token budget in real time — Gemini, Claude, OpenAI, DeepSeek, Ollama, custom Python loops, you name it. Same skill, same JSON, any runtime.
What Skillware is in a nutshell






