If you’ve built AI agents recently, you know the standard playbook: you take a user's prompt, feed it into GPT-4 or Claude alongside a massive JSON schema of available tools, and ask the LLM to figure out which tool to use.

This works for prototypes. But in an Enterprise IT environment, it’s a disaster.

Using an LLM for Intent Routing takes anywhere from 800ms to 2,000ms. It burns API tokens on every single "hello" or "my laptop is broken" message. Worse, LLMs hallucinate—if a user asks to "Provision an Azure SQL database," an overly helpful LLM might hallucinate a non-existent tool call and crash your pipeline.

I wanted to build an autonomous IT Helpdesk agent that was deterministic, instant, and practically free to run. That led me to build AegisDesk, an open-source, multi-agent IT platform powered by LangGraph, SQLite, and Zero-Token Semantic Routing.

The Architecture: Zero-Token Routing