I measured MCP vs a CLI for agent search. The MCP used 17x more tokens per call.

I ran the same Google search through SerpApi's official serpapi-mcp server and through serp, the small open-source (MIT) CLI I built for the same job. Before I had searched anything, the MCP had already put 771 tokens into the model's context. The CLI put zero. When I did search, the MCP returned 6,047 tokens and the CLI returned 351. Same query, same serpapi library underneath, same machine.

That standing cost, paid on every turn whether you search or not, is the number nobody puts in the demo. So I wrote it all down.

TL;DR: for stateless search inside an agent loop, a CLI costs roughly 0 standing tokens against ~771 per turn for an MCP tool, and ~351 per call against ~6,047. The compaction logic on both sides is identical; the CLI just trims to the fields you ask for and stays out of context when idle. Pick the transport that fits the call.
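To see how those figures compound over a session, here is a rough sketch of the arithmetic. The four token counts are the measurements above. The session shape (20 turns, 3 searches) and the `session_tokens` helper are hypothetical, picked only for illustration. It also simplifies: each search result is counted once, and provider-side prompt caching or pricing is ignored.

```python
# Token counts measured in this post.
MCP_STANDING = 771   # tokens the MCP tool adds to context on every turn
MCP_PER_CALL = 6047  # tokens returned by one MCP search
CLI_STANDING = 0     # the CLI adds nothing to context while idle
CLI_PER_CALL = 351   # tokens returned by one CLI search


def session_tokens(standing, per_call, turns, searches):
    """Standing cost on every turn plus the result cost of each search.

    Simplified model: each search result is counted once.
    """
    return standing * turns + per_call * searches


# Hypothetical session: 20 turns, 3 of which run a search.
turns, searches = 20, 3
mcp_total = session_tokens(MCP_STANDING, MCP_PER_CALL, turns, searches)
cli_total = session_tokens(CLI_STANDING, CLI_PER_CALL, turns, searches)
per_call_ratio = MCP_PER_CALL / CLI_PER_CALL

print(f"per-call ratio: {per_call_ratio:.1f}x")  # 17.2x
print(f"MCP session: {mcp_total} tokens")        # 33561
print(f"CLI session: {cli_total} tokens")        # 1053
```

In this made-up session the standing cost alone (771 x 20 = 15,420 tokens) is nearly as large as the three MCP search results combined, which is why it is worth counting separately from the per-call number.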

Standing cost, paid every turn

SerpApi MCP
