Context engineering matters for two reasons: reliability and cost. If your agent's context window is full of noise, reasoning quality drops and you're paying for tokens that aren't helping anything. And one of the biggest sources of that noise? Tool results.
HTTP requests, file readers, API clients, and database queries can return really context heavy results. When these verbose tool results enter the conversation, they can crowd out other context and burn up tokens quickly.
You need a way to truncate tool results and only bring in the full context of that tool result when needed. Luckily, Strands Agents just released something that does this for you automatically.
Offloading Noisy Tool Results Automatically
Strands Agents just shipped the ContextOffloader plugin. It's available in both the TypeScript and Python SDKs. It prevents large tool results from consuming your agent's context window automatically. When a tool returns a result that exceeds a configurable token threshold, the plugin stores each content block individually in an external storage backend and replaces it in the conversation with a truncated preview plus per-block references. Each offloaded result includes inline guidance telling the agent to use its available tools to selectively access the data it needs.






