5 Things Broke When I Shipped a RAG + MCP Agent to Production. | Towards AI

Author(s): Sudip P. Originally published on Towards AI. 5 Things Broke When I Shipped a RAG + MCP Agent to Production.Diagram-1: RAG vs MCP agent architectu ...

martedì 26 maggio 2026 New tab

2,150 words~10 min read

Author(s): Sudip P.

Originally published on Towards AI.

Diagram-1: RAG vs MCP agent architecture: a small LLM router classifies each user query as either a Knowledge request (hybrid search → cross-encoder rerank) or an Action request (validate input → tool call). Both paths converge at a single frontier model for synthesis, then pass through eval and logging before returning a response.

Read this article for free: link

TL;DR (because you’re busy)

5 Things Broke When I Shipped a RAG + MCP Agent to Production. | Towards AI

5 Things Broke When I Shipped a RAG + MCP Agent to Production. | Towards AI

Other newsrooms on this story

Related reading

MCP + RAG: Why I Stopped Building Complex RAG Systems After MCP Changed…

Six MCP tools, one trade: walking an AI agent from RFQ to refund

My RAG evaluation pipeline returned nan — here's what that taught me about…

MCP Server Design: 3 Principles We Learned in Production

How to Build an MCP Style Routed AI Agent System with Dynamic Tool Exposure…

Optimizing RAG Pipelines, Migrating AI Agents, and LLM-Powered Troubleshooting

Other newsrooms on this story

Related reading

MCP + RAG: Why I Stopped Building Complex RAG Systems After MCP Changed…

Six MCP tools, one trade: walking an AI agent from RFQ to refund

My RAG evaluation pipeline returned nan — here's what that taught me about…

MCP Server Design: 3 Principles We Learned in Production

How to Build an MCP Style Routed AI Agent System with Dynamic Tool Exposure…

Optimizing RAG Pipelines, Migrating AI Agents, and LLM-Powered Troubleshooting