Three Design Decisions That Shaped the Enterprise RAG Retrieval Pipeline

Enterprise RAG — A practitioner's build log | Post 3 of 6

A retrieval pipeline has more design surface than it appears. The technology choices — vector search, LLM provider, storage engine — get most of the attention. The structural choices — where filtering happens, how evaluation is wired, what the dashboard connects to — determine whether the system actually works correctly in a production environment.

This post documents three structural decisions I made in Enterprise RAG, the constraint that drove each one, and the cost I accepted.

Decision 1: Lexical retrieval before semantic — sequencing, not a permanent choice

The default retrieval implementation uses token cosine similarity against a local SQLite chunk store (RAG_RETRIEVAL_PROVIDER=local). Not vector embeddings. Not a managed search index. Lexical scoring.

Enterprise RAG — A practitioner's build log | Post 3 of 6

This post documents three structural decisions I made in Enterprise RAG, the constraint that drove each one, and the cost I accepted.

Decision 1: Lexical retrieval before semantic — sequencing, not a permanent choice

Three Design Decisions That Shaped the Enterprise RAG Retrieval Pipeline

Three Design Decisions That Shaped the Enterprise RAG Retrieval Pipeline

Related reading

Building a RAG System from Scratch — Design Decisions Explained

What Enterprise RAG Is Ready For Today and What Production Deployment Actually…

The Access Control Gap That Makes Most Enterprise RAG Systems Dangerous

Four Metrics That Actually Tell You Whether Your Enterprise RAG Is Working

Build a RAG Pipeline From Scratch (Production Patterns That Actually Matter)

RAG vs. Agentic RAG vs. Graph RAG: Which One Actually Fits Your Use Case?

Related reading

Building a RAG System from Scratch — Design Decisions Explained

What Enterprise RAG Is Ready For Today and What Production Deployment Actually…

The Access Control Gap That Makes Most Enterprise RAG Systems Dangerous

Four Metrics That Actually Tell You Whether Your Enterprise RAG Is Working

Build a RAG Pipeline From Scratch (Production Patterns That Actually Matter)

RAG vs. Agentic RAG vs. Graph RAG: Which One Actually Fits Your Use Case?