# Day 5 of learning AI Engineering: built a small RAG app over a PDF

I built a small RAG (Retrieval Augmented Generation) project where a user can ask questions about a PDF, and the LLM answers from that PDF and cites the page number to look at. The stack is LangChain, OpenAI embeddings, and Qdrant running in Docker.

A small note before we start: this same pipeline is what powers web apps like the "AI Tutor" in Educative or an "AI web page builder". The main thing that changes between those products and my PDF Q&A is the data source. That is the key idea to take away.

## What RAG is, in one line

Take a document → break it into small chunks → turn each chunk into a vector (a list of numbers) → store those vectors in a database. Later, when the user asks a question, turn the question into a vector too, find the closest chunks, and feed them to an LLM as context.
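
To make that flow concrete, here is a minimal sketch in plain Python. It is not the LangChain, OpenAI, and Qdrant code from the project: `embed` is a toy word-count stand-in for a real embedding model, a Python list stands in for the vector database, and every function name here is made up for illustration. The final prompt is what would be sent to an LLM; the sketch stops before that call.

```python
import math
import re
from collections import Counter


def chunk_pages(pages, chunk_size=200):
    """Split each page into fixed-size character chunks, remembering the page number.

    Naive on purpose: a cut can land mid-word. Real splitters are smarter.
    """
    chunks = []
    for page_number, text in enumerate(pages, start=1):
        for start in range(0, len(text), chunk_size):
            chunks.append({"page": page_number, "text": text[start:start + chunk_size]})
    return chunks


def embed(text):
    """Toy 'embedding' (assumption): a sparse word-count vector, not a learned model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[word] * b[word] for word in a if word in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_index(pages):
    """Indexing step: chunk the pages, embed each chunk, store vector + metadata."""
    return [{"vector": embed(chunk["text"]), **chunk} for chunk in chunk_pages(pages)]


def retrieve(index, question, k=2):
    """Query step: embed the question and return the k most similar chunks."""
    question_vector = embed(question)
    ranked = sorted(index, key=lambda item: cosine(question_vector, item["vector"]), reverse=True)
    return ranked[:k]


def build_prompt(question, hits):
    """Put the retrieved chunks, tagged with page numbers, in front of the question."""
    context = "\n\n".join(f"[page {hit['page']}] {hit['text']}" for hit in hits)
    return (
        "Answer using only this context and cite the page number.\n\n"
        f"{context}\n\nQuestion: {question}"
    )


# Two made-up "pages" standing in for the text extracted from a PDF.
pages = [
    "Qdrant is a vector database that stores embeddings.",
    "Docker runs containers from images on your machine.",
]
index = build_index(pages)
question = "What is a vector database?"
hits = retrieve(index, question, k=1)
prompt = build_prompt(question, hits)
print(prompt)
```

Keeping the page number next to each chunk at indexing time is what lets the answer point back to a page later. Swapping the toy pieces for a real embedding model and a vector store changes the quality of the matches, not the shape of the pipeline.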

## INDEXING (run once)
