Why I Stopped Building My Own Document Q&A from Scratch

Two months ago, I was knee-deep in a project that sounded simple: build a system that could answer questions from our company’s internal documentation. We had hundreds of PDFs, Confluence pages, and READMEs. The goal was to let junior developers ask natural-language questions and get accurate answers instantly.

I thought, “How hard can it be? I’ll just fine-tune a small LLM on our documents.”

Spoiler: it was that hard, and then some.

The Dead End: Fine-Tuning a Model

I spent two weeks collecting, cleaning, and chunking our documentation. I wrote a Hugging Face training script, rented a GPU, and fine-tuned a 7B-parameter model. The result? A model that could recite our API docs verbatim but couldn’t answer a question like “Why does our auth flow fail for expired tokens?” without hallucinating.
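
For readers who haven't done the chunking step before, here is a minimal sketch of what it can look like. It assumes word-based windows with a fixed overlap; the function name chunk_text and the default sizes are illustrative choices, not the script from this project.

```python
def chunk_text(text: str, chunk_size: int = 200, overlap: int = 40) -> list[str]:
    """Split text into overlapping word windows (hypothetical helper).

    chunk_size and overlap are counted in whitespace-separated words,
    which is an assumption; real pipelines often count tokens instead.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")

    words = text.split()
    step = chunk_size - overlap  # how far each window advances
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        # Stop once a window has reached the end of the document.
        if start + chunk_size >= len(words):
            break
    return chunks


if __name__ == "__main__":
    sample = " ".join(f"w{i}" for i in range(10))
    for chunk in chunk_text(sample, chunk_size=4, overlap=1):
        print(chunk)
```

The overlap means a sentence that straddles a boundary still appears whole in at least one chunk, as long as it is shorter than the overlap.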

Related reading

I Built a Q&A Bot for My Docs and Almost Gave Up (Here's What Worked)

How I Built a Q&A Bot for My Documentation (and What I Learned)

How I stopped dumping PDFs and started chatting with my documentation

How I Finally Tamed Long Document Analysis with LLMs (It Wasn't Simple Chunking)

Developer Documentation Platforms in 2026: GitBook, Mintlify, ReadMe,…

How to Build a RAG System with Your Own Documents in 7 Simple Steps
