Making the Context Across 46 Repositories Semantically Searchable for AI (Part 2)

The biggest issue Part 1 left open was the entry-point problem: AI couldn't reach the 46-repo codebase by natural-language query. This post covers how I solved it by reusing the pattern proven in db-graph (1,133-table semantic search), then layering minimal annotations only around boundary nodes. It also covers the separate-branch operation that keeps engineers' daily workflow untouched, the SLO that protects the joins between three graphs, the SAME_ENTITY normalization, and the April–May trial-and-error timeline traced through real commits.

Tuesday, June 30, 2026

Hi, I'm Ryan, CTO at airCloset.

In Part 1, I wrote about unifying 46 repositories of production code into a single knowledge graph via static analysis. The graph itself got built, but I closed the post with four open issues: no semantic search, node explosion, having to open the file to actually know what a function does, and the cost of writing a new parser every time a new boundary pattern showed up.

Part 2 is about how I solved the first one: the entry-point problem (no semantic search). The other three are left exactly as Part 1 described them. I'll come back to them at the end, together with the new issues that surfaced once the entry-point problem was out of the way.

The reason to start with the entry-point problem is simple: if the graph exists but the only way to reach it is grep, the model ends up inferring anyway. The whole point — "give the model verified facts, not inference" — falls apart. So the entry-point problem had to be solved before the others.

The Hint Was in db-graph
