Why I'm building Hyphae: provenance over prediction (and the 3-line baseline that tied it)

A few months ago I set out to build a cognitive substrate without a large language model in the answering path. I had a thesis I liked, a Rust workspace, and a lot of conviction.

Then I wrote a three-line baseline that tied it on every metric I cared about.

This is the story of why that was the best thing that happened to the project — and why I'm still building it, just pointed at a sharper target.

The problem I actually care about

When a language model answers a grounded question, it paraphrases its sources. That paraphrase is fluent, often correct, and — this is the part that bothers me — impossible to bind back to its source byte-for-byte. You can cite a document. You cannot prove, after the fact, that the words in the answer are the words that were stored, unaltered, at a known position.

Why I'm building Hyphae: provenance over prediction (and the 3-line baseline that tied it)

Related reading

I Pre-Registered a Hypothesis. 600 API Calls Later, the Data Killed It.

I spent ten days forcing tiny local models to write real code. Here's what…

AI as Cognitive Infrastructure

I built an AI that predicts its own collapse

Building a Brain in Pure Python

When Four Memory Systems Hit the Same Wall