TL;DRAI

Three LLMs (Qwen, Gemma) prove concepts are single directions in activation space, not regions—confirmed causally. For tech leaders: enables clean steering via linear projection; concepts are rank-1 with implications for interpretability and design.

TL;DR

I've been running an "fMRI for LLMs" — capturing the full internal activations of dense open models (Qwen2.5-7B, Gemma-2-9B, Gemma-4-12B) and applying neuroscience methods to map how meaning is organized. The headline result, confirmed causally and across all three models: a concept is not stored in a region of neurons — it is a single direction in activation space.

1. Meaning lives in a direction, not a region

In the brain, categories live in localized regions (faces → fusiform face area). LLMs are the opposite.

Distributed, superposed code. A 10-way category linear probe decodes far above chance (Gemma-2 0.97, Qwen 0.80), yet the "most selective" units do not replicate across two random halves of the stimuli (overlap ≈ 0.00–0.05). There is no findable "animal neuron."

dev.to

I ran an fMRI on LLMs: a concept is a direction, not a region

TL;DR I've been running an "fMRI for LLMs" — capturing the full internal activations of...

domenica 7 giugno 2026 New tab

TL;DRAI

610 words~3 min read

TL;DR

1. Meaning lives in a direction, not a region

In the brain, categories live in localized regions (faces → fusiform face area). LLMs are the opposite.

I ran an fMRI on LLMs: a concept is a direction, not a region

I ran an fMRI on LLMs: a concept is a direction, not a region

Other newsrooms on this story

Related reading

Notes: Memory, Context, and Large Language Models (LLMs)

What do LLMs think when you don't tell them what to think about?

Turning brain prediction models into testable explanations

How LLMs Actually Work: A Developer's Mental Model

VLX-Seek: Improving VLM Fine-Grained Perception via Region Reference Instead of…

LLMs generate ‘fluent nonsense’ when reasoning outside their training zone

Other newsrooms on this story

Related reading

Notes: Memory, Context, and Large Language Models (LLMs)

What do LLMs think when you don't tell them what to think about?

Turning brain prediction models into testable explanations

How LLMs Actually Work: A Developer's Mental Model

VLX-Seek: Improving VLM Fine-Grained Perception via Region Reference Instead of…

LLMs generate ‘fluent nonsense’ when reasoning outside their training zone