A startup most people have never heard of just walked into Nvidia’s house and claimed it can do inference better. D-Matrix, a Silicon Valley AI hardware company founded in 2019, says its Corsair inference accelerator platform runs AI workloads up to 10 times faster than standalone Nvidia GPUs while consuming up to five times less energy.

The kicker: the Corsair platform entered volume production in June 2026, meaning this isn’t vaporware on conference slides. The hardware is shipping.

What the Corsair actually does

D-Matrix is attacking what engineers call the “memory wall.” The biggest slowdown in inference isn’t computation; it’s moving data around. A conventional chip can spend more time fetching model data from memory than it does actually doing math.
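
To put rough numbers on the memory wall, here is a back-of-the-envelope sketch in Python. Every figure in it is a hypothetical round number chosen for illustration, not a spec for Corsair or for any Nvidia GPU, and it assumes the simplest case: generating one token for one user, where every model weight is read from memory once and used for one multiply and one add.

```python
def time_split(params, bytes_per_param, flops_per_sec, bytes_per_sec):
    """Return (compute_seconds, memory_seconds) for generating one token.

    Assumes batch size 1: each weight is fetched once and used for
    one multiply and one add (2 floating-point operations).
    """
    flops = 2 * params                      # math work per token
    bytes_moved = params * bytes_per_param  # weight traffic per token
    return flops / flops_per_sec, bytes_moved / bytes_per_sec


# Hypothetical round numbers, for illustration only.
compute_s, memory_s = time_split(
    params=8e9,           # an 8-billion-parameter model
    bytes_per_param=2,    # 16-bit weights
    flops_per_sec=1e15,   # assumed peak math throughput
    bytes_per_sec=3e12,   # assumed memory bandwidth
)

print(f"math time per token:   {compute_s * 1e3:.3f} ms")
print(f"memory time per token: {memory_s * 1e3:.3f} ms")
print(f"memory / math ratio:   {memory_s / compute_s:.0f}x")
```

Under these assumptions, fetching the weights takes a few hundred times longer than the math itself. Real systems narrow that gap by batching many requests together, but the sketch shows why data movement, not arithmetic, tends to be the limit.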

The Corsair platform tackles this with an in-memory computing architecture. Instead of shuttling data back and forth between processors and memory, the computation happens where the data already lives.