In my last post, I laid out the core inequality of Speculative Decoding:

a > 1 + α + β

Acceptance length a must exceed 1 plus the draft/target compute ratio α plus verification overhead β. If it does, SD wins. If it doesn't, SD loses.
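
To make the inequality concrete, here is a minimal Python sketch of the check. The function names `break_even` and `sd_wins` are hypothetical, and the numbers in the usage lines are illustrative placeholders, not measurements from my test.

```python
def break_even(alpha: float, beta: float) -> float:
    """Smallest acceptance length SD must beat: 1 + alpha + beta."""
    return 1 + alpha + beta


def sd_wins(a: float, alpha: float, beta: float) -> bool:
    """True when acceptance length a strictly exceeds the break-even point."""
    return a > break_even(alpha, beta)


# Illustrative values only (assumed, not measured):
# alpha = draft/target compute ratio, beta = verification overhead.
print(break_even(alpha=0.25, beta=0.25))
print(sd_wins(a=2.5, alpha=0.25, beta=0.25))
print(sd_wins(a=1.4, alpha=0.25, beta=0.25))
```

With these placeholder values the break-even acceptance length is 1.5, so an acceptance length of 2.5 clears it and 1.4 does not.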

That was theory. This post is the practice.

I ran a real A/B test on my machine. The results were worse than I expected — and more interesting.