In my last post, I laid out the core inequality of Speculative Decoding:
a > 1 + α + β
Acceptance length a must exceed 1 plus the draft/target compute ratio α plus verification overhead β. If it does, SD wins. If it doesn't, SD loses.
That was theory. This post is the practice.
I ran a real A/B test on my machine. The results were worse than I expected — and more interesting.












