DSpark can make decoding faster, but acceptance quality still determines how much speed the system actually realizes.

DeepSeek releases DSpark, an open-source speculative decoding framework accelerating DeepSeek-V4 per-user generation 57–85% over MTP-1

DeepSeek's new DSpark framework delivers 60% to 85% faster inference speeds for its V4 models through speculative decoding, with throughput gains up to