Meta and Stanford introduce BLT Diffusion, BLT-S, and BLT-DV to cut byte-level inference memory bandwidth by over 50%.