Achieving 3X speedups with diffusion-style speculative decoding

(developers.googleblog.com)

3 points | by xnx 11 hours ago ago

No comments yet.