Speculative Speculative Decoding: Really, Really Fast LLM Inference

(github.com)

1 points | by fizzbuzz07 7 hours ago ago

No comments yet.