Autoregressive or Diffusion Language Models, Why Choose?

(arxiv.org)

5 points | by mimida a day ago ago

1 comments