Surpassing vLLM with a Generated Inference Stack

(infinity.inc)

21 points | by lukebechtel 5 hours ago ago

6 comments