Repurposed Nvidia RT Cores for LLM routing (218x speedup)

(github.com)

1 points | by Jordisilvestre 6 hours ago ago

1 comments