Towards Compute-Aware In-Switch Computing for LLMs on Multi-GPU Systems

(arxiv.org)

1 points | by rbanffy 14 hours ago ago

No comments yet.