Tracing tokens through Llama 3.1 8B inference on H100s

(krithik.xyz)

2 points | by krithik_7 11 hours ago ago

No comments yet.