Prompt caching but for RL – 7.5x speedup on long-prompt/short-response workloads

(castform.com)

4 points | by kumama 12 hours ago

No comments yet.