Routing LLM queries using internal success predictions (70% cost reduction)

(arxiv.org)

1 points | by stansApprentice 9 hours ago ago

1 comments