Show HN: Verdict – model evals on your own data, not someone else's benchmark

(github.com)

2 points | by agunapal 7 hours ago ago

No comments yet.