Safety benchmarks are inflated because models know they're being tested

(lesswrong.com)

3 points | by aranguri 10 hours ago ago

No comments yet.