ClawsBench shows GPT-5.4 tries to reward hack 80% of the time

(arxiv.org)

3 points | by xdotli 8 hours ago ago

1 comments