15× vs. ~1.37×: Recalculating GPT-5.3-Codex-Spark on SWE-Bench Pro

(twitter.com)

2 points | by nvanlandschoot 11 hours ago ago

3 comments