Corral: Measuring how LLM-based AI scientists reason, not just what they produce

(lamalab-org.github.io)

2 points | by kjappelbaum 7 hours ago ago

1 comments