I beat Grok 4 on ARC-AGI-2 using a CPU-only symbolic engine (18.1% score)

(github.com)

5 points | by kofdai 9 hours ago

2 comments

jaen 41 minutes ago
You used LLMs to generate code to beat ARC-AGI "without using LLMs"... Uhh, okay then.
LLMs generating code to solve ARC-AGI is literally what they do these days, so as far as I can see, this entire exercise is basically equivalent to running "Deep Think"-style test-time-compute models and committing their output to GitHub?
What exactly was the novel, un-LLMable human input here?
kofdai 9 hours ago
[dead]