You used LLMs to generate code to beat ARC-AGI "without using LLMs"... Uhh, okay then.
LLMs generating code to solve ARC-AGI is literally what they do these days, so as far as I see, basically this entire exercise is equivalent to just running "Deep Think" test-time compute type models and committing their output to Github?
What exactly was the novel, un-LLMable human input here?
You used LLMs to generate code to beat ARC-AGI "without using LLMs"... Uhh, okay then.
LLMs generating code to solve ARC-AGI is literally what they do these days, so as far as I see, basically this entire exercise is equivalent to just running "Deep Think" test-time compute type models and committing their output to Github?
What exactly was the novel, un-LLMable human input here?
[dead]