3 points | by niraj-agarwal 10 hours ago
1 comment
I generally think this is a valid point, but there is some complexity in how you generate inputs for benchmark problems after the fact. You've essentially given it the answer, based on the results of a human-AI interaction.