HN
New
Show
Ask
Jobs
Built with Qwik
Agent-evals: Metacognitive scoring and boundary testing for LLM coding agents
(thinkwright.ai)
2 points | by
oceanwaves
12 hours ago ago
1 comments
12 hours ago
[deleted]
1 comments