I ran 3,360 safety tests on GPT-4o, Claude, Grok, DeepSeek, Gemini

(github.com)

4 points | by aestrad7 5 hours ago ago

7 comments