AMS – Detect unsafe LLMs in 30 seconds via activation analysis

(github.com)

1 points | by gmessenger 10 hours ago ago

1 comments