Malware devs added nuclear and bioweapons text to trigger LLM safety refusals

(twitter.com)

3 points | by porridgeraisin 9 hours ago ago

1 comments