Why Current AI Guardrails Train Models to Fake Alignment

(kellyasay.substack.com)

3 points | by kellya 8 hours ago ago

1 comments