Alignment whack-a-mole: Finetuning activates recall of copyrighted books in LLMs

(github.com)

145 points | by reconnecting 8 hours ago ago

106 comments