Alignment whack-a-mole: Finetuning activates recall of copyrighted books in LLMs

Article URL: https://github.com/cauchy221/Alignment-Whack-a-Mole-Code

Comments URL: https://news.ycombinator.com/item?id=47957627

Points: 79

Comments: 42

Read original

Latest Posts