A new research paper and its accompanying code repository report that fine-tuning large language models can inadvertently cause them to recall copyrighted material verbatim. The study, titled "Alignment Whack-a-Mole," shows that models fine-tuned on specific texts can reproduce large portions of those texts word for word. The researchers provide a pipeline for preprocessing books, fine-tuning models through APIs for OpenAI, Google (Gemini), and DeepSeek (via Tinker), and evaluating how much of the training text the resulting models have memorized.
Impact: Fine-tuning LLMs may inadvertently expose copyrighted material, so training data needs careful curation and fine-tuned models need to be evaluated for memorization.
Ranking rationale: The cluster covers a research paper and its associated code release, which detail a novel finding about LLM behavior.
Read on Mastodon (mastodon.social)
AI-generated summary · Google Gemini · from 3 sources.