A new study on arXiv details a method for extracting memorized book content from open-weight language models. Researchers found that while most models do not extensively memorize most books, there are significant exceptions, with Llama 3.1 70B fully memorizing some titles like 'Harry Potter and the Sorcerer's Stone'. This extensive memorization allows for deterministic extraction of entire books using minimal prompts, impacting ongoing copyright disputes. AI
影响 Findings could influence copyright litigation and model training practices regarding memorization of copyrighted material.
排序理由 Academic paper detailing a new method for extracting memorized content from LLMs. [lever_c_demoted from research: ic=1 ai=1.0]
AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →