Researchers have introduced Geometric Unlearning (GU), a novel method for selectively removing specific information from large language models without needing access to the original training data. This approach operates on the model's internal planning states, distilling safe behavior from a small set of reference prompts. GU then uses synthetic prompts to align these states with the desired safe geometry, minimizing impact on the model's general utility. Experiments on privacy benchmarks demonstrated GU's effectiveness in suppressing target information with minimal synthetic data. AI
影响 Offers a more efficient and data-light approach to LLM unlearning, potentially improving privacy compliance for deployed models.
排序理由 The cluster contains an arXiv preprint detailing a new method for LLM unlearning.
AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →