WMDP
PulseAugur coverage of WMDP — every cluster mentioning WMDP across labs, papers, and developer communities, ranked by signal.
-
New metric reveals LLM unlearning methods fail to fully forget sensitive data
A new research paper introduces \"Leak@k\", a metric designed to evaluate the effectiveness of unlearning methods in large language models (LLMs). The study found that most current unlearning techniques fail to complete…
-
New AI unlearning methods balance data removal with model utility
Researchers have developed new methods for machine unlearning, a process that removes specific data from AI models without full retraining. One approach, SHRED, uses self-distillation and logit demotion to identify and …
-
Hugging Face introduces REGLU for efficient LLM unlearning
Researchers have developed a new method called Representation-Guided Low-rank Unlearning (REGLU) to address the challenge of removing specific information from large language models (LLMs) without degrading their overal…