PulseAugur
EN
LIVE 07:10:26

New RAG method Eraser4RAG removes private data, outperforms GPT-4o

Researchers have developed Eraser4RAG, a novel method to remove sensitive information from documents used in Retrieval-Augmented Generation (RAG) systems. This approach constructs a knowledge graph to identify and separate private from public information, then fine-tunes a model to rewrite documents, excluding private triples while preserving public knowledge. Experiments show Eraser4RAG outperforms GPT-4o in effectively erasing private data while maintaining the utility of public information for generative tasks. AI

IMPACT Enhances privacy in RAG systems by enabling custom erasure of sensitive data without compromising generative capabilities.

RANK_REASON The cluster contains a research paper detailing a new method for privacy in RAG systems. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New RAG method Eraser4RAG removes private data, outperforms GPT-4o

COVERAGE [1]

  1. arXiv cs.CL TIER_1 English(EN) · Yujing Wang, Jinwen Chen, Hainan Zhang, Liang Pang, Yongxin Tong, Binghui Guo, Hongwei Zheng, Zhiming Zheng ·

    Learning to Erase Private Knowledge from Multi-Documents for Retrieval-Augmented Large Language Models

    arXiv:2504.09910v2 Announce Type: replace Abstract: Retrieval-Augmented Generation (RAG) is a promising technique for applying LLMs to proprietary domains. However, retrieved documents may contain sensitive knowledge, posing risks of privacy leakage in generative results. Thus, e…