Researchers have developed Faithfulness-QA, a new dataset containing nearly 100,000 samples designed to train Retrieval-Augmented Generation (RAG) models to prioritize retrieved context over their internal knowledge. The dataset was created by systematically replacing named entities in existing question-answering benchmarks with alternatives, thereby generating conflicts between context and parametric memory. This resource aims to improve the faithfulness of RAG systems and provides a benchmark for evaluating their context-grounding capabilities. AI
Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →
IMPACT Improves RAG model faithfulness by providing a dataset to train context-grounding capabilities.
RANK_REASON Release of a new dataset for training and evaluating RAG models.