New RAG research tackles adversarial attacks and bias

By PulseAugur Editorial · [14 sources] · 2026-05-15 09:47

Two new research papers explore methods to improve the reliability and fairness of Retrieval-Augmented Generation (RAG) systems. One paper introduces BiRD, a defense mechanism that uses bidirectional ranking to detect and mitigate adversarial poisoning attacks, significantly reducing attack success rates while maintaining task accuracy. The other paper proposes a fairness-aware retrieval framework that models and controls bias introduced during the retrieval process, aiming to balance relevance and fairness in RAG outputs. AI

IMPACT New research offers methods to enhance RAG system security against attacks and improve fairness, potentially increasing trust and adoption.

RANK_REASON Two academic papers published on arXiv detailing new methods for improving Retrieval-Augmented Generation systems.

Read on arXiv cs.IR (Information Retrieval) →

AI-generated summary · Google Gemini · from 14 sources. How we write summaries →

New RAG research tackles adversarial attacks and bias

COVERAGE [14]

arXiv cs.CL TIER_1 English(EN) · Mingchen Li, Jiatan Huang, Chuxu Zhang, Liang Zhao, Hong Yu · 2026-05-27 04:00

In-Context Optimization for Retrieval-Augmented Generation: A Gradient-Descent Perspective

arXiv:2605.26356v1 Announce Type: new Abstract: In-context learning has recently been linked to implicit gradient descent in linear self-attention models, suggesting that context can induce a forward-pass update. Retrieval-augmented generation (RAG) also relies on context, but re…
arXiv cs.AI TIER_1 English(EN) · Yu-Chen Den, Yung-Yu Shih, Zhi Rui Tam, Kuan-Yu Chen, Pu-Jen Cheng, Yun-Nung Chen, Eugene Yang · 2026-05-27 04:00

ICICLE: Expanding Retrieval with In-Context Documents

arXiv:2605.26902v1 Announce Type: cross Abstract: Generative retrieval (GR) maps queries directly to document identifiers (docids) using parametric knowledge, However, this design makes corpus expansion costly: adding new documents requires updating model parameters to encode new…
arXiv cs.AI TIER_1 English(EN) · Tetsuya Sakai, Jina Lee, Hanpei Fang, Young-In Song · 2026-05-27 04:00

Plans for Evaluating Structured Generative Search Summaries

arXiv:2605.26400v1 Announce Type: cross Abstract: We propose a framework for evaluating structured generative search summaries that are placed atop organic web search results. A structured summary, generated by a large language model, typically consists of an overview, several se…
arXiv cs.AI TIER_1 English(EN) · Zhe Yu, Wenpeng Xing, Chen Ye, Xuyang Teng, Bo Yang, Changting Lin, Meng Han · 2026-05-27 04:00

Detecting Is Not Resolving: The Monitoring Control Gap in Retrieval Augmented LLMs

arXiv:2605.27157v1 Announce Type: new Abstract: Retrieval-augmented LLMs are deployed for tasks where evidence quality determines action safety, yet evaluation protocols assume that single-turn robustness predicts robustness when evidence accumulates across turns. We show this as…
arXiv cs.AI TIER_1 English(EN) · Meng Han · 2026-05-26 15:18

Detecting Is Not Resolving: The Monitoring Control Gap in Retrieval Augmented LLMs

Retrieval-augmented LLMs are deployed for tasks where evidence quality determines action safety, yet evaluation protocols assume that single-turn robustness predicts robustness when evidence accumulates across turns. We show this assumption is fundamentally incorrect. Models exhi…
arXiv cs.IR (Information Retrieval) TIER_1 English(EN) · Eugene Yang · 2026-05-26 12:01

ICICLE: Expanding Retrieval with In-Context Documents

Generative retrieval (GR) maps queries directly to document identifiers (docids) using parametric knowledge, However, this design makes corpus expansion costly: adding new documents requires updating model parameters to encode new document-docid associations incurs repeated train…
arXiv cs.CL TIER_1 English(EN) · Miaohe Niu, Lianlei Shan, Zhengtao Yu, Jingbo Zhu, Tong Xiao · 2026-05-26 04:00

EfficientGraph-RAG: Structured Retrieval-State Management for Cross-Task Retrieval-Augmented Generation

arXiv:2605.25379v1 Announce Type: new Abstract: Retrieval-augmented generation (RAG) has become the standard way to ground large language models in external knowledge, but many systems still organize evidence as flat chunks and retrieve it through largely unstructured search. Thi…
arXiv cs.IR (Information Retrieval) TIER_1 English(EN) · Young-In Song · 2026-05-26 00:08

Plans for Evaluating Structured Generative Search Summaries

We propose a framework for evaluating structured generative search summaries that are placed atop organic web search results. A structured summary, generated by a large language model, typically consists of an overview, several sections with section titles, and a list of source d…
arXiv cs.IR (Information Retrieval) TIER_1 English(EN) · Chao Liang · 2026-05-19 17:11

BiRD: A Bidirectional Ranking Defense Mechanism for Retrieval Augmented Generation

The growing adoption of Retrieval-Augmented Generation (RAG) has led to a rise in adversarial attacks. Existing defenses, relying on semantic analysis or voting, face a trade-off between high computational cost and limited robustness under strong poisoning attacks. Their fundamen…
arXiv cs.IR (Information Retrieval) TIER_1 English(EN) · Kostas Stefanidis · 2026-05-15 09:47

Fairness-Aware Retrieval Optimization for Retrieval-Augmented Generation

Retrieval-Augmented Generation (RAG) improves reliability of large language models by incorporating external knowledge, but the retrieval process can introduce bias that propagates to generated outputs. This issue is particularly challenging in top-k settings, where multiple docu…
arXiv cs.CV TIER_1 English(EN) · Zhifeng Wang, Jason Jingshi Li, Kaihao Zhang, Ramesh Sankaranarayana · 2026-05-26 04:00

AstroRAG -- A Pagerank-Based Retrieval-Augmented Generation Pipeline for Question Answering in Astronomy

arXiv:2605.25039v1 Announce Type: new Abstract: Large language models (LLMs) demonstrate strong performance in natural language processing but often generate factual errors when relying solely on parametric knowledge. Retrieval-Augmented Generation (RAG) mitigates these errors by…
dev.to — LLM tag TIER_1 English(EN) · G V NIKITHA · 2026-05-26 10:30

What is RAG? A Beginner's Guide to Retrieval-Augmented Generation (For Engineers Who Actually Build It)

<p>RAG sounds complicated.</p> <p>It's not.</p> <p>But a lot of introductions to RAG make it sound more mysterious than it actually is. They use terms like "semantic search" and "vector embeddings" and "retrieval pipeline" before explaining what the actual problem is.</p> <p>So l…
r/MachineLearning TIER_1 English(EN) · /u/Just_Jaguar3701 · 2026-05-25 21:56

Aiki my local Wikipedia Retrieval-Augmented Generation system [R]

<div class="md"><h1>Hey</h1> <p>i built Aiki a lightweight tool that let's you chat with Wikipedia locally.</p> <p><strong>what it does:</strong> - Downloads and chunks wikipedia articles (u can choose those articles by their name or articles and also the option of…
dev.to — LLM tag TIER_1 English(EN) · Suraj Sharma · 2026-05-25 11:56

RAG Explained: How Retrieval-Augmented Generation Actually Works

<p><a class="article-body-image-wrapper" href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F1huwl40mxv99gjfyy340.png"><img alt="RAG Pipeline Diagram" heig…

COVERAGE [14]

RELATED ENTITIES

RELATED TOPICS