English(EN) Rethinking Retrieval-Augmented Generation as a Cooperative Decision-Making Problem

新的RAG研究解决了表格数据、成本和跨语言知识问题

作者 PulseAugur 编辑部 · [18 个来源] · 2026-04-27 04:00

几篇最新的研究论文探讨了检索增强生成（RAG）系统的进展。一篇论文介绍了正交子空间分解（OSD），用于将特定任务行为与参数化RAG中的文档知识分离开来，从而改进适配器组合。另一篇论文CroSearch-R1提出了一个框架，通过将多语言信息整合到强化学习过程中，更好地利用跨语言知识进行RAG。此外，研究还探讨了指代消解对RAG的影响，证明其能够减少歧义并提高性能，特别是对于较小的模型。其他研究则侧重于通过重排分析增强RAG在金融报告等特定领域的应用，以及使用语义缓存进行知识图谱问答。 AI

影响这些论文共同推动了RAG技术的发展，有望提高LLM应用的事实准确性、跨语言能力和可解释性。

排序理由该集群包含多个arXiv预印本，详细介绍了用于改进RAG系统的新研究方法和数据集。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 18 个来源。我们如何撰写摘要 →

报道来源 [18]

arXiv cs.CL TIER_1 English(EN) · Pooja Guttal, Varun Magotra, Vasudeva Mahavishnu, Natasha Chanto, Sidharth Sivaprasad, Manas Gaur · 2026-05-04 04:00

面向检索增强生成的表格数据结构感知分块

arXiv:2605.00318v1 Announce Type: new Abstract: Tabular documents such as CSV and Excel files are widely used in enterprise data pipelines, yet existing chunking strategies for retrieval-augmented generation (RAG) are primarily designed for unstructured text and do not account fo…
arXiv cs.LG TIER_1 English(EN) · Shawqi Al-Maliki, Ammar Gharaibeh, Mohamed Rahouti, Mohammad Ruhul Amin, Mohamed Abdallah, Junaid Qadir, Ala Al-Fuqaha · 2026-05-01 04:00

预算受限的在线检索增强生成：Chunk-as-a-Service 模型

arXiv:2604.26981v1 Announce Type: cross Abstract: Large Language Models (LLMs) have revolutionized the field of natural language processing. However, they exhibit some limitations, including a lack of reliability and transparency: they may hallucinate and fail to provide sources …
arXiv cs.CL TIER_1 English(EN) · Manas Gaur · 2026-05-01 00:57

用于检索增强生成的表格数据的结构感知分块

Tabular documents such as CSV and Excel files are widely used in enterprise data pipelines, yet existing chunking strategies for retrieval-augmented generation (RAG) are primarily designed for unstructured text and do not account for tabular structure. We propose a structure-awar…
arXiv cs.CL TIER_1 English(EN) · Koki Itai, Shunichi Hasegawa, Yuta Yamamoto, Gouki Minegishi, Masaki Otsuki · 2026-04-30 04:00

LIT-RAGBench：评估检索增强生成中大型语言模型生成器能力

arXiv:2603.06198v2 Announce Type: replace Abstract: Retrieval-Augmented Generation (RAG) is a framework in which a Generator, such as a Large Language Model (LLM), produces answers by retrieving documents from an external collection using a Retriever. In practice, Generators must…
arXiv cs.CL TIER_1 English(EN) · Yushi Sun, Lei Chen · 2026-04-30 04:00

CacheRAG：知识图谱问答中用于检索增强生成的语义缓存系统

arXiv:2604.26176v1 Announce Type: cross Abstract: The integration of Large Language Models (LLMs) with Retrieval-Augmented Generation (RAG) has significantly advanced Knowledge Graph Question Answering (KGQA). However, existing LLM-driven KGQA systems act as stateless planners, g…
arXiv cs.CL TIER_1 English(EN) · Weihang Su, Hanwen Zhang, Qingyao Ai, Yiqun Liu · 2026-04-30 04:00

解耦知识与任务子空间以实现可组合参数化检索增强生成

arXiv:2604.26768v1 Announce Type: new Abstract: Parametric Retrieval-Augmented Generation (PRAG) encodes external documents into lightweight parameter modules that can be retrieved and merged at inference time, offering a promising alternative to in-context retrieval augmentation…
arXiv cs.CL TIER_1 English(EN) · Yiqun Liu · 2026-04-29 15:00

解耦知识与任务子空间以实现可组合参数化检索增强生成

Parametric Retrieval-Augmented Generation (PRAG) encodes external documents into lightweight parameter modules that can be retrieved and merged at inference time, offering a promising alternative to in-context retrieval augmentation. Despite its potential, many PRAG implementatio…
arXiv cs.CL TIER_1 English(EN) · Zhiyuan Cheng, Longying Lai, Yue Liu, Kai Cheng, Xiaoxi Qi · 2026-04-29 04:00

增强财务报告问答：带有重排分析的检索增强生成系统

arXiv:2603.16877v2 Announce Type: replace Abstract: Financial analysts face significant challenges extracting information from lengthy 10-K reports, which often exceed 100 pages. This paper presents a Retrieval-Augmented Generation (RAG) system designed to answer questions about …
arXiv cs.CL TIER_1 English(EN) · Rui Qi, Fengran Mo, Sijin Lu, Yufeng Chen, Jian-Yun Nie, Kaiyu Huang · 2026-04-29 04:00

CroSearch-R1：更好地利用跨语言知识进行检索增强生成

arXiv:2604.25182v1 Announce Type: new Abstract: A multilingual collection may contain useful knowledge in other languages to supplement and correct the facts in the original language for Retrieval-Augmented Generation (RAG). However, the vanilla approach that simply concatenates …
arXiv cs.CL TIER_1 English(EN) · Youngjoon Jang, Seongtae Hong, Junyoung Son, Sungjin Park, Chanjun Park, Heuiseok Lim · 2026-04-29 04:00

从模糊到精准：指代消解对检索增强生成系统的变革性影响

arXiv:2507.07847v3 Announce Type: replace Abstract: Retrieval-Augmented Generation (RAG) has emerged as a crucial framework in natural language processing (NLP), improving factual consistency and reducing hallucinations by integrating external document retrieval with large langua…
arXiv cs.CL TIER_1 English(EN) · Lei Chen · 2026-04-28 23:46

CacheRAG：知识图谱问答中用于检索增强生成的语义缓存系统

The integration of Large Language Models (LLMs) with Retrieval-Augmented Generation (RAG) has significantly advanced Knowledge Graph Question Answering (KGQA). However, existing LLM-driven KGQA systems act as stateless planners, generating retrieval plans in isolation without exp…
arXiv cs.LG TIER_1 English(EN) · Zhuoling Li, Ha Linh Hong Tran Nguyen, Valeria Bladinieres, Maxim Romanovsky · 2026-04-28 04:00

XGRAG：一种用于解释基于知识图谱的检索增强生成的原生图框架

arXiv:2604.24623v1 Announce Type: cross Abstract: Graph-based Retrieval-Augmented Generation (GraphRAG) extends traditional RAG by using knowledge graphs (KGs) to give large language models (LLMs) a structured, semantically coherent context, yielding more grounded answers. Howeve…
arXiv cs.AI TIER_1 English(EN) · Miao Xie, Xiao Zhang, Yi Li, Chunli Lv · 2026-04-28 04:00

面向事实查询的结构化引导检索增强生成

arXiv:2604.22843v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) has been proposed to mitigate hallucinations in large language models (LLMs), where generated outputs may be factually incorrect. However, existing RAG approaches predominantly rely on vector s…
arXiv cs.AI TIER_1 English(EN) · Aryan Patodiya · 2026-04-28 04:00

StratRAG：用于检索增强生成系统的多跳检索评估数据集

arXiv:2604.22757v1 Announce Type: cross Abstract: We introduce StratRAG, an open-source retrieval evaluation dataset for benchmarking Retrieval-Augmented Generation (RAG) systems on multi-hop reasoning tasks under realistic, noisy document-pool conditions. Derived from HotpotQA (…
arXiv cs.LG TIER_1 English(EN) · Yuchen Yan, Peiyan Zhang, Zhihua Liu, Hao Wang, Yatao Bian, Weiming Li, Xiaoshuai Hao · 2026-04-28 04:00

面向多跳检索增强生成的问答自适应图学习

arXiv:2510.11541v2 Announce Type: replace Abstract: Retrieval-augmented generation (RAG) has demonstrated its ability to enhance Large Language Models (LLMs) by integrating external knowledge sources. However, multi-hop questions, which require the identification of multiple know…
arXiv cs.CL TIER_1 English(EN) · Kaiyu Huang · 2026-04-28 03:41

CroSearch-R1：更好地利用跨语言知识进行检索增强生成

A multilingual collection may contain useful knowledge in other languages to supplement and correct the facts in the original language for Retrieval-Augmented Generation (RAG). However, the vanilla approach that simply concatenates multiple pieces of knowledge from different lang…
arXiv cs.AI TIER_1 English(EN) · Maxim Romanovsky · 2026-04-27 15:52

XGRAG：一种基于图的原生框架，用于解释 KG 驱动的检索增强生成

Graph-based Retrieval-Augmented Generation (GraphRAG) extends traditional RAG by using knowledge graphs (KGs) to give large language models (LLMs) a structured, semantically coherent context, yielding more grounded answers. However, GraphRAG reasoning process remains a black-box,…
arXiv cs.CL TIER_1 English(EN) · Lichang Song, Ting Long, Yi Chang · 2026-04-27 04:00

将检索增强生成视为合作决策问题的新思考

arXiv:2602.18734v2 Announce Type: replace Abstract: Retrieval-Augmented Generation (RAG) has demonstrated strong effectiveness in knowledge-intensive tasks by grounding language generation in external evidence. Despite its success, many existing RAG systems are built based on a r…

报道来源 [18]

相关实体

相关话题