PulseAugur
实时 23:40:30
English(EN) Rethinking Retrieval-Augmented Generation as a Cooperative Decision-Making Problem

新的RAG研究解决了表格数据、成本和跨语言知识问题

几篇最新的研究论文探讨了检索增强生成(RAG)系统的进展。一篇论文介绍了正交子空间分解(OSD),用于将特定任务行为与参数化RAG中的文档知识分离开来,从而改进适配器组合。另一篇论文CroSearch-R1提出了一个框架,通过将多语言信息整合到强化学习过程中,更好地利用跨语言知识进行RAG。此外,研究还探讨了指代消解对RAG的影响,证明其能够减少歧义并提高性能,特别是对于较小的模型。其他研究则侧重于通过重排分析增强RAG在金融报告等特定领域的应用,以及使用语义缓存进行知识图谱问答。 AI

影响 这些论文共同推动了RAG技术的发展,有望提高LLM应用的事实准确性、跨语言能力和可解释性。

排序理由 该集群包含多个arXiv预印本,详细介绍了用于改进RAG系统的新研究方法和数据集。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 18 个来源。 我们如何撰写摘要 →

新的RAG研究解决了表格数据、成本和跨语言知识问题

报道来源 [18]

  1. arXiv cs.CL TIER_1 English(EN) · Pooja Guttal, Varun Magotra, Vasudeva Mahavishnu, Natasha Chanto, Sidharth Sivaprasad, Manas Gaur ·

    Structure-Aware Chunking for Tabular Data in Retrieval-Augmented Generation

    arXiv:2605.00318v1 Announce Type: new Abstract: Tabular documents such as CSV and Excel files are widely used in enterprise data pipelines, yet existing chunking strategies for retrieval-augmented generation (RAG) are primarily designed for unstructured text and do not account fo…

  2. arXiv cs.LG TIER_1 English(EN) · Shawqi Al-Maliki, Ammar Gharaibeh, Mohamed Rahouti, Mohammad Ruhul Amin, Mohamed Abdallah, Junaid Qadir, Ala Al-Fuqaha ·

    Budget-Constrained Online Retrieval-Augmented Generation: The Chunk-as-a-Service Model

    arXiv:2604.26981v1 Announce Type: cross Abstract: Large Language Models (LLMs) have revolutionized the field of natural language processing. However, they exhibit some limitations, including a lack of reliability and transparency: they may hallucinate and fail to provide sources …

  3. arXiv cs.CL TIER_1 English(EN) · Manas Gaur ·

    Structure-Aware Chunking for Tabular Data in Retrieval-Augmented Generation

    Tabular documents such as CSV and Excel files are widely used in enterprise data pipelines, yet existing chunking strategies for retrieval-augmented generation (RAG) are primarily designed for unstructured text and do not account for tabular structure. We propose a structure-awar…

  4. arXiv cs.CL TIER_1 English(EN) · Koki Itai, Shunichi Hasegawa, Yuta Yamamoto, Gouki Minegishi, Masaki Otsuki ·

    LIT-RAGBench: Benchmarking Generator Capabilities of Large Language Models in Retrieval-Augmented Generation

    arXiv:2603.06198v2 Announce Type: replace Abstract: Retrieval-Augmented Generation (RAG) is a framework in which a Generator, such as a Large Language Model (LLM), produces answers by retrieving documents from an external collection using a Retriever. In practice, Generators must…

  5. arXiv cs.CL TIER_1 English(EN) · Yushi Sun, Lei Chen ·

    CacheRAG: A Semantic Caching System for Retrieval-Augmented Generation in Knowledge Graph Question Answering

    arXiv:2604.26176v1 Announce Type: cross Abstract: The integration of Large Language Models (LLMs) with Retrieval-Augmented Generation (RAG) has significantly advanced Knowledge Graph Question Answering (KGQA). However, existing LLM-driven KGQA systems act as stateless planners, g…

  6. arXiv cs.CL TIER_1 English(EN) · Weihang Su, Hanwen Zhang, Qingyao Ai, Yiqun Liu ·

    Decoupling Knowledge and Task Subspaces for Composable Parametric Retrieval Augmented Generation

    arXiv:2604.26768v1 Announce Type: new Abstract: Parametric Retrieval-Augmented Generation (PRAG) encodes external documents into lightweight parameter modules that can be retrieved and merged at inference time, offering a promising alternative to in-context retrieval augmentation…

  7. arXiv cs.CL TIER_1 English(EN) · Yiqun Liu ·

    Decoupling Knowledge and Task Subspaces for Composable Parametric Retrieval Augmented Generation

    Parametric Retrieval-Augmented Generation (PRAG) encodes external documents into lightweight parameter modules that can be retrieved and merged at inference time, offering a promising alternative to in-context retrieval augmentation. Despite its potential, many PRAG implementatio…

  8. arXiv cs.CL TIER_1 English(EN) · Zhiyuan Cheng, Longying Lai, Yue Liu, Kai Cheng, Xiaoxi Qi ·

    Enhancing Financial Report Question-Answering: A Retrieval-Augmented Generation System with Reranking Analysis

    arXiv:2603.16877v2 Announce Type: replace Abstract: Financial analysts face significant challenges extracting information from lengthy 10-K reports, which often exceed 100 pages. This paper presents a Retrieval-Augmented Generation (RAG) system designed to answer questions about …

  9. arXiv cs.CL TIER_1 English(EN) · Rui Qi, Fengran Mo, Sijin Lu, Yufeng Chen, Jian-Yun Nie, Kaiyu Huang ·

    CroSearch-R1: Better Leveraging Cross-lingual Knowledge for Retrieval-Augmented Generation

    arXiv:2604.25182v1 Announce Type: new Abstract: A multilingual collection may contain useful knowledge in other languages to supplement and correct the facts in the original language for Retrieval-Augmented Generation (RAG). However, the vanilla approach that simply concatenates …

  10. arXiv cs.CL TIER_1 English(EN) · Youngjoon Jang, Seongtae Hong, Junyoung Son, Sungjin Park, Chanjun Park, Heuiseok Lim ·

    From Ambiguity to Accuracy: The Transformative Effect of Coreference Resolution on Retrieval-Augmented Generation systems

    arXiv:2507.07847v3 Announce Type: replace Abstract: Retrieval-Augmented Generation (RAG) has emerged as a crucial framework in natural language processing (NLP), improving factual consistency and reducing hallucinations by integrating external document retrieval with large langua…

  11. arXiv cs.CL TIER_1 English(EN) · Lei Chen ·

    CacheRAG: A Semantic Caching System for Retrieval-Augmented Generation in Knowledge Graph Question Answering

    The integration of Large Language Models (LLMs) with Retrieval-Augmented Generation (RAG) has significantly advanced Knowledge Graph Question Answering (KGQA). However, existing LLM-driven KGQA systems act as stateless planners, generating retrieval plans in isolation without exp…

  12. arXiv cs.LG TIER_1 English(EN) · Zhuoling Li, Ha Linh Hong Tran Nguyen, Valeria Bladinieres, Maxim Romanovsky ·

    XGRAG: A Graph-Native Framework for Explaining KG-based Retrieval-Augmented Generation

    arXiv:2604.24623v1 Announce Type: cross Abstract: Graph-based Retrieval-Augmented Generation (GraphRAG) extends traditional RAG by using knowledge graphs (KGs) to give large language models (LLMs) a structured, semantically coherent context, yielding more grounded answers. Howeve…

  13. arXiv cs.AI TIER_1 English(EN) · Miao Xie, Xiao Zhang, Yi Li, Chunli Lv ·

    Structure Guided Retrieval-Augmented Generation for Factual Queries

    arXiv:2604.22843v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) has been proposed to mitigate hallucinations in large language models (LLMs), where generated outputs may be factually incorrect. However, existing RAG approaches predominantly rely on vector s…

  14. arXiv cs.AI TIER_1 English(EN) · Aryan Patodiya ·

    StratRAG: A Multi-Hop Retrieval Evaluation Dataset for Retrieval-Augmented Generation Systems

    arXiv:2604.22757v1 Announce Type: cross Abstract: We introduce StratRAG, an open-source retrieval evaluation dataset for benchmarking Retrieval-Augmented Generation (RAG) systems on multi-hop reasoning tasks under realistic, noisy document-pool conditions. Derived from HotpotQA (…

  15. arXiv cs.LG TIER_1 English(EN) · Yuchen Yan, Peiyan Zhang, Zhihua Liu, Hao Wang, Yatao Bian, Weiming Li, Xiaoshuai Hao ·

    Question-Adaptive Graph Learning for Multi-hop Retrieval Augmented Generation

    arXiv:2510.11541v2 Announce Type: replace Abstract: Retrieval-augmented generation (RAG) has demonstrated its ability to enhance Large Language Models (LLMs) by integrating external knowledge sources. However, multi-hop questions, which require the identification of multiple know…

  16. arXiv cs.CL TIER_1 English(EN) · Kaiyu Huang ·

    CroSearch-R1: Better Leveraging Cross-lingual Knowledge for Retrieval-Augmented Generation

    A multilingual collection may contain useful knowledge in other languages to supplement and correct the facts in the original language for Retrieval-Augmented Generation (RAG). However, the vanilla approach that simply concatenates multiple pieces of knowledge from different lang…

  17. arXiv cs.AI TIER_1 English(EN) · Maxim Romanovsky ·

    XGRAG: A Graph-Native Framework for Explaining KG-based Retrieval-Augmented Generation

    Graph-based Retrieval-Augmented Generation (GraphRAG) extends traditional RAG by using knowledge graphs (KGs) to give large language models (LLMs) a structured, semantically coherent context, yielding more grounded answers. However, GraphRAG reasoning process remains a black-box,…

  18. arXiv cs.CL TIER_1 English(EN) · Lichang Song, Ting Long, Yi Chang ·

    Rethinking Retrieval-Augmented Generation as a Cooperative Decision-Making Problem

    arXiv:2602.18734v2 Announce Type: replace Abstract: Retrieval-Augmented Generation (RAG) has demonstrated strong effectiveness in knowledge-intensive tasks by grounding language generation in external evidence. Despite its success, many existing RAG systems are built based on a r…