retrieval-augmented generation
PulseAugur coverage of retrieval-augmented generation — every cluster mentioning retrieval-augmented generation across labs, papers, and developer communities, ranked by signal.
- instance of Royal Galician Academy 90%
- used by LLM 90%
- used by large-language models 90%
- used by LLMs 90%
- instance of Graphrag 90%
- instance of HotpotQA 90%
- instance of Apium graveolens 90%
- instance of GraphRAG with Knowledge Graphs for Question Answering on Administrative Meeting Records 90%
- uses TigerGraph 90%
- used by large language model 90%
- instance of 2WikiMultiHopQA 90%
- used by Pinecone 90%
- 2026-05-20 research_milestone A developer built a safety-first RAG agent for support tickets, ranking highly in a hackathon. 来源
- 2026-05-10 research_milestone A study empirically analyzed byte-exact deduplication in RAG systems, demonstrating significant context reduction without quality loss. 来源
- 2026-05-10 research_milestone A study assessed RAG and fine-tuning for industrial question-answering applications, finding RAG to be more cost-effective. 来源
- 2026-05-10 research_milestone A study assessed RAG and fine-tuning for industrial question-answering applications, finding RAG to be more cost-effective. 来源
18 天有情绪数据
-
Author shares production AI lessons beyond LLM tutorials
The author shares practical lessons learned from building two production AI systems: a hybrid RAG system and an autonomous code review agent. Key takeaways emphasize the significant gap between LLM tutorials and real-wo…
-
New framework MTR-Suite improves conversational retrieval benchmarks
Researchers have developed MTR-Suite, a new framework designed to improve the evaluation and creation of conversational retrieval benchmarks. This suite includes MTR-Eval, an LLM-based tool for assessing existing benchm…
-
ColPali RAG system eliminates OCR, boosts document retrieval performance
A new system called ColPali has been developed to improve Retrieval-Augmented Generation (RAG) for documents. It bypasses the need for Optical Character Recognition (OCR) and text chunking by encoding image patches dire…
-
Air Canada chatbot lawsuit reveals RAG data quality flaws
A recent lawsuit against Air Canada, where their chatbot provided incorrect bereavement fare information, highlights a critical issue in Retrieval-Augmented Generation (RAG) systems. The problem was not AI hallucination…
-
RAG Explained: How AI Learns to Access External Information
Retrieval-Augmented Generation (RAG) is a technique that enhances AI models by allowing them to access and cite external information. This method improves the accuracy and relevance of AI-generated responses by groundin…
-
BalanceRAG optimizes retrieval-augmented generation with joint risk calibration
Researchers have introduced BalanceRAG, a novel approach to optimize retrieval-augmented generation (RAG) systems. This method aims to reduce unnecessary retrieval calls by intelligently calibrating the uncertainty thre…
-
Developer builds VORTEXRAG to fix RAG failures
A developer spent six months debugging a Retrieval-Augmented Generation (RAG) system for document Q&A, identifying two key failure modes: semantic drift in query reformulation and context poisoning by irrelevant but sim…
-
RAG fails enterprise R&D; Naboo offers context layer fix
Retrieval-Augmented Generation (RAG) is proving insufficient for complex enterprise R&D environments, according to Naboo CEO Gilad Salinger. RAG struggles with fragmented data across multiple systems like code repositor…
-
Fine-tuning vs. RAG: Choosing the right LLM customization
Fine-tuning large language models offers greater power and customization than Retrieval-Augmented Generation (RAG) but comes with a higher cost. Understanding the trade-offs between these two techniques is crucial for s…
-
新的审计协议解决了多租户RAG系统中的隐私风险
研究人员在多租户检索增强生成(RAG)系统中发现了一个隐私漏洞,特别是在账户串通方面。虽然这些服务通常保证每个账户的差分隐私,但研究表明,多个账户之间的协调串通会以与串通账户数量平方根成比例的速度降低这种隐私。为了解决这个问题,开发了一种新颖的审计协议,可以在不暴露敏感数据的情况下评估未经修改的RAG部署中检索分数通道的隐私。
-
wiki42 tool compiles markdown wikis for RAG systems
The open-source tool wiki42, developed by 42rows, is designed to convert markdown wikis into chunks suitable for Retrieval-Augmented Generation (RAG) systems. Unlike generic chunkers that split text based on token count…
-
Developer benchmarks RAG pipelines on Indian health literature
A developer is building a system to benchmark retrieval-augmented generation (RAG) pipelines using Indian public health literature. The platform will compare three AI retrieval methods on approximately 9,000 research pa…
-
Paper reframes AI ads as trustworthy intervention challenge
A new paper proposes that generative AI advertising should be viewed as a problem of trustworthy intervention rather than simple content placement. The research introduces a taxonomy of influence tiers, ranging from pro…
-
DocNest tool preserves PDF structure for better RAG performance
A developer has created DocNest, a tool designed to improve Retrieval-Augmented Generation (RAG) systems by focusing on document ingestion rather than just retrieval. DocNest preserves the structure of documents, includ…
-
New method boosts LLM long context handling with attention-state memory
Researchers have developed a new method called attention-state memory to improve how large language models handle long context inputs. This training-free approach externalizes the prefix into a memory of precomputed att…
-
New RAG framework predicts information needs to cut latency
Researchers have developed a new framework for Retrieval-Augmented Generation (RAG) that significantly reduces latency by predicting and prefetching information. This system analyzes generation dynamics to anticipate in…
-
Blogger shares LLM chunking strategies for long MDX articles
A technical blogger details strategies for managing token limits when feeding long MDX articles to Large Language Models. The author explains that exceeding a model's context window can lead to errors or incomplete proc…
-
AI 对无效 Bug 报告进行分类,并提出无代码修复建议
研究人员开发了自动分类无效 Bug 报告并建议无代码修复的方法,旨在减少客户支持中的资源浪费。他们在一个精心策划的基准上试验了大型语言模型 (LLM)、检索增强生成 (RAG) 和代理式网络搜索。检索增强生成在子分类方面取得了最高的性能,而代理式网络搜索在生成无代码修复方面表现出色。
-
CyberGraph RAG uses TigerGraph to improve LLM cybersecurity analysis
Researchers developed CyberGraph RAG, a system designed to improve how large language models handle cybersecurity data by leveraging graph databases. Unlike traditional RAG which struggles with the relational nature of …
-
Agentic RAG fixes 40% retrieval failure in LLM pipelines
A new approach called Agentic RAG addresses significant retrieval failures in standard RAG pipelines, which are shown to fail up to 40% of the time in production. Unlike standard RAG, Agentic RAG uses an agent to dynami…