retrieval-augmented generation
PulseAugur coverage of retrieval-augmented generation — every cluster mentioning retrieval-augmented generation across labs, papers, and developer communities, ranked by signal.
- instance of DagsHub 90%
- used by Markdown 90%
- instance of HotpotQA 90%
- instance of Graphrag 90%
- instance of GraphRAG with Knowledge Graphs for Question Answering on Administrative Meeting Records 90%
- used by Hierarchical Navigable Small World graphs 90%
- used by BGE-Reranker-v2-m3 90%
- instance of text-embedding-3-large 90%
- instance of vector RAG 90%
- instance of Ragas 90%
- uses TigerGraph 90%
- instance of PromptFromZero 90%
- 2026-05-20 research_milestone A developer built a safety-first RAG agent for support tickets, ranking highly in a hackathon. source
- 2026-05-10 research_milestone A study empirically analyzed byte-exact deduplication in RAG systems, demonstrating significant context reduction without quality loss. source
- 2026-05-10 research_milestone A study assessed RAG and fine-tuning for industrial question-answering applications, finding RAG to be more cost-effective. source
- 2026-05-10 research_milestone A study assessed RAG and fine-tuning for industrial question-answering applications, finding RAG to be more cost-effective. source
31 day(s) with sentiment data
-
New AI job board AIRAG Jobs launches to combat outdated listings
A developer has created AIRAG Jobs, a new job board specifically for AI engineering roles, aiming to solve the frustrations of broad, duplicate, or outdated listings on existing platforms. The site aggregates jobs direc…
-
Streamlit App Demonstrates Five Key AI Patterns with Ollama
A local Streamlit application powered by Ollama has been developed to showcase five core AI patterns. This application integrates conversational chat, MCP (likely referring to a specific AI methodology or framework), pr…
-
Safely rolling out new AI models in production requires a phased approach
Deploying new AI models into production requires a structured rollout plan to mitigate risks such as changes in answer quality, latency, and cost. A phased approach, including local smoke tests, staging evaluations, sha…
-
Stale documents in RAG systems pose significant risks, study finds
A recent study conducted by Emory University and IBM Research investigated the impact of stale documents on retrieval-augmented generation (RAG) systems. The experiment revealed that outdated information in a RAG system…
-
Local RAG system for scientific articles faces initial failures, then improved
The author attempted to build a local Retrieval-Augmented Generation (RAG) system for scientific articles, incorporating features like graphs, hybrid search, HyDE, and rerankers. Initially, the full pipeline underperfor…
-
AI developers need multi-model usage tracking for cost, latency, and reliability
Developers building applications with multiple AI models require robust usage tracking to manage costs, latency, and reliability. This involves logging specific metadata for each request, such as workflow, model used, t…
-
New 'Eyes-on-Me' method enables scalable RAG system poisoning
Researchers have developed a new method called "Eyes-on-Me" to more effectively poison retrieval-augmented generation (RAG) systems. This technique decomposes adversarial documents into reusable "Attention Attractors" a…
-
New benchmark evaluates retrieval in multimodal knowledge graph-augmented generation
Researchers have introduced MKG-RAG-Bench, a new benchmark designed to evaluate retrieval performance in multimodal knowledge graph-augmented generation (MKG-RAG) systems. Existing benchmarks often neglect the complexit…
-
RAG chunk overlap discrepancy found in system implementation
The author investigated the actual chunk overlap in their Retrieval-Augmented Generation (RAG) system, expecting a consistent 100-character overlap as specified. Upon printing and analyzing the chunks, they discovered t…
-
RAG system failures often due to retriever, not LLM
This article argues that issues with Retrieval-Augmented Generation (RAG) systems often stem from problems with the vector search retriever rather than the large language model (LLM) itself. It suggests building a found…
-
RAG Chunking Methods: A Guide to Improving LLM Accuracy
Chunking is a critical preprocessing step for Retrieval-Augmented Generation (RAG) systems, which aim to improve the factual accuracy of Large Language Models (LLMs) by providing them with external knowledge. The effect…
-
PhysRAG pipeline enhances AI video generation with physics knowledge · 2 sources tracked
Researchers have introduced PhysRAG, a new pipeline designed to improve the physical accuracy of AI-generated videos. This method utilizes Retrieval-Augmented Generation (RAG) to overcome limitations in training data by…
-
Production RAG Pipelines: LlamaIndex and Pinecone for Scalable AI
Building a production-ready retrieval-augmented generation (RAG) pipeline involves more than just connecting a large language model (LLM) to a knowledge base; it requires careful attention to infrastructure and data pip…
-
New AI framework LCAi enhances life cycle assessment interpretation
Researchers have developed a novel framework called LCAi that leverages retrieval-augmented generation (RAG) to improve the interpretation phase of life cycle assessments (LCAs). This AI-assisted approach fuses data fro…
-
AI tutor TutorIA adapts to child profiles and remembers sessions
TutorIA is an AI-powered educational tutor designed for children aged 6 to 14, aiming to provide personalized learning experiences. It adapts its language and teaching methods based on a child's specific profile, such a…
-
RAG pipeline chunking strategies are key to retrieval quality, not just basic diagrams
Two articles discuss the critical role of chunking strategies in Retrieval-Augmented Generation (RAG) pipelines. The first emphasizes that RAG is more than just a basic four-box diagram, highlighting the need for accoun…
-
New App Enables Local, Offline Chat With Documents
Off Grid AI Desktop is a new, free, open-source application designed to enable users to chat with their documents locally on their personal computers. The tool handles the entire process, including embedding, vector sto…
-
New RAG method Eraser4RAG removes private data, outperforms GPT-4o
Researchers have developed Eraser4RAG, a novel method to remove sensitive information from documents used in Retrieval-Augmented Generation (RAG) systems. This approach constructs a knowledge graph to identify and separ…
-
New RAG method improves agent persuasion by decoupling logic from topic
Researchers have developed a new method called Taxonomic Strategy Retrieval (TS-RAG) to address compounding failures in foundation model agents, particularly in subjective tasks like persuasion. Standard Retrieval-Augme…
-
Engram pioneers AI 'memory' by baking knowledge into weights, not just context
AI startup Engram is developing a novel approach to AI memory and continual learning, aiming to embed specialized knowledge directly into model weights rather than relying solely on retrieval-augmented generation (RAG) …