ENTITY BM25

BM25

PulseAugur coverage of BM25 — every cluster mentioning BM25 across labs, papers, and developer communities, ranked by signal.

Total · 30d

64

64 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

49

49 over 90d

TIER MIX · 90D

research 28
tool 33
commentary 3

TOPICS

RELATIONSHIPS

SENTIMENT · 30D

24 day(s) with sentiment data

RECENT · PAGE 1/4 · 64 TOTAL

TOOL · CL_112566 · Jun 26 · 15:29

Stale documents in RAG systems pose significant risks, study finds

A recent study conducted by Emory University and IBM Research investigated the impact of stale documents on retrieval-augmented generation (RAG) systems. The experiment revealed that outdated information in a RAG system…
TOOL · CL_109904 · Jun 25 · 04:00

Many-shot ICL boosts low-resource language translation, study finds

Researchers have conducted an empirical study on many-shot in-context learning (ICL) for machine translation, specifically focusing on low-resource languages. Their findings indicate that increasing the number of exampl…
TOOL · CL_109471 · Jun 24 · 00:12

Adaptive Re-Ranking cuts IR latency by routing queries efficiently

Researchers have introduced Adaptive Re-Ranking, a framework designed to optimize computational costs and latency in information retrieval systems. This method routes queries based on their complexity, employing differe…
TOOL · CL_103227 · Jun 22 · 03:56

Build Hybrid RAG System Combining Semantic and Keyword Search

This article details the construction of a hybrid Retrieval-Augmented Generation (RAG) system that combines the strengths of both semantic and keyword search. It addresses the limitations of single-mode retrieval, where…
COMMENTARY · CL_103119 · Jun 22 · 00:23

AI agents fail due to flawed search index distribution, not prompting

A common issue in AI agents is that their search results appear correct but lead to factually wrong answers due to problems with the underlying search index. This is not a prompting issue but a distribution problem, whe…
RESEARCH · CL_105011 · Jun 22 · 00:00

HAKARI-Bench offers lightweight evaluation for retrieval models · 2 sources tracked

Researchers have introduced HAKARI-Bench, a lightweight benchmark designed to streamline the evaluation of retrieval architectures and efficiency settings for retrieval-augmented generation and semantic search. This new…
TOOL · CL_105013 · Jun 21 · 22:09

VISTA Architect AI system integrates LLMs with EHRs for medical data synthesis

Researchers have developed VISTA Architect, a novel AI system designed to integrate large language models with electronic health records (EHRs). This system transforms clinical data into a knowledge graph, creating a sy…
TOOL · CL_104614 · Jun 20 · 17:04

Novelty-Aware Agentic Retrieval System Enhances Scientific Literature Search

Researchers have developed a Novelty-Aware Research Agent, an agentic retrieval system designed to go beyond standard RAG by providing structured multi-step reasoning for scientific literature search. This system aims t…
TOOL · CL_104621 · Jun 19 · 15:50

Local 7B model study dissects agentic RAG for multi-hop QA

Researchers have conducted an ablation study on agentic retrieval-augmented generation (RAG) systems, specifically focusing on multi-hop question answering with a local 7B parameter model, Qwen2.5-7B-Instruct. The study…
TOOL · CL_98656 · Jun 18 · 11:43

PostgreSQL AI deployment challenges addressed by open-source stack

Mike Josephson from pgEdge discussed the challenges of deploying AI applications with PostgreSQL, highlighting that most current applications are still in experimental stages. He detailed an open-source stack, including…
RESEARCH · CL_106736 · Jun 18 · 11:38

Streaming RAG technique hides tool latency by stabilizing query intent early

A new arXiv paper investigates Streaming Retrieval-Augmented Generation (Streaming RAG), a technique that hides tool latency by issuing retrieval queries in parallel with user input. Researchers characterized "tool-inte…
RESEARCH · CL_99524 · Jun 18 · 11:38

Streaming RAG research quantifies latency reduction via tool-intent stabilization

A new research paper explores the effectiveness of Streaming Retrieval-Augmented Generation (Streaming RAG) in reducing latency for users. The study introduces the concept of 'tool-intent stabilization,' which measures …
TOOL · CL_96092 · Jun 17 · 04:00

AI agent learns to improve legal case retrieval through self-evolution

Researchers have developed a novel self-evolving agent framework designed to enhance legal case retrieval systems. This agent iteratively refines rewriting rules for the BM25 baseline by utilizing an LLM within an autom…
RESEARCH · CL_92664 · Jun 15 · 21:00

RAG pipelines: From BM25 to reranking for improved AI assistant accuracy

A developer detailed the process of building a retrieval-augmented generation (RAG) pipeline for an AI assistant integrated into a Go-based task queue system. The initial implementation used ChromaDB for vector search, …
RESEARCH · CL_93380 · Jun 15 · 09:58

daVinci-kernel uses RL to optimize GPU kernels with evolving skill library

Researchers have developed daVinci-kernel, a novel reinforcement learning framework designed to optimize GPU kernels. This system co-evolves skill selection, summarization, and utilization, employing three agents that s…
RESEARCH · CL_93339 · Jun 14 · 19:52

New framework redefines entity relevance for document retrieval

A new research paper proposes a framework to improve document re-ranking by distinguishing between conceptual entity relevance and observable entity relevance. The authors argue that current entity-aware retrieval metho…
TOOL · CL_89824 · Jun 14 · 06:06

RAG System Quality Hinges on Retrieval, Not Just Prompts

This article argues that most problems with Retrieval-Augmented Generation (RAG) systems stem from poor retrieval rather than the language model itself. The author suggests eight fixes, prioritizing improvements to the …
TOOL · CL_89560 · Jun 13 · 22:20

BM25 and Dense Fusion: Hybrid RAG for Exact Match Accuracy

A technical article discusses the limitations of pure vector search in Retrieval-Augmented Generation (RAG) systems, particularly when dealing with exact identifiers like error codes, product SKUs, or specific phrases. …
TOOL · CL_89187 · Jun 13 · 14:29

Chinese Parsers DeepDoc, MinerU Crossover in Japanese RAG Performance

A comparative analysis of two Chinese open-source document parsers, DeepDoc and MinerU, for Japanese RAG systems reveals a crossover performance based on the retrieval method used. DeepDoc demonstrated superior results …
TOOL · CL_88763 · Jun 13 · 07:17

Structured Parsing Boosts Dense Retrieval Performance in LLM RAG

A study comparing document parsing strategies for retrieval-augmented generation (RAG) found that structured parsing significantly benefits dense retrieval more than traditional BM25 methods. When using dense retrieval,…