Brief

last 24h

[2/2] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · arXiv cs.CV English(EN) · 8h

MIRAGE: Runtime Scheduling for Multi-Vector Image Retrieval with Hierarchical Decomposition

Researchers have introduced MIRAGE, a new framework designed to improve the efficiency and accuracy of multi-vector image retrieval (MVR) within multimodal large language models (MLLMs). MIRAGE addresses limitations in current MVR systems by employing a hierarchical approach that better aligns queries with diverse image objects and reduces redundant computations through cross-hierarchy similarity consistency. The system also automates parameter configuration for various datasets, enhancing its practicality. Empirical results indicate that MIRAGE significantly boosts accuracy while reducing computational costs by up to 3.5 times compared to existing MVR systems. AI

IMPACT MIRAGE's efficiency gains could accelerate the development and deployment of more sophisticated multimodal AI applications.
RESEARCH · arXiv cs.IR (Information Retrieval) English(EN) · 3w · [2 sources]

MVR-cache: Optimizing Semantic Caching via Multi-Vector Retrieval and Learned Prompt Segmentation

Researchers have developed MVR-cache, a new semantic caching system designed to reduce the costs and latency associated with Large Language Models (LLMs). This system utilizes Multi-Vector Retrieval (MVR) and a learnable prompt segmentation model to achieve more accurate identification of matching prompts. By intelligently splitting prompts and employing a reinforcement learning strategy, MVR-cache has demonstrated an increase in cache hit rates by up to 37% compared to existing state-of-the-art methods, while maintaining strict correctness guarantees. AI

IMPACT MVR-cache's significant improvement in cache hit rates could lead to reduced operational costs and faster response times for LLM-powered applications.

Brief

MIRAGE: Runtime Scheduling for Multi-Vector Image Retrieval with Hierarchical Decomposition

MVR-cache: Optimizing Semantic Caching via Multi-Vector Retrieval and Learned Prompt Segmentation