MIRAGE: Runtime Scheduling for Multi-Vector Image Retrieval with Hierarchical Decomposition
Researchers have introduced MIRAGE, a new framework designed to improve the efficiency and accuracy of multi-vector image retrieval (MVR) within multimodal large language models (MLLMs). MIRAGE addresses limitations in current MVR systems by employing a hierarchical approach that better aligns queries with diverse image objects and reduces redundant computations through cross-hierarchy similarity consistency. The system also automates parameter configuration for various datasets, enhancing its practicality. Empirical results indicate that MIRAGE significantly boosts accuracy while reducing computational costs by up to 3.5 times compared to existing MVR systems. AI
IMPACT MIRAGE's efficiency gains could accelerate the development and deployment of more sophisticated multimodal AI applications.