Researchers have developed FreeRet, a framework that lets multimodal large language models (MLLMs) serve as effective retrievers without any additional training. The plug-and-play system first extracts semantically grounded embeddings from off-the-shelf MLLMs for an initial candidate search, then uses the same models' reasoning capabilities to rerank those candidates precisely. On the MMEB and MMEB-V2 benchmarks, FreeRet significantly outperforms models trained on millions of pairs, showing its potential to unify retrieval, reranking, and generation within a single model.
Summary written by gemini-2.5-flash-lite from 1 source.
IMPACT Enables MLLMs to act as powerful, training-free retrievers, potentially simplifying RAG systems and enhancing multimodal search capabilities.
RANK_REASON This is a research paper describing a new framework for MLLMs.
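The two-stage flow described in the summary (embedding-based candidate search followed by reranking) can be illustrated with a minimal sketch. This is not FreeRet's actual implementation: the vectors are toy stand-ins for MLLM embeddings, and `rerank_score` is a hypothetical placeholder for whatever relevance judgment the MLLM would produce. The names `cosine`, `retrieve_then_rerank`, and `top_k` are assumptions made for illustration only.

```python
import math
from typing import Callable, Dict, List, Sequence

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve_then_rerank(
    query_vec: Vector,
    candidates: Dict[str, Vector],
    rerank_score: Callable[[str], float],
    top_k: int = 3,
) -> List[str]:
    """Stage 1: shortlist by embedding similarity. Stage 2: reorder the shortlist."""
    # Stage 1: cheap similarity search over the whole corpus.
    shortlist = sorted(
        candidates,
        key=lambda cid: cosine(query_vec, candidates[cid]),
        reverse=True,
    )[:top_k]
    # Stage 2: a more expensive scorer (hypothetical here) sees only the shortlist.
    return sorted(shortlist, key=rerank_score, reverse=True)


# Toy data: 2-D vectors standing in for MLLM embeddings (assumed values).
corpus = {
    "doc_a": [1.0, 0.0],
    "doc_b": [0.9, 0.1],
    "doc_c": [0.0, 1.0],
    "doc_d": [0.7, 0.7],
}
query = [1.0, 0.0]

# Hypothetical reranker scores; a real system would query the model instead.
toy_scores = {"doc_a": 0.2, "doc_b": 0.9, "doc_c": 1.0, "doc_d": 0.5}

result = retrieve_then_rerank(query, corpus, lambda cid: toy_scores[cid], top_k=2)
print(result)
```

In this toy run, `doc_c` has the highest reranker score but never reaches the second stage because it is not in the similarity shortlist, which is why the quality of the first-stage embeddings matters in any retrieve-then-rerank design.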