PulseAugur
EN
LIVE 05:10:07

New benchmark ZeroSight tests zero-shot image retrieval

Researchers have introduced ZeroSight, a new benchmark designed to evaluate Zero-Shot Composed Image Retrieval (ZS-CIR) capabilities more rigorously. Existing datasets often use images that models have already been trained on, compromising the zero-shot premise, and lack consistent relationships between reference and target images. ZeroSight utilizes video-sourced data and LLM-generated captions to ensure true zero-shot conditions and consistent pairs, while also proposing a new method called SC4CIR to improve performance by identifying hard negative targets. AI

IMPACT Establishes a more rigorous evaluation for zero-shot image retrieval, potentially leading to more robust multimodal models.

RANK_REASON The cluster contains a research paper introducing a new benchmark and method for a specific AI task. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CV →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.CV TIER_1 English(EN) · Changsheng Xu ·

    Never Seen Before: Benchmarking Genuine Zero-Shot Composed Image Retrieval with Consistent Video-Sourced Datasets

    Zero-Shot Composed Image Retrieval (ZS-CIR) aims to retrieve a target image based on a query composed of a reference image and a relative caption without training samples. Existing ZS-CIR datasets often suffer from complete irrelevance between reference and target images due to n…