PulseAugur
EN
LIVE 19:55:55

FashionLens uses LLMs for versatile fashion image retrieval

Researchers have developed FashionLens, a unified framework for versatile fashion image retrieval using Multimodal Large Language Models. This system addresses the limitations of existing approaches by supporting diverse query formats and search intentions. To achieve this, FashionLens incorporates a Proposal-Guided Spherical Query Calibrator for task-aligned metric spaces and a Gradient-Guided Adaptive Sampling strategy to balance optimization across varying task complexities. The framework demonstrates state-of-the-art performance on the new U-FIRE benchmark, which consolidates fragmented fashion datasets. AI

IMPACT This framework could significantly improve e-commerce search by enabling more nuanced and diverse fashion image retrieval.

RANK_REASON The cluster contains an academic paper detailing a new framework and benchmark for fashion image retrieval.

Read on Hugging Face Daily Papers →

AI-generated summary · Google Gemini · from 3 sources. How we write summaries →

COVERAGE [3]

  1. Hugging Face Daily Papers TIER_1 English(EN) ·

    FashionLens: Toward Versatile Fashion Image Retrieval via Task-Adaptive Learning

    A unified fashion image retrieval framework is proposed that handles diverse query formats and search intentions through multimodal large language models with adaptive calibration and sampling strategies.

  2. arXiv cs.CV TIER_1 English(EN) · Haokun Wen, Xuemeng Song, Xinghao Xie, Xiaolin Chen, Xiangyu Zhao, Weili Guan ·

    FashionLens: Toward Versatile Fashion Image Retrieval via Task-Adaptive Learning

    arXiv:2605.22552v1 Announce Type: new Abstract: Fashion image retrieval is a cornerstone of modern e-commerce systems. A unified framework that supports diverse query formats and search intentions is highly desired in practice. However, existing approaches focus on narrow retriev…

  3. arXiv cs.CV TIER_1 English(EN) · Weili Guan ·

    FashionLens: Toward Versatile Fashion Image Retrieval via Task-Adaptive Learning

    Fashion image retrieval is a cornerstone of modern e-commerce systems. A unified framework that supports diverse query formats and search intentions is highly desired in practice. However, existing approaches focus on narrow retrieval tasks and do not fully capture such diversity…