Brief · PulseAugur

RESEARCH · arXiv cs.CV English(EN) · 1w · [2 sources]

Mixed-Modality Dual Face-Hair Retrieval

Researchers have introduced Dual Face-Hair Retrieval (DFHR), a novel task for image retrieval that combines identity information from a face image with hairstyle preferences provided as either an image or text. This approach requires sophisticated reasoning across semantically distinct attributes from different data types, necessitating disentangled features and cross-modal alignment. To support this, they have also developed DFHR-Bench, a new benchmark dataset containing over 180,000 annotated triplets, and proposed the Multimodal Face-Hair Combiner (MFHC) framework. AI

IMPACT Establishes a new paradigm for attribute-controllable visual retrieval, potentially impacting personalized search and recommendation systems.

MFHC
DFHR-Bench
Dual Face-Hair Retrieval
arXiv
Multimodal Face-Hair Combiner