Brief · PulseAugur

TOOL · arXiv cs.IR (Information Retrieval) English(EN) · 2w

Text-Video Retrieval With Global-Local Contrastive Consistency Learning

Researchers have developed a new method called Global-Local Contrastive Consistency Learning (GLCCL) to improve text-video retrieval. This approach uses a parameter-free module to generate semantic features from video frames and full videos, guided by text queries. A novel Contrastive Score Consistency loss function is employed to enhance the model's ability to distinguish between relevant and irrelevant video-text pairs, leading to superior performance on benchmark datasets. AI

IMPACT Improves semantic alignment for text-video retrieval, potentially leading to more efficient and accurate search capabilities.

Global-Local Contrastive Consistency Learning (GLCCL)
MSR-VTT
DiDeMo
VATEX