新GLCCL方法提升文本-视频检索准确性

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-18 07:17

研究人员开发了一种名为全局-局部对比一致性学习（GLCCL）的新方法，以改进文本-视频检索。该方法使用一个无参数模块，在文本查询的指导下，从视频帧和完整视频中生成语义特征。采用了一种新颖的对比分数一致性损失函数，以增强模型区分相关和不相关视频-文本对的能力，从而在基准数据集上取得卓越的性能。 AI

影响提高了文本-视频检索的语义对齐，可能带来更高效、更准确的搜索能力。

排序理由该集群包含一篇详细介绍文本-视频检索新方法的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.IR (Information Retrieval) TIER_1 English(EN) · Genke Yang · 2026-05-18 07:17

基于全局-局部对比一致性学习的文本-视频检索

Text-video retrieval aims to find the most semantically similar videos with given text queries. However, since videos contain more diverse content than texts, the main semantics expressed by each text-video pair is often partially relevant. The primary methods involve the utiliza…