English(EN) InterPartAbility: Text-Guided Part Matching for Interpretable Person Re-Identification

InterPartAbility 通过部件匹配改进文本引导的行人重识别

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-01 04:00

研究人员开发了InterPartAbility，一种新颖的文本引导行人重识别方法，增强了可解释性。该方法将图像的部件显式地与文本描述进行匹配，实现了短语-区域对齐。一个新的补丁-短语交互模块指导模型关注相关的图像区域，CLIP ViT自注意力被约束以产生与部件级短语对齐的空间集中激活。InterPartAbility在CUHK-PEDES和ICFG-PEDES等基准测试中实现了最先进的可解释性，同时保持了强大的检索准确性。 AI

影响提高了行人重识别任务中视觉-语言模型的解释性。

排序理由该集群包含一篇介绍行人重识别新方法的学术论文。

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CV TIER_1 English(EN) · Shakeeb Murtaza, Aryan Shukla, Rajarshi Bhattacharya, Maguelonne Heritier, Eric Granger · 2026-05-01 04:00

InterPartAbility: Text-Guided Part Matching for Interpretable Person Re-Identification

arXiv:2604.27122v1 Announce Type: new Abstract: Text-to-image person re-identification (TI-ReID) relies on natural-language text description to retrieve top matching individuals from a large gallery of images. While recent large vision-language models (VLMs) achieve strong retrie…

报道来源 [1]

InterPartAbility: Text-Guided Part Matching for Interpretable Person Re-Identification

相关话题