RATS！新型Transformer架构在视觉模型中发现物体部件

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-12 17:59

研究人员推出了RATS（Register Attention Transformers），一种新颖的自监督视觉模型架构，旨在发现类似于人类物体部件识别的组合结构。RATS利用可学习的寄存器令牌，通过瓶颈路由图像块信息，寄存器无需显式部件标注即可专门化为原型语义区域。该方法在分割基准测试中表现出色，平均比基线模型高出+12 mIoU，并在ADE20K和COCO等数据集上显示出持续的提升。 AI

影响引入了一种新颖的结构化和可解释的视觉表示学习架构先验，有望改进物体识别和分割。

排序理由该集群包含一篇详细介绍新模型架构的研究论文。

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CV TIER_1 Deutsch(DE) · Timing Yang, Predrag Neskovic, Jansen Seheult, Wenchao Han, Anand Bhattad, Alan Yuille, Feng Wang · 2026-06-15 04:00

RATS! Patches Talk Through Registers: Emergent Parts in Register Attention Transformers

arXiv:2606.14701v1 Announce Type: new Abstract: When humans see a bird, they recognize far more than just "bird" -- they see a head, wings, and talons, a structured assembly of reusable parts that can be identified across every bird they have ever seen. We ask whether a self-supe…
arXiv cs.CV TIER_1 Deutsch(DE) · Feng Wang · 2026-06-12 17:59

RATS! Patches Talk Through Registers: Emergent Parts in Register Attention Transformers

When humans see a bird, they recognize far more than just "bird" -- they see a head, wings, and talons, a structured assembly of reusable parts that can be identified across every bird they have ever seen. We ask whether a self-supervised visual model can discover the same compos…

报道来源 [2]

RATS! Patches Talk Through Registers: Emergent Parts in Register Attention Transformers

RATS! Patches Talk Through Registers: Emergent Parts in Register Attention Transformers

相关实体

相关话题