English(EN) The Detector Teaches Itself: Lightweight Self-Supervised Adaptation for Open-Vocabulary Object Detection

新方法提升开放词汇目标检测的鲁棒性和自适应能力

作者 PulseAugur 编辑部 · [9 个来源] · 2026-05-05 02:31

研究人员引入了几种新方法来改进开放词汇目标检测，该领域旨在根据人类提示识别任意目标。一种方法 EBOD 将基于提示的检测器与特征匹配模块集成，无需重新训练即可抑制重复出现的假阳性和假阴性。另一种方法 RGSE 在测试时使用进化搜索过程来精炼文本嵌入，以有效地对齐文本和视觉嵌入。此外，FACTOR 利用反事实推理，通过扰动测试图像并分析属性敏感性来使模型适应分布变化，而 DAT 提供了一种轻量级的自监督微调方法来增强用于目标检测的视觉语言模型。 AI

影响开放词汇目标检测的这些进展旨在提高准确性和鲁棒性，可能导致现实世界应用中更可靠的 AI 系统。

排序理由多篇 arXiv 论文引入了改进开放词汇目标检测的新颖方法。

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 9 个来源。我们如何撰写摘要 →

报道来源 [9]

Hugging Face Daily Papers TIER_1 English(EN) · 2026-05-06 05:10

Example-Based Object Detection

In recent years, object detection has achieved significant progress, especially in the field of open-vocabulary object detection. Unlike traditional methods that rely on predefined categories, open-vocabulary approaches can detect arbitrary objects based on human-provided prompts…
arXiv cs.CV TIER_1 English(EN) · ZhiXin Sun · 2026-05-07 04:00

Example-Based Object Detection

arXiv:2605.04501v1 Announce Type: new Abstract: In recent years, object detection has achieved significant progress, especially in the field of open-vocabulary object detection. Unlike traditional methods that rely on predefined categories, open-vocabulary approaches can detect a…
arXiv cs.CV TIER_1 English(EN) · Lihua Zhou, Mao Ye, Xiatian Zhu, Nianxin Li, Changyi Ma, Shuaifeng Li, Yitong Qin, Hongbin Liu, Jiebo Luo, Zhen Lei · 2026-05-07 04:00

Reward-Guided Semantic Evolution for Test-time Adaptive Object Detection

arXiv:2605.04531v1 Announce Type: new Abstract: Open-vocabulary object detection with vision-language models (VLMs) such as Grounding DINO suffers from performance degradation under test-time distribution shifts, primarily due to semantic misalignment between text embeddings and …
arXiv cs.CV TIER_1 English(EN) · Zhen Lei · 2026-05-06 06:17

Reward-Guided Semantic Evolution for Test-time Adaptive Object Detection

Open-vocabulary object detection with vision-language models (VLMs) such as Grounding DINO suffers from performance degradation under test-time distribution shifts, primarily due to semantic misalignment between text embeddings and shifted visual embeddings of region proposals. W…
arXiv cs.CV TIER_1 English(EN) · ZhiXin Sun · 2026-05-06 05:10

Example-Based Object Detection

In recent years, object detection has achieved significant progress, especially in the field of open-vocabulary object detection. Unlike traditional methods that rely on predefined categories, open-vocabulary approaches can detect arbitrary objects based on human-provided prompts…
arXiv cs.CV TIER_1 English(EN) · Yazhe Wan (Queen Mary University of London), Changjae Oh (Queen Mary University of London) · 2026-05-06 04:00

The Detector Teaches Itself: Lightweight Self-Supervised Adaptation for Open-Vocabulary Object Detection

arXiv:2605.03642v1 Announce Type: new Abstract: Open-vocabulary object detection aims to recognize objects from an open set of categories, which leverages vision-language models (VLMs) pre-trained on large-scale image-text data. The cooperative paradigm combines an object detecto…
arXiv cs.CV TIER_1 English(EN) · Kaixiang Zhao, Mao Ye, Lihua Zhou, Hu Wang, Luping Ji, Song Tang, Xiatian Zhu · 2026-05-06 04:00

FACTOR: Counterfactual Training-Free Test-Time Adaptation for Open-Vocabulary Object Detection

arXiv:2605.03294v1 Announce Type: new Abstract: Open-vocabulary object detection often fails under distribution shifts, as it can be misled by spurious correlations between non-causal visual attributes (e.g., brightness, texture) and object categories. Existing test-time adaptati…
arXiv cs.CV TIER_1 English(EN) · Changjae Oh · 2026-05-05 11:14

The Detector Teaches Itself: Lightweight Self-Supervised Adaptation for Open-Vocabulary Object Detection

Open-vocabulary object detection aims to recognize objects from an open set of categories, which leverages vision-language models (VLMs) pre-trained on large-scale image-text data. The cooperative paradigm combines an object detector with a VLM to achieve zero-shot recognition of…
arXiv cs.CV TIER_1 English(EN) · Xiatian Zhu · 2026-05-05 02:31

FACTOR: Counterfactual Training-Free Test-Time Adaptation for Open-Vocabulary Object Detection

Open-vocabulary object detection often fails under distribution shifts, as it can be misled by spurious correlations between non-causal visual attributes (e.g., brightness, texture) and object categories. Existing test-time adaptation (TTA) methods either depend on costly online …

报道来源 [9]

相关实体

相关话题