PulseAugur
实时 04:50:52

New methods improve open-vocabulary object detection robustness and adaptation

Researchers have introduced several new methods to improve open-vocabulary object detection, a field that aims to identify arbitrary objects based on human prompts. One approach, EBOD, integrates a prompt-based detector with feature matching modules to suppress recurring false positives and negatives without retraining. Another method, Reward-Guided Semantic Evolution (RGSE), refines text embeddings at test time using an evolutionary search process to align text and visual embeddings efficiently. Additionally, FACTOR utilizes counterfactual reasoning to adapt models to distribution shifts by perturbing test images and analyzing attribute sensitivity, while DAT offers a lightweight, self-supervised fine-tuning approach to enhance vision-language models for object detection. AI

影响 These advancements in open-vocabulary object detection aim to improve accuracy and robustness, potentially leading to more reliable AI systems in real-world applications.

排序理由 Multiple arXiv papers introduce novel methods for improving open-vocabulary object detection.

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 9 个来源。 我们如何撰写摘要 →

New methods improve open-vocabulary object detection robustness and adaptation

报道来源 [9]

  1. Hugging Face Daily Papers TIER_1 English(EN) ·

    Example-Based Object Detection

    In recent years, object detection has achieved significant progress, especially in the field of open-vocabulary object detection. Unlike traditional methods that rely on predefined categories, open-vocabulary approaches can detect arbitrary objects based on human-provided prompts…

  2. arXiv cs.CV TIER_1 English(EN) · ZhiXin Sun ·

    Example-Based Object Detection

    arXiv:2605.04501v1 Announce Type: new Abstract: In recent years, object detection has achieved significant progress, especially in the field of open-vocabulary object detection. Unlike traditional methods that rely on predefined categories, open-vocabulary approaches can detect a…

  3. arXiv cs.CV TIER_1 English(EN) · Lihua Zhou, Mao Ye, Xiatian Zhu, Nianxin Li, Changyi Ma, Shuaifeng Li, Yitong Qin, Hongbin Liu, Jiebo Luo, Zhen Lei ·

    Reward-Guided Semantic Evolution for Test-time Adaptive Object Detection

    arXiv:2605.04531v1 Announce Type: new Abstract: Open-vocabulary object detection with vision-language models (VLMs) such as Grounding DINO suffers from performance degradation under test-time distribution shifts, primarily due to semantic misalignment between text embeddings and …

  4. arXiv cs.CV TIER_1 English(EN) · Zhen Lei ·

    Reward-Guided Semantic Evolution for Test-time Adaptive Object Detection

    Open-vocabulary object detection with vision-language models (VLMs) such as Grounding DINO suffers from performance degradation under test-time distribution shifts, primarily due to semantic misalignment between text embeddings and shifted visual embeddings of region proposals. W…

  5. arXiv cs.CV TIER_1 English(EN) · ZhiXin Sun ·

    Example-Based Object Detection

    In recent years, object detection has achieved significant progress, especially in the field of open-vocabulary object detection. Unlike traditional methods that rely on predefined categories, open-vocabulary approaches can detect arbitrary objects based on human-provided prompts…

  6. arXiv cs.CV TIER_1 English(EN) · Yazhe Wan (Queen Mary University of London), Changjae Oh (Queen Mary University of London) ·

    The Detector Teaches Itself: Lightweight Self-Supervised Adaptation for Open-Vocabulary Object Detection

    arXiv:2605.03642v1 Announce Type: new Abstract: Open-vocabulary object detection aims to recognize objects from an open set of categories, which leverages vision-language models (VLMs) pre-trained on large-scale image-text data. The cooperative paradigm combines an object detecto…

  7. arXiv cs.CV TIER_1 English(EN) · Kaixiang Zhao, Mao Ye, Lihua Zhou, Hu Wang, Luping Ji, Song Tang, Xiatian Zhu ·

    FACTOR: Counterfactual Training-Free Test-Time Adaptation for Open-Vocabulary Object Detection

    arXiv:2605.03294v1 Announce Type: new Abstract: Open-vocabulary object detection often fails under distribution shifts, as it can be misled by spurious correlations between non-causal visual attributes (e.g., brightness, texture) and object categories. Existing test-time adaptati…

  8. arXiv cs.CV TIER_1 English(EN) · Changjae Oh ·

    The Detector Teaches Itself: Lightweight Self-Supervised Adaptation for Open-Vocabulary Object Detection

    Open-vocabulary object detection aims to recognize objects from an open set of categories, which leverages vision-language models (VLMs) pre-trained on large-scale image-text data. The cooperative paradigm combines an object detector with a VLM to achieve zero-shot recognition of…

  9. arXiv cs.CV TIER_1 English(EN) · Xiatian Zhu ·

    FACTOR: Counterfactual Training-Free Test-Time Adaptation for Open-Vocabulary Object Detection

    Open-vocabulary object detection often fails under distribution shifts, as it can be misled by spurious correlations between non-causal visual attributes (e.g., brightness, texture) and object categories. Existing test-time adaptation (TTA) methods either depend on costly online …