Researchers have developed a new framework to improve open-vocabulary object detection (OVD), a technique that allows AI models to identify objects beyond their training data. The proposed method addresses inaccuracies in pseudo-labeling by using hierarchical confidence calibration to ensure reliable class assignments across different semantic levels. Additionally, a new adaptation of CLIP called LoCLIP incorporates an objectness token to reduce bias towards known classes and provide more dependable objectness estimations. Experiments on benchmarks like COCO and LVIS show this approach achieves state-of-the-art performance. AI
影响 Advances open-vocabulary object detection capabilities, potentially improving AI's ability to identify novel objects in diverse datasets.
排序理由 This is a research paper introducing a novel framework and model adaptation for open-vocabulary object detection.
AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →