PulseAugur
research · [2 sources]

ScriptHOI framework improves open-vocabulary human-object interaction detection

Researchers have developed ScriptHOI, a framework for open-vocabulary human-object interaction (HOI) detection, a task in which a detector must recognize interaction phrases that may not appear as annotated categories during training. ScriptHOI decomposes each interaction phrase into state slots such as body-role and contact, so that recognition depends on more than simple co-occurrence. A visual state tokenizer and slot-wise matching assess script coverage and conflict, which is reported to improve recognition of rare interactions and reduce false positives. The method also uses interval partial-label learning to handle incomplete annotations.
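
The idea of slot-wise matching can be illustrated with a toy sketch. It is not the paper's implementation: the summary does not specify the visual state tokenizer, the full slot set, or the scoring rule, so the names `Script` and `score_script`, the `motion` slot, and the fraction-based coverage and conflict scores are all assumptions made for illustration.

```python
from dataclasses import dataclass

# Only "body_role" and "contact" are named in the summary;
# "motion" is a hypothetical extra slot added for this example.
SLOTS = ("body_role", "contact", "motion")


@dataclass
class Script:
    """Expected state per slot for one interaction phrase (None = unspecified)."""
    phrase: str
    states: dict


def score_script(script, observed):
    """Return (coverage, conflict) for a script against observed slot states.

    coverage: fraction of the script's specified slots the observation matches.
    conflict: fraction of specified slots the observation contradicts.
    Slots the observation leaves as None count toward neither score.
    """
    specified = [s for s in SLOTS if script.states.get(s) is not None]
    if not specified:
        return 0.0, 0.0
    matched = sum(1 for s in specified if observed.get(s) == script.states[s])
    contradicted = sum(
        1 for s in specified
        if observed.get(s) is not None and observed[s] != script.states[s]
    )
    return matched / len(specified), contradicted / len(specified)


# Hypothetical scripts and a hand-written observation standing in for
# the output of a visual state tokenizer.
ride = Script("ride bicycle", {"body_role": "seated",
                               "contact": "hands_on_object",
                               "motion": "moving"})
hold = Script("hold bicycle", {"body_role": "standing",
                               "contact": "hands_on_object",
                               "motion": None})
observed = {"body_role": "standing", "contact": "hands_on_object", "motion": None}

for script in (ride, hold):
    coverage, conflict = score_script(script, observed)
    print(f"{script.phrase}: coverage={coverage:.2f} conflict={conflict:.2f}")
```

In this toy setup, both phrases co-occur with the same object and the same hand contact, but only "hold bicycle" is fully covered with no conflicting slot, which is the kind of distinction that co-occurrence alone would miss.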

Summary written by gemini-2.5-flash-lite from 2 sources.

IMPACT Enhances the ability of AI systems to understand complex human actions in visual scenes, improving applications like robotics and surveillance.

RANK_REASON This is a research paper detailing a new method for human-object interaction detection.

COVERAGE [2]

  1. arXiv cs.CV TIER_1 · Minh Anh Nguyen, Quang Huy Tran, Bao Ngoc Le, SuiYang Guang, Tuan Kiet Pham, Linh Chi Vo

    ScriptHOI: Learning Scripted State Transitions for Open-Vocabulary Human-Object Interaction Detection

    arXiv:2605.05057v1. Abstract: Open-vocabulary human-object interaction (HOI) detection requires recognizing interaction phrases that may not appear as annotated categories during training. Recent vision-language HOI detectors improve semantic transfer by matchin…

  2. arXiv cs.CV TIER_1 · Linh Chi Vo

    ScriptHOI: Learning Scripted State Transitions for Open-Vocabulary Human-Object Interaction Detection

    Open-vocabulary human-object interaction (HOI) detection requires recognizing interaction phrases that may not appear as annotated categories during training. Recent vision-language HOI detectors improve semantic transfer by matching human-object features with text embeddings, bu…