PulseAugur
EN
LIVE 13:23:01

AI framework enhances hateful video detection with explainable rationales

Researchers have developed a new framework called IARE to improve the explainability of AI models detecting hateful videos. This framework aims to provide contextual rationales and logical reasoning alongside detection decisions, moving beyond simple binary classification. IARE utilizes multimodal chain-of-thought and Direct Preference Optimization to enhance the integration of harmful elements and the coherence of justifications. Experiments on two new datasets, Ex-HateMM and Ex-ImpliHateVid, show that IARE achieves state-of-the-art performance in both detection accuracy and rationale generation. AI

IMPACT Improves AI's ability to explain decisions in content moderation, potentially leading to more trustworthy and transparent systems.

RANK_REASON The cluster contains an academic paper detailing a new AI framework and datasets for a specific research problem.

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

COVERAGE [2]

  1. arXiv cs.CL TIER_1 English(EN) · Junyu Lu, Deyi Ji, Liqun Liu, Xiaokun Zhang, Youlin Wu, Roy Ka-Wei Lee, Peng Shu, Huan Yu, Jie Jiang, Bo Xu, Liang Yang, Hongfei Lin ·

    Decoding Multimodal Cues: Unveiling the Implicit Meaning Behind Hateful Videos

    arXiv:2606.11953v1 Announce Type: new Abstract: Hateful videos have become prevalent on online platforms, highlighting an urgent need for effective detection. However, existing studies primarily focus on binary classification and fail to provide contextual rationales that reveal …

  2. arXiv cs.CL TIER_1 English(EN) · Hongfei Lin ·

    Decoding Multimodal Cues: Unveiling the Implicit Meaning Behind Hateful Videos

    Hateful videos have become prevalent on online platforms, highlighting an urgent need for effective detection. However, existing studies primarily focus on binary classification and fail to provide contextual rationales that reveal the implicit meanings behind these judgments, si…