Brief · PulseAugur

TOOL · arXiv cs.AI English(EN) · 8h

Adapting Reinforcement Learning with Chain-of-Thought Supervision for Explainable Detection of Hateful and Propagandistic Memes

Researchers have developed a new method using reinforcement learning and Chain-of-Thought (CoT) supervision to improve the detection and explanation of hateful and propagandistic memes. This approach enhances multimodal large language models (MLLMs) by optimizing for both classification accuracy and the quality of generated explanations. Experiments on English and Arabic benchmarks showed significant improvements in accuracy and provided more balanced per-class performance with natural-language justifications. AI

IMPACT This research offers a novel approach to enhance AI's ability to identify and explain harmful content in memes, potentially improving content moderation systems.

reinforcement learning
Arabic
English
Multimodal Large Language Models and Tunings: Vision, Language, Sensors, Audio, and Beyond
Group Relative Policy Optimization
Hateful and Propagandistic Memes
Hateful Memes
Mohamed Bayan Kmainasi