Adapting Reinforcement Learning with Chain-of-Thought Supervision for Explainable Detection of Hateful and Propagandistic Memes
Researchers have developed a new method using reinforcement learning and Chain-of-Thought (CoT) supervision to improve the detection and explanation of hateful and propagandistic memes. This approach enhances multimodal large language models (MLLMs) by optimizing for both classification accuracy and the quality of generated explanations. Experiments on English and Arabic benchmarks showed significant improvements in accuracy and provided more balanced per-class performance with natural-language justifications. AI
IMPACT This research offers a novel approach to enhance AI's ability to identify and explain harmful content in memes, potentially improving content moderation systems.