New dataset trains AI to counter online hate speech and misinformation

By PulseAugur Editorial · [1 sources] · 2026-06-18 15:32

Researchers have introduced CATCH-ME, a novel dataset designed to train natural language processing models to generate effective counterspeech against online hate speech and misinformation. This dataset is the first of its kind to address the intersection of these two threats across multiple turns and languages. It includes expert-curated dialogues in five languages, targeting hate speech directed at seven marginalized groups, and is grounded in verified external knowledge for factual accuracy. The dataset is particularly applicable for retrieval-augmented generation (RAG) systems, providing document- and chunk-level annotations to enhance the persuasiveness and factual grounding of generated counterspeech. AI

IMPACT This dataset could significantly improve AI's ability to combat online hate speech and misinformation by enabling more nuanced and factually grounded responses.

RANK_REASON The cluster contains a research paper detailing a new dataset for NLP. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

paper
safety

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New dataset trains AI to counter online hate speech and misinformation

COVERAGE [1]

arXiv cs.CL TIER_1 English(EN) · Marco Guerini · 2026-06-18 15:32

CATCH-ME if you RAG: a dataset of Contextually Annotated multi-Turn Counterspeech against Hate and Misinformation Exchanges

Online hate speech and misinformation frequently overlap, yet NLP research has mainly treated them in isolation. While LLMs represent a scalable solution for assisting humans in the generation of counterspeech for both threats, zero-shot models frequently generate repetitive and …

COVERAGE [1]

CATCH-ME if you RAG: a dataset of Contextually Annotated multi-Turn Counterspeech against Hate and Misinformation Exchanges

RELATED ENTITIES

RELATED TOPICS