New methods tackle annotator disagreement in hate speech classification

By PulseAugur Editorial · [1 sources] · 2026-06-09 04:00

Researchers have developed new methods to handle disagreements among human annotators when classifying hate speech. Their work explores various aggregation techniques, including majority voting and regression-based approaches, to better utilize the information present in these disagreements. The study demonstrates that discarding samples with non-consensus annotations leads to overly optimistic results, and that modeling annotator disagreement can improve the robustness and reliability of hate speech detection systems, even establishing new state-of-the-art results for Turkish tweets. AI

IMPACT Improves the reliability of AI systems for detecting harmful online content by better modeling human subjectivity.

RANK_REASON The cluster contains an academic paper detailing new methods for a specific AI task. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

paper
safety

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.AI TIER_1 English(EN) · Somaiyeh Dehghan, Mehmet Umut Sen, Berrin Yanikoglu · 2026-06-09 04:00

Dealing with Annotator Disagreement in Hate Speech Classification

arXiv:2502.08266v3 Announce Type: replace-cross Abstract: Hate speech detection is a crucial task, especially on social media where harmful content can spread quickly. Collecting social media content (tweets etc.) to train machine learning models is easy, but detecting and catego…

COVERAGE [1]

Dealing with Annotator Disagreement in Hate Speech Classification

RELATED ENTITIES

RELATED TOPICS