PulseAugur
EN
LIVE 09:08:03

New methods tackle annotator disagreement in hate speech classification

Researchers have developed new methods to handle disagreements among human annotators when classifying hate speech. Their work explores various aggregation techniques, including majority voting and regression-based approaches, to better utilize the information present in these disagreements. The study demonstrates that discarding samples with non-consensus annotations leads to overly optimistic results, and that modeling annotator disagreement can improve the robustness and reliability of hate speech detection systems, even establishing new state-of-the-art results for Turkish tweets. AI

IMPACT Improves the reliability of AI systems for detecting harmful online content by better modeling human subjectivity.

RANK_REASON The cluster contains an academic paper detailing new methods for a specific AI task. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.AI TIER_1 English(EN) · Somaiyeh Dehghan, Mehmet Umut Sen, Berrin Yanikoglu ·

    Dealing with Annotator Disagreement in Hate Speech Classification

    arXiv:2502.08266v3 Announce Type: replace-cross Abstract: Hate speech detection is a crucial task, especially on social media where harmful content can spread quickly. Collecting social media content (tweets etc.) to train machine learning models is easy, but detecting and catego…