Researchers have introduced ToxiREX, a new multilingual dataset designed to capture implicit and context-dependent toxicity in online conversations. The dataset comprises Reddit comment threads, annotated using a structured toxic reasoning schema, and includes content in six languages. ToxiREX aims to provide a more nuanced understanding of toxicity by considering conversational context, a feature not present in previous datasets. Initial experiments show that while language models can perform better than random chance on this task, significant improvements are still needed.
AI IMPACT This dataset could improve LLM safety by enabling better detection of nuanced and context-dependent toxic language.
RANK_REASON The cluster describes a new academic dataset and associated research paper.
AI-generated summary · Google Gemini · from 2 sources.