Brief

last 24h

[2/2] 222 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · arXiv cs.CL English(EN) · 23h

The Ghost Annotator: a Framework to Explore Human Label Variation in Content Moderation through Conformal Prediction

Researchers have developed a new framework called the Ghost Annotator to analyze human label variation in content moderation tasks, particularly when LLMs are used for annotation. This framework combines conformal prediction with collaborative filtering to model LLM behavior against human annotators, identifying instances where model predictions diverge from human consensus. The study found that larger LLMs tend to be more confident in classifying content that doesn't align with any human annotation, and revealed a consistent pattern of demographic misalignment, suggesting biases in pretraining data. AI

IMPACT This framework could help identify and mitigate biases in LLMs used for content moderation, leading to fairer and more reliable AI systems.
RESEARCH · arXiv cs.AI English(EN) · 2d · [2 sources]

SynCred-Bench: Benchmarking Synthetic Credibility in AI-Generated Visual Misinformation

Researchers have developed SynCred-Bench, a new benchmark designed to evaluate the detection of AI-generated visual misinformation that mimics credible sources. The benchmark includes 600 AI-generated images and a set of real images to test for false positives. Evaluations show that current AI detection systems, including large language models and open-source tools, perform poorly, with even human annotators struggling to identify this type of synthetic credibility. AI

IMPACT Highlights significant gaps in AI detection capabilities for sophisticated visual misinformation, necessitating further research and development in this area.

Brief

The Ghost Annotator: a Framework to Explore Human Label Variation in Content Moderation through Conformal Prediction

SynCred-Bench: Benchmarking Synthetic Credibility in AI-Generated Visual Misinformation