NLP research often omits key human annotation details, study finds

By PulseAugur Editorial · [1 sources] · 2026-06-02 04:00

A new study published on arXiv assesses the reporting of human annotation practices in Natural Language Processing (NLP) research from 2018 to 2025. The research found that while operational details like recruitment and annotation volume are frequently documented, crucial information for assessing validity, such as annotator training, compensation, and agreement metrics, is often omitted. The study introduces a framework and recommendations to improve the reliability and interpretability of human annotation in NLP. AI

IMPACT Highlights critical gaps in reporting for NLP research, potentially impacting the reliability and reproducibility of AI models trained on human-annotated data.

RANK_REASON Academic paper analyzing reporting practices in a research field. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

arXiv
NLP

paper
other

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.AI TIER_1 English(EN) · Maria Kunilovskaya, Gagan Bhatia, Lisa Sophie Albertelli, Yanran Chen, Christian Greisinger, Lotta Kiefer, Christoph Leiter, Subhadeep Roy, Tewodros Achamaleh, Muhammad Arslan Manzoor, Sebastian Pohl, Yufang Hou, Steffen Eger · 2026-06-02 04:00

Who Annotates in NLP? A Large-scale Assessment of Human Annotation Reporting between 2018 and 2025

arXiv:2606.02255v1 Announce Type: cross Abstract: Human annotation is the empirical foundation of much NLP research, from dataset construction to model evaluation, but papers often leave unclear who produced the annotations and how the annotation process was controlled. We provid…

COVERAGE [1]

Who Annotates in NLP? A Large-scale Assessment of Human Annotation Reporting between 2018 and 2025

RELATED ENTITIES

RELATED TOPICS