PulseAugur

NeurIPS used uncalibrated AI detector for desk rejections, researcher says

A researcher's submission to the NeurIPS 2026 Position Paper Track was desk-rejected for an alleged AI-policy violation, based on the output of a proprietary AI detector called Pangram. The researcher argues that the detector is uncalibrated and can produce false positives, which makes it an unreliable basis for such decisions. To illustrate, the researcher ran Pangram on papers authored by the NeurIPS Position Paper Track Chairs and obtained scores of between 24% and 69% AI-generated text, which they say does not necessarily indicate AI authorship.

IMPACT Raises concerns about the reliability of AI detection tools in academic integrity and policy enforcement.

RANK_REASON The cluster discusses a methodological issue with an AI detector used in a conference's paper review process, which is a research-adjacent topic.

Read on r/MachineLearning →

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

  1. r/MachineLearning TIER_1 English (EN) · /u/Asleep-Requirement13

    NeurIPS used uncalibrated AI detector for desk rejections [D]

    I recently had a submission desk-rejected from the NeurIPS 2026 Position Paper Track for an alleged AI-policy violation. After corresponding with the track leadership and reading their public blog post, I think the broader methodological issue is…