PulseAugur

NeurIPS reviewers warned of LLM prompt injection attacks

NeurIPS reciprocal reviewers are being cautioned about prompt injection when they use LLMs to evaluate submissions. A Reddit user, who is not a reciprocal reviewer, reported noticing a clever prompt injection in their own submission, similar to the one used at ICML. The report adds to growing concern about the integrity of AI-assisted academic reviews.

IMPACT Highlights potential vulnerabilities in AI-assisted academic review processes, necessitating new safeguards.

RANK_REASON The cluster discusses a safety concern related to academic paper reviews at a research conference, involving potential misuse of LLMs.

Read on r/MachineLearning →

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

  1. r/MachineLearning · TIER_1 · English (EN) · /u/Massive-Bobcat-5363

    NeurIPS Reciprocal Reviewers be careful in reviewing with LLMs [D]

    As the title says. I am not a reciprocal reviewer but I just noticed a clever prompt injection like they did in ICML for our submission.

    submitted by /u/Massive-Bobcat-5363 …