PulseAugur
实时 14:07:54

Researchers critique reliance on proprietary tools for NLP and LLM evaluation

Two new research papers explore advancements and challenges in NLP. One paper introduces ImCoref-CeS, a novel framework that combines a supervised neural method with LLM-based reasoning to improve coreference resolution. The other paper discusses the implications of the Perspective API's closure, highlighting the risks of relying on proprietary tools for toxicity measurement and advocating for independent, reproducible infrastructure. AI

影响 Highlights the need for robust, open-source evaluation tools in NLP and the potential of LLMs to enhance existing NLP tasks.

排序理由 Two distinct research papers published on arXiv, one detailing a new NLP framework and the other analyzing the impact of a tool's deprecation.

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 4 个来源。 我们如何撰写摘要 →

Researchers critique reliance on proprietary tools for NLP and LLM evaluation

报道来源 [4]

  1. arXiv cs.CL TIER_1 English(EN) · Kangyang Luo, Yuzhuo Bai, Shuzheng Si, Cheng Gao, Zhitong Wang, Yingli Shen, Wenhao Li, Zhu Liu, Yufeng Han, Jiayi Wu, Cunliang Kong, Maosong Sun ·

    ImCoref-CeS: An Improved Lightweight Pipeline for Coreference Resolution with LLM-based Checker-Splitter Refinement

    arXiv:2510.10241v2 Announce Type: replace Abstract: Coreference Resolution (CR) is a critical task in Natural Language Processing (NLP). Current research faces a key dilemma: whether to further explore the potential of supervised neural methods based on small language models, who…

  2. arXiv cs.CL TIER_1 English(EN) · David Hartmann, Manuel Tonneau, Angelie Kraft, LK Seiling, Dimitri Staufer, Pieter Delobelle, Jan Fillies, Anna Ricarda Luther, Jan Batzner, Mareike Lisker ·

    Bye Bye Perspective API: Lessons for Measurement Infrastructure in NLP, CSS and LLM Evaluation

    arXiv:2604.25580v1 Announce Type: new Abstract: The closure of Perspective API at the end of 2026 discards what has functioned as the de facto standard for automated toxicity measurement in NLP, CSS, and LLM evaluation research. We document the structural dependence that the comm…

  3. arXiv cs.CL TIER_1 English(EN) · Mareike Lisker ·

    Bye Bye Perspective API: Lessons for Measurement Infrastructure in NLP, CSS and LLM Evaluation

    The closure of Perspective API at the end of 2026 discards what has functioned as the de facto standard for automated toxicity measurement in NLP, CSS, and LLM evaluation research. We document the structural dependence that the communities built on this single proprietary tool an…

  4. Hugging Face Daily Papers TIER_1 English(EN) ·

    Bye Bye Perspective API: Lessons for Measurement Infrastructure in NLP, CSS and LLM Evaluation

    The closure of Perspective API at the end of 2026 discards what has functioned as the de facto standard for automated toxicity measurement in NLP, CSS, and LLM evaluation research. We document the structural dependence that the communities built on this single proprietary tool an…