PulseAugur
EN
LIVE 15:24:12

Small language models outperform LLMs in multilingual citation detection

Researchers have developed a new multilingual corpus, MCN, to address citation needed detection (CND) for lower-resource languages on Wikipedia. Their study demonstrates that small language models (SLMs) fine-tuned with an encoder-style objective outperform larger language models (LLMs) on this task. Notably, SLMs trained solely on English data showed strong cross-lingual performance, suggesting that compact, specialized models are more suitable than LLMs for CND in resource-constrained environments. AI

IMPACT Provides a more accessible and effective approach to fact-checking for lower-resource language communities, potentially improving information quality on platforms like Wikipedia.

RANK_REASON The cluster contains an academic paper detailing a new corpus and experimental findings on language models.

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

Small language models outperform LLMs in multilingual citation detection

COVERAGE [2]

  1. arXiv cs.CL TIER_1 English(EN) · Gerrit Quaremba, Amy Rechkemmer, Elizabeth Black, Denny Vrande\v{c}i\'c, Elena Simperl ·

    Multilingual and Cross-Lingual Citation Needed Detection on Wikipedia for Lower-Resource Languages

    arXiv:2605.31136v1 Announce Type: new Abstract: In automated fact-checking (AFC), check-worthiness detection identifies claims requiring verification based on domain-specific criteria. On Wikipedia, this task instantiates as Citation Needed Detection (CND), which flags claims lac…

  2. arXiv cs.CL TIER_1 English(EN) · Elena Simperl ·

    Multilingual and Cross-Lingual Citation Needed Detection on Wikipedia for Lower-Resource Languages

    In automated fact-checking (AFC), check-worthiness detection identifies claims requiring verification based on domain-specific criteria. On Wikipedia, this task instantiates as Citation Needed Detection (CND), which flags claims lacking supporting citations. However, existing res…