PulseAugur
LIVE 19:31:02
research · [2 sources] ·

LLMs improve Luxembourgish borrowing detection with knowledge graph prompts

Researchers have developed a new benchmark, LexNeo-Bench, to evaluate how well large language models understand lexical borrowing in low-resource languages like Luxembourgish. The benchmark, derived from a Luxembourgish news corpus, labels tokens as native or borrowed from French, German, or English. When prompted with a linguistic knowledge graph, LLMs showed significantly improved accuracy in classifying borrowed words, narrowing the performance gap between smaller and larger models. AI

Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →

IMPACT Enhances LLM evaluation for low-resource languages, potentially improving writing assistance tools for diverse linguistic communities.

RANK_REASON The cluster describes an academic paper introducing a new benchmark and evaluation methodology for LLMs.

Read on arXiv cs.CL →

COVERAGE [2]

  1. arXiv cs.CL TIER_1 · Nina Hosseini-Kivanani ·

    Do LLMs Know What Luxembourgish Borrows? Probing Lexical Neology in Low-Resource Multilingual Models

    Large language models (LLMs) are increasingly used for writing assistance in small contact languages, yet it is unclear whether they respect community norms around lexical borrowing and neology. We introduce LexNeo-Bench, a 3{,}050-instance token-level benchmark derived from LuxB…

  2. Hugging Face Daily Papers TIER_1 ·

    Do LLMs Know What Luxembourgish Borrows? Probing Lexical Neology in Low-Resource Multilingual Models

    Large language models (LLMs) are increasingly used for writing assistance in small contact languages, yet it is unclear whether they respect community norms around lexical borrowing and neology. We introduce LexNeo-Bench, a 3{,}050-instance token-level benchmark derived from LuxB…