PulseAugur
实时 02:11:43
English(EN) Do LLMs Know What Luxembourgish Borrows? Probing Lexical Neology in Low-Resource Multilingual Models

大型语言模型通过知识图谱提示改进卢森堡语借词检测

研究人员开发了一个新的基准LexNeo-Bench,用于评估大型语言模型对卢森堡语等低资源语言的词汇借用理解程度。该基准源自卢森堡语新闻语料库,将词标记为本地词或从法语、德语或英语借用的词。当使用语言知识图谱进行提示时,大型语言模型在分类借词方面的准确性显著提高,缩小了小型模型和大型模型之间的性能差距。 AI

影响 增强了对低资源语言的大型语言模型评估,有可能改进针对不同语言社区的写作辅助工具。

排序理由 该集群描述了一篇介绍大型语言模型新基准和评估方法的学术论文。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

报道来源 [2]

  1. arXiv cs.CL TIER_1 English(EN) · Nina Hosseini-Kivanani ·

    Do LLMs Know What Luxembourgish Borrows? Probing Lexical Neology in Low-Resource Multilingual Models

    Large language models (LLMs) are increasingly used for writing assistance in small contact languages, yet it is unclear whether they respect community norms around lexical borrowing and neology. We introduce LexNeo-Bench, a 3{,}050-instance token-level benchmark derived from LuxB…

  2. Hugging Face Daily Papers TIER_1 English(EN) ·

    Do LLMs Know What Luxembourgish Borrows? Probing Lexical Neology in Low-Resource Multilingual Models

    Large language models (LLMs) are increasingly used for writing assistance in small contact languages, yet it is unclear whether they respect community norms around lexical borrowing and neology. We introduce LexNeo-Bench, a 3{,}050-instance token-level benchmark derived from LuxB…