PulseAugur

DuDi framework boosts small language models' multilingual abilities

Researchers have developed DuDi, a dual-signal distillation framework for improving the multilingual capabilities of small language models (SLMs). The method combines sequence-level and token-level signals and uses a cross-lingual verbalizer to refine teacher feedback. According to the paper, DuDi improves performance on Southeast Asian languages and outperforms existing distillation techniques across model scales and families.

Impact: Enhances multilingual capabilities of small language models, potentially improving accessibility and performance for under-resourced languages.

Ranking rationale: This is a research paper describing a new method for improving language models.


AI-generated summary · Google Gemini · from 1 source.

Sources [1]

  1. arXiv cs.CL · TIER_1 · English (EN) · Patomporn Payoungkhamdee, Tinnakit Udsa, Jian Gang Ngui, Sarana Nutanong, Alham Fikri Aji, Peerat Limkonchotiwat

    DuDi: Dual-Signal Distillation with Cross-Lingual Verbalizer

    arXiv:2606.04694v1 Announce Type: new Abstract: Small language models (SLMs) are efficient and scalable, but their multilingual capabilities degrade severely at sub-billion scales, especially for Southeast Asian (SEA) languages. We introduce DuDi, a dual-signal multilingual disti…