PulseAugur
实时 15:52:25

New dataset and benchmark advance Bangla text-to-gloss translation for BdSL

Researchers have developed the first dataset and benchmark for Bangla text-to-gloss translation, addressing a significant gap for the Bangla Sign Language (BdSL) community. The dataset includes manually annotated and synthetically generated sentence-gloss pairs, designed to aid low-resource translation efforts. Experiments showed that GPT-5.4 performed best overall, while a fine-tuned mBART model offered competitive results with a smaller size, and Qwen-3 excelled in human evaluations. AI

影响 Introduces a new dataset and benchmark for Bangla text-to-gloss translation, potentially improving accessibility for the deaf and hard-of-hearing community in Bangladesh.

排序理由 This is a research paper introducing a new dataset and benchmark for a specific NLP task. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →

New dataset and benchmark advance Bangla text-to-gloss translation for BdSL

报道来源 [1]

  1. arXiv cs.CL TIER_1 English(EN) · Sharif Mohammad Abdullah, Abhijit Paul, Shubhashis Roy Dipta, Zarif Masud, Shebuti Rayana, Ahmedul Kabir ·

    Breaking the Silence: A Dataset and Benchmark for Bangla Text-to-Gloss Translation

    arXiv:2504.02293v3 Announce Type: replace Abstract: Gloss is a written approximation that bridges Sign Language (SL) and its corresponding spoken language. Despite a deaf and hard-of-hearing population of at least 3 million in Bangladesh, Bangla Sign Language (BdSL) remains large…