Researchers have introduced ForMaT, a new dataset designed to improve visually grounded multilingual PDF translation. The dataset comprises 3,956 PDFs across 15 language pairs and preserves the original layout metadata to capture complex elements such as tables and formulas. Current machine translation systems show significant weaknesses in maintaining the link between text and its visual context. This highlights the need for layout-aware models that integrate visual and textual information for accurate document reconstruction.
Impact: This dataset aims to improve machine translation systems' ability to handle complex document layouts, potentially leading to more accurate and context-aware translations of visually rich documents.
Ranking rationale: The cluster describes the release of a new academic dataset for a specific NLP task.
AI-generated summary · Google Gemini · from 1 source.