mBART
PulseAugur coverage of mBART — every cluster mentioning mBART across labs, papers, and developer communities, ranked by signal.
-
Direct sign language translation model developed using synthetic data
Researchers have developed a novel method for direct translation between different sign languages, addressing a gap in current sign language technology. Their approach utilizes back-translation to create synthetic paral…
-
Encoder-decoder transformers advance constituent parsing accuracy
Researchers have explored the use of pre-trained encoder-decoder transformer models for syntactic constituent parsing, a key task for natural language understanding. Their work extends existing sequence-to-sequence appr…
-
New dataset and benchmark advance Bangla text-to-gloss translation for BdSL
Researchers have developed the first dataset and benchmark for Bangla text-to-gloss translation, addressing a significant gap for the Bangla Sign Language (BdSL) community. The dataset includes manually annotated and sy…
-
New study benchmarks machine transliteration models for Tajik-Farsi languages
This paper introduces a new benchmark for machine transliteration between Tajik and Farsi, developing a unique parallel corpus from diverse sources. The study compares six model architectures, including rule-based syste…
-
CRAFT method speeds up training data selection for sequence-to-sequence models
Researchers have developed a new method called CRAFT (Clustered Regression for Adaptive Filtering of Training data) to efficiently select high-quality subsets of training data for sequence-to-sequence models. This appro…