Marathi
PulseAugur coverage of Marathi — every cluster mentioning Marathi across labs, papers, and developer communities, ranked by signal.
3 day(s) with sentiment data
-
New Marathi POS Tagging Dataset and BERT Models Released
Researchers have introduced L3Cube-MahaPOS, a new dataset for Marathi Part-of-Speech (POS) tagging, addressing the scarcity of annotated resources for the language. The dataset contains over 32,000 manually annotated se…
-
New N-VSSM Model Outperforms Claude Opus 4.5 in Long-Form Narrative Consistency
Researchers have developed NarrativeWorldBench, a new benchmark designed to evaluate large language models (LLMs) on their ability to maintain narrative consistency in long-form audio dramas. Current frontier LLMs strug…
-
New dataset AgriGov boosts AI for Indian farmers
Researchers have developed AgriGov, a new multilingual dataset aimed at improving AI tools for Indian farmers. The dataset focuses on government schemes and welfare policies, initially covering 50 schemes across English…
-
New Marathi dataset BhashaSetu boosts low-resource translation quality
Researchers have introduced BhashaSetu, a new dataset designed to improve low-resource machine translation for Marathi. The dataset contains 2.78 million sentence pairs across various domains, including stemmed and lemm…
-
LLMs improve multilingual speech correction by tuning for fluency
Researchers have developed a new method for correcting disfluencies in multilingual speech transcripts using large language models (LLMs). The pipeline first identifies disfluent tokens and then uses these signals to fi…