PulseAugur

New dataset AgriGov boosts AI for Indian farmers

Researchers have developed AgriGov, a multilingual dataset intended to improve AI tools for Indian farmers. It covers government schemes and welfare policies, starting with 50 schemes in English, Hindi, and Marathi. The data was collected by automated scraping and translated through a pipeline that combines Google Translate, MarianMT, and human post-editing, yielding approximately 8,000 parallel sentence pairs.

IMPACT Enhances AI capabilities for domain-specific machine translation and information retrieval relevant to agricultural policy.

RANK_REASON The cluster contains an academic paper detailing a new dataset.


AI-generated summary · Google Gemini · from 2 sources.

COVERAGE [2]

  1. arXiv cs.AI TIER_1 English(EN) · Mohsina Bilal, Gopakumar G

    AgriGov: A Structured Multilingual Dataset Curation for Indian Government Schemes for Farmers

    arXiv:2606.08272v1 · Announce Type: cross · Abstract: AgriGov is a curated, trilingual (English-Hindi-Marathi) dataset designed to address the scarcity of domain-grounded multilingual resources for agricultural policies and farmer welfare schemes. Initially, we collected and structur…

  2. arXiv cs.AI TIER_1 English(EN) · Gopakumar G

    AgriGov: A Structured Multilingual Dataset Curation for Indian Government Schemes for Farmers

    AgriGov is a curated, trilingual (English-Hindi-Marathi) dataset designed to address the scarcity of domain-grounded multilingual resources for agricultural policies and farmer welfare schemes. Initially, we collected and structured data from 50 government schemes sourced from tr…