AgriGov: A Structured Multilingual Dataset Curation for Indian Government Schemes for Farmers
Researchers have developed AgriGov, a new multilingual dataset aimed at improving AI tools for Indian farmers. The dataset focuses on government schemes and welfare policies, initially covering 50 schemes across English, Hindi, and Marathi. It was created using automated scraping and a translation pipeline involving Google Translate, MarianMT, and human post-editing, resulting in approximately 8,000 parallel sentence pairs. AI
IMPACT Enhances AI capabilities for domain-specific machine translation and information retrieval relevant to agricultural policy.