PulseAugur
EN
LIVE 02:30:40

New Bengali agricultural dataset KrishokChat released for low-resource advisory

Researchers have developed KrishokChat, a new dataset and benchmark designed to improve agricultural advisory services in Bengali for low-resource settings. The dataset includes over 145,000 question-answering pairs, grounded in 129 agricultural manuals and featuring verified citation provenance for all information. A Farmer Benchmark of 1,001 real-world farmer queries was also created to evaluate model performance. Initial testing with Gemma-4-E2B showed that while KrishokChat improves structured formatting, models still face challenges with precise chemical dosage generalization, indicating its primary utility for retrieval-augmented generation. AI

IMPACT Enhances AI capabilities for agricultural advice in Bengali, particularly for retrieval-augmented generation systems.

RANK_REASON Publication of a new dataset and benchmark for a specific domain (agricultural advisory) in a low-resource language. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New Bengali agricultural dataset KrishokChat released for low-resource advisory

COVERAGE [1]

  1. arXiv cs.LG TIER_1 English(EN) · Khan Raiyan Ibne Reza, Omar Ibne Shahid ·

    KrishokChat: A Citation-Grounded Dataset and Benchmark for Bengali Agricultural Advisory

    arXiv:2606.29243v1 Announce Type: new Abstract: We present KrishokChat, the first citation-grounded Bengali agricultural instruction-tuning dataset for crop advisory in low-resource settings. We establish a foundation of 290 hierarchical Knowledge Nodes, extracting disease sympto…