English(EN) HistoriQA-ThirdRepublic: Multi-Hop Question Answering Corpus for Historical Research, Parliamentary Debates from the French Third Republic (1870-1940)

发布新的法语历史问答数据集以评估LLM

作者 PulseAugur 编辑部 · [1 个来源] · 2026-07-01 04:00

研究人员开发了HistoriQA-ThirdRepublic，一个新设计的法语多跳问答历史研究数据集。该语料库与一位历史学家合作创建，包含1782个问题，源自法兰西第三共和国（1870-1940）的议会辩论和报纸。它旨在通过捕捉复杂的推理模式（如跨源综合和时间推理）来评估检索增强和大型语言模型系统，弥合NLP基准与历史学术之间的差距。 AI

影响为评估LLM在历史研究中的应用提供了一个专业数据集，有可能提高领域特定的问答能力。

排序理由该集群包含一篇新发布的学术论文，详细介绍了一个用于NLP研究的新数据集。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Aur\'elien Pellet (LRE), Julien Perez (EPITA, LRE), Marie Puren (LRE, CJM) · 2026-07-01 04:00

HistoriQA-ThirdRepublic: Multi-Hop Question Answering Corpus for Historical Research, Parliamentary Debates from the French Third Republic (1870-1940)

arXiv:2606.31325v1 Announce Type: new Abstract: We present HistoriQA-ThirdRepublic: a French-language dataset of multi-hop historical questions derived from parliamentary debates and newspapers of the French Third Republic. Designed in collaboration with a historian, the corpus c…

报道来源 [1]

HistoriQA-ThirdRepublic: Multi-Hop Question Answering Corpus for Historical Research, Parliamentary Debates from the French Third Republic (1870-1940)

相关话题