Researchers have developed a novel self-ensemble framework to improve narrative question answering (NQA) by reranking multiple generated answers. This approach enhances robustness by selecting answers based on semantic agreement, without altering the core model architecture. Experiments on the NarrativeQA dataset showed significant performance gains across various models, including FLAN-T5 and Pegasus-Large, with Pegasus-Large seeing a notable increase of over 14%. AI
IMPACT Enhances robustness of language models for complex question answering tasks.
RANK_REASON The cluster contains a research paper detailing a new method for narrative question answering, published on arXiv. [lever_c_demoted from research: ic=1 ai=1.0]
- alphaXiv
- arXiv
- CatalyzeX Code Finder for Papers
- CORE Recommender
- DagsHub
- FLAN-T5
- Gotit.pub
- Hugging Face
- NarrativeQA
- PEGASUS-large
- ScienceCast
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →