PulseAugur

Commercial LLMs Outperform Open-Source in Islamic Inheritance Reasoning

A new paper evaluates commercial and open-source large language models on Arabic Islamic inheritance reasoning tasks. The study found that commercial models generally outperform open-source ones: they identify heirs, apply exclusion rules, and stay consistent more reliably. Gemini 2.5 Flash performed best among the evaluated models, with a Mean Reciprocal Error (MRE) of 0.989.

IMPACT Highlights the current limitations of open-source models in complex legal and numerical reasoning, suggesting areas for future development.

RANK_REASON This is a research paper evaluating LLM performance on a specific reasoning task.


AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

  1. arXiv cs.CL TIER_1 English (EN) · Mohammed Amine Mouhoub, Chahinez Bouchekif

    Which Models Perform Better in Inheritance Reasoning?

    arXiv:2606.13751v1. Abstract: This paper presents the participation of team PSL in the QIAS 2026 Shared Task on Arabic Islamic inheritance reasoning. The task evaluates the ability of large language models to solve inheritance cases that require legal interpreta…