MirathQA: A dataset for evaluating large language models on Hanbali Islamic inheritance reasoning tasks

Islamic inheritance (Muwārīth/مواريث) refers to the distribution of a deceased person's estate among qualified heirs in accordance with Sharia laws derived from the Qur’an and Sunnah. This dataset focuses specifically on the Hanbali school of jurisprudence (المذهب الحنبلي), one of the four major Sunni schools, which has distinct rulings on residuary heirs (taʿsīb/التعصيب) and blocking mechanisms (al-ḥajb/الحجب). Despite recent advances, Arabic large language models (LLMs) often struggle with tasks requiring exact arithmetic, multi-step reasoning, and strict adherence to domain-specific rules. These challenges are particularly evident in Islamic inheritance law (ʿilm al-farāʾiḍ/علم الفرائض), where case resolution demands sequential rule application, identification of heirs and blocked heirs (ḥijb/حجب), handling of proportional reduction (ʿawl/عول) and return (radd/رد), and accounting for juristic differences. The lack of well-structured Arabic datasets further restricts systematic evaluation in this area. To address this gap, we introduce the MirathQA, comprising over 1394 questions derived from 242 real-life inheritance cases collected from different Arabic sources. Each case includes a textual description, verified solution, heirs and their corresponding shares, and annotations for blocked or modified heirs. Additional multiple-choice and true/false questions assess reasoning skills such as exclusion and proportional reduction. The dataset is divided into training (70%), validation (15%), and test (15%) splits at the case level to prevent leakage and ensure reproducibility. It is also provided in both CSV and Excel formats. This resource establishes a benchmark for evaluating Arabic LLM reasoning, while also serving education, interdisciplinary research, and legal AI applications.

Sharefah Ahmed Alghamdi

MirathQA: A dataset for evaluating large language models on Hanbali Islamic inheritance reasoning tasks