Fine-grained Claim-level RAG Benchmark for Law
Researchers have developed ClaimRAG-LAW, a new benchmark dataset designed to evaluate retrieval-augmented generation (RAG) systems in the legal domain. This dataset supports both French and English, catering to both legal experts and non-experts with diverse question types. Initial evaluations using ClaimRAG-LAW revealed limitations in the retrieval and generation capabilities of current state-of-the-art legal RAG systems. AI
IMPACT This new benchmark aims to improve the accuracy and reliability of AI systems in the legal field, potentially leading to more trustworthy legal AI applications.