HKJudge: A Legal Discourse-Annotated Corpus for Interpreting What Courts Find, How They Reason, and What They Rule
Researchers have developed HKJudge, a novel dataset of Hong Kong court judgments annotated at the sentence level by legal linguistics experts. This corpus, comprising approximately 290,000 sentences, is designed to capture the reasoning and rulings within legal discourse. The dataset supports tasks like rhetorical role classification and legal element extraction, and has been used to benchmark various language models.
AI IMPACT: Enables more sophisticated AI analysis of legal documents, potentially improving legal prediction and research.