PulseAugur

New Vietnamese legal NLI dataset released for AI research

Researchers have introduced ViLegalNLI, a dataset for natural language inference (NLI) in the Vietnamese legal domain, described by its authors as the first large-scale resource of its kind. It comprises 42,012 premise-hypothesis pairs derived from official statutory documents, each labeled as entailment or non-entailment. The dataset is intended as a benchmark for evaluating how well AI systems understand and reason about Vietnamese legal texts, including their legal logic and terminology.
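
To make the benchmark setup concrete, here is a minimal Python sketch of how a binary NLI pair might be represented and how a system could be scored on it. It is not the authors' code or data format: the `NLIPair` class, its field names, the `accuracy` helper, and the `always_entailment` baseline are all hypothetical, and the toy pairs are placeholders rather than items from ViLegalNLI.

```python
from dataclasses import dataclass
from typing import Callable, Sequence

# The two labels named in the summary; the exact label strings are an assumption.
LABELS = ("entailment", "non-entailment")


@dataclass(frozen=True)
class NLIPair:
    """One premise-hypothesis pair (hypothetical schema, not the dataset's)."""
    premise: str
    hypothesis: str
    label: str

    def __post_init__(self) -> None:
        if self.label not in LABELS:
            raise ValueError(f"unknown label: {self.label!r}")


def accuracy(pairs: Sequence[NLIPair],
             predict: Callable[[str, str], str]) -> float:
    """Fraction of pairs for which predict(premise, hypothesis) matches the label."""
    if not pairs:
        raise ValueError("cannot score an empty set of pairs")
    correct = sum(predict(p.premise, p.hypothesis) == p.label for p in pairs)
    return correct / len(pairs)


def always_entailment(premise: str, hypothesis: str) -> str:
    """Trivial baseline that ignores its input; useful only as a sanity floor."""
    return "entailment"


# Placeholder pairs for illustration only; these are NOT taken from ViLegalNLI.
toy_pairs = [
    NLIPair("placeholder premise A", "placeholder hypothesis A", "entailment"),
    NLIPair("placeholder premise B", "placeholder hypothesis B", "entailment"),
    NLIPair("placeholder premise C", "placeholder hypothesis C", "entailment"),
    NLIPair("placeholder premise D", "placeholder hypothesis D", "non-entailment"),
]

baseline_score = accuracy(toy_pairs, always_entailment)
print(f"always-entailment baseline accuracy: {baseline_score:.2f}")
```

A real evaluation would replace `always_entailment` with a model's prediction function and would likely report per-class metrics as well, since accuracy alone can hide a skewed label distribution.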

Summary written by gemini-2.5-flash-lite from 2 sources.

IMPACT Establishes a benchmark for legal AI in Vietnam, potentially improving statutory text understanding and decision support systems.

RANK_REASON This is a research paper introducing a new dataset for a specific domain.


COVERAGE [2]

  1. arXiv cs.LG TIER_1 · Nhung Thi-Hong Duong, Mai Ngoc Ho, Tin Van Huynh, Kiet Van Nguyen

    ViLegalNLI: Natural Language Inference for Vietnamese Legal Texts

    arXiv:2605.00116v1. In this article, we introduce ViLegalNLI, the first large-scale Vietnamese Natural Language Inference (NLI) dataset specifically constructed for the legal domain. The dataset consists of 42,012 premise-hypothesis pairs derived fro…

  2. arXiv cs.CL TIER_1 · Kiet Van Nguyen

    ViLegalNLI: Natural Language Inference for Vietnamese Legal Texts

    In this article, we introduce ViLegalNLI, the first large-scale Vietnamese Natural Language Inference (NLI) dataset specifically constructed for the legal domain. The dataset consists of 42,012 premise-hypothesis pairs derived from official statutory documents and annotated with …