New Vietnamese legal NLI dataset released for AI research

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 2 sources

Researchers have introduced ViLegalNLI, a new dataset designed for natural language inference tasks within the Vietnamese legal domain. This dataset comprises over 42,000 premise-hypothesis pairs extracted from official legal documents, labeled with entailment or non-entailment. It aims to serve as a benchmark for evaluating AI systems in understanding and reasoning about Vietnamese legal texts, incorporating complex legal logic and terminology. AI

Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →

IMPACT Establishes a benchmark for legal AI in Vietnam, potentially improving statutory text understanding and decision support systems.

RANK_REASON This is a research paper introducing a new dataset for a specific domain.

Read on arXiv cs.CL →

paper
other

COVERAGE [2]

arXiv cs.LG TIER_1 · Nhung Thi-Hong Duong, Mai Ngoc Ho, Tin Van Huynh, Kiet Van Nguyen · 2026-05-04 04:00

ViLegalNLI: Natural Language Inference for Vietnamese Legal Texts

arXiv:2605.00116v1 Announce Type: cross Abstract: In this article, we introduce ViLegalNLI, the first large-scale Vietnamese Natural Language Inference (NLI) dataset specifically constructed for the legal domain. The dataset consists of 42,012 premise-hypothesis pairs derived fro…
arXiv cs.CL TIER_1 · Kiet Van Nguyen · 2026-04-30 18:16

ViLegalNLI: Natural Language Inference for Vietnamese Legal Texts

In this article, we introduce ViLegalNLI, the first large-scale Vietnamese Natural Language Inference (NLI) dataset specifically constructed for the legal domain. The dataset consists of 42,012 premise-hypothesis pairs derived from official statutory documents and annotated with …

COVERAGE [2]

ViLegalNLI: Natural Language Inference for Vietnamese Legal Texts

ViLegalNLI: Natural Language Inference for Vietnamese Legal Texts

RELATED ENTITIES

RELATED TOPICS