English(EN) REDACT: A Systematically Controlled Multilingual Benchmark for Personal Information Detection

新的REDACT基准系统性地测试了25种语言的PII检测能力

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-18 07:38

研究人员推出REDACT，这是一个新的多语言基准，旨在系统性地评估个人身份信息（PII）的检测能力。该基准包含超过13,000条记录，324,000个标注，涵盖51种实体类型，并支持25种语言。研究评估了包括GPT-4.1和Claude Sonnet 4.6在内的五种检测器，结果表明，虽然基于LLM的检测器通常更强大，但它们的性能会因数据敏感性和披露形式而显著不同。该基准旨在提供对PII检测能力更受控、更全面的评估。 AI

影响为PII检测提供了一个更强大的评估框架，这对于负责任的AI部署和数据隐私至关重要。

排序理由该集群描述了一个新的学术基准和对PII检测系统的评估。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CL TIER_1 English(EN) · Guneesh Vats, Anubha Agrawal, Shikha Singhal, Ajita Dash, Praison Selvaraj, Vidhan Jhawar, Ranga Prasad Chenna, Bharadwaj Y M G · 2026-06-19 04:00

REDACT：个人信息检测的可控多语言基准测试系统

arXiv:2606.19881v1 Announce Type: new Abstract: Benchmark infrastructure for personally identifiable information (PII) detection remains limited: existing corpora cover few entity types, use ad hoc generation conditions, and do not show which surface conditions cause detector fai…
arXiv cs.CL TIER_1 English(EN) · Bharadwaj Y M G · 2026-06-18 07:38

REDACT：用于个人信息检测的系统控制多语言基准

Benchmark infrastructure for personally identifiable information (PII) detection remains limited: existing corpora cover few entity types, use ad hoc generation conditions, and do not show which surface conditions cause detector failures. We present REDACT, a systematically contr…

报道来源 [2]

REDACT：个人信息检测的可控多语言基准测试系统

REDACT：用于个人信息检测的系统控制多语言基准

相关实体

相关话题