English(EN) To Redact, or not to Redact? A Local LLM Approach to Deliberative Process Privilege Classification

本地 LLM 对敏感政府文件进行分类，媲美商业模型

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-11 08:55

研究人员开发了一种本地大型语言模型 (LLM) 方法，用于对政府文件中的敏感信息进行分类，特别关注《信息自由法》(FOIA) 请求的审议程序特权。该研究使用了 Qwen3.5 9B 模型，该模型可以在消费级硬件上运行，从而避免了与云 API 相关的法律和政治问题。他们的方法结合了思维链 (Chain-of-Thought) 和少样本提示 (few-shot prompting) 以及基于错误的示例，取得了与商业模型相当的性能，并在召回率和 F2 分数上优于先前的工作。分析显示，被归类为审议性的句子通常包含表示观点的动词，并且是以第一人称表述的。 AI

影响能够安全地在本地对敏感政府文件进行分类，有可能提高对透明度法律的合规性。

排序理由学术论文，详细介绍了使用 LLM 进行文档分类的新颖方法。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · David Graus · 2026-05-11 08:55

To Redact, or not to Redact? A Local LLM Approach to Deliberative Process Privilege Classification

Government transparency laws, like the Freedom of Information (FOIA) acts in the United States and United Kingdom, and the Woo (Open Government Act) in the Netherlands, grant citizens the right to directly request documents from the government. As these documents might contain se…

报道来源 [1]

To Redact, or not to Redact? A Local LLM Approach to Deliberative Process Privilege Classification

相关实体

相关话题