English(EN) When Helpful Context Leaks: Privacy Risks in Domain-Adapted ASR

新研究揭示领域自适应SpeechLLM中的隐私风险

作者 PulseAugur 编辑部 · [2 个来源] · 2026-05-27 09:30

一篇新发表在arXiv上的研究论文详细介绍了一种领域自适应自动语音识别（ASR）模型（通常称为SpeechLLM）中存在的重大隐私风险。研究表明，当这些模型通过敏感信息的提示或专有数据的微调进行特定领域定制时，它们可能会无意中转录其上下文或训练数据中语音上相似的词语，即使实际说出的词语不同。这种泄露可能暴露私人信息。研究人员开发了一个数据集来量化不同定制方法下的风险，发现结合提示和微调会加剧该问题。他们还评估了一种提示级别的缓解策略，并得出结论，在没有额外上下文提示的情况下进行微调，在准确性和隐私之间取得了最佳平衡。 AI

影响强调了定制AI语音模型中潜在的数据泄露问题，并突出了在专业部署中实施强大隐私保护措施的必要性。

排序理由该集群包含一篇学术论文，详细介绍了AI模型隐私风险的新发现。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CL TIER_1 English(EN) · Maike Z\"ufle, Jan Niehues · 2026-05-28 04:00

当有用的上下文泄露时：领域自适应ASR中的隐私风险

arXiv:2605.28211v1 Announce Type: new Abstract: SpeechLLMs are increasingly deployed in professional settings where domain customisation is standard practice: users supply context in prompts with sensitive information, fine-tune on proprietary recordings, or both. We identify and…
arXiv cs.CL TIER_1 English(EN) · Jan Niehues · 2026-05-27 09:30

当有用的上下文泄露时：领域自适应ASR中的隐私风险

SpeechLLMs are increasingly deployed in professional settings where domain customisation is standard practice: users supply context in prompts with sensitive information, fine-tune on proprietary recordings, or both. We identify and systematically investigate an overlooked privac…

报道来源 [2]

当有用的上下文泄露时：领域自适应ASR中的隐私风险

当有用的上下文泄露时：领域自适应ASR中的隐私风险

相关实体

相关话题