Detecting Trojaned DNNs via Spectral Regression Analysis

New MIST method detects Trojans in fine-tuned deep neural networks

By the PulseAugur editorial team · [2 sources] · 2026-05-20 13:19

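Researchers have developed a new method, MIST, for detecting malicious Trojans embedded in deep neural networks (DNNs) during fine-tuning. MIST analyzes spectral changes in a model's internal representations to identify deviations that indicate a Trojan attack. The approach frames Trojan detection as a regression problem and, compared with existing methods, achieves higher accuracy even without prior knowledge of the attack's details.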
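To make the idea of "spectral changes treated as a regression problem" concrete, here is a minimal NumPy sketch. It is not the MIST algorithm, whose details are not given in this summary. It assumes, purely for illustration, that the spectrum is the top-k singular values of a layer's centered activations, that a linear regressor is fitted on trusted fine-tuning runs, and that the distance between the predicted and observed post-update spectrum serves as the suspicion score. All function names, the synthetic data, and the simulated Trojan shift are hypothetical.

```python
import numpy as np

def top_singular_values(acts, k=5):
    """Top-k singular values of a mean-centered (samples x features) activation matrix."""
    centered = acts - acts.mean(axis=0, keepdims=True)
    return np.linalg.svd(centered, compute_uv=False)[:k]

def fit_spectrum_regressor(base_spectra, tuned_spectra):
    """Least-squares linear map (with bias) from pre- to post-fine-tuning spectra."""
    X = np.hstack([base_spectra, np.ones((len(base_spectra), 1))])
    coef, *_ = np.linalg.lstsq(X, tuned_spectra, rcond=None)
    return coef

def deviation_score(coef, base_spectrum, tuned_spectrum):
    """Distance between the observed post-update spectrum and the regressor's prediction."""
    predicted = np.append(base_spectrum, 1.0) @ coef
    return float(np.linalg.norm(tuned_spectrum - predicted))

def demo(seed=0, n_trusted=40, n_samples=200, n_features=16):
    """Synthetic comparison of a benign update and a simulated Trojaned update."""
    rng = np.random.default_rng(seed)

    def base_acts(scale):
        return scale * rng.normal(size=(n_samples, n_features))

    def benign_update(acts):
        # Assumption: benign fine-tuning only perturbs activations slightly.
        return acts + 0.05 * rng.normal(size=acts.shape)

    def trojan_update(acts):
        # Assumption: a Trojan pushes 10% of the inputs along one feature direction.
        mask = np.zeros(n_samples)
        mask[: n_samples // 10] = 1.0
        direction = np.zeros(n_features)
        direction[0] = 5.0
        return benign_update(acts) + np.outer(mask, direction)

    # Fit the regressor on trusted (benign) fine-tuning runs only.
    base, tuned = [], []
    for _ in range(n_trusted):
        acts = base_acts(rng.uniform(0.5, 1.5))
        base.append(top_singular_values(acts))
        tuned.append(top_singular_values(benign_update(acts)))
    coef = fit_spectrum_regressor(np.array(base), np.array(tuned))

    # Score a benign and a Trojaned update of the same held-out model.
    suspect = base_acts(1.0)
    s0 = top_singular_values(suspect)
    clean = deviation_score(coef, s0, top_singular_values(benign_update(suspect)))
    trojan = deviation_score(coef, s0, top_singular_values(trojan_update(suspect)))
    return clean, trojan
```

Calling `demo()` returns the pair `(clean, trojan)`. In this synthetic setup the simulated Trojaned update deviates far more from the predicted spectrum than the benign one. A real detector would still need a calibrated threshold and activations from actual models.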

Impact: Introduces a novel technique for hardening AI models against sophisticated attacks during development.

Ranking rationale: Academic paper detailing a new method for detecting security vulnerabilities in AI models.

AI-generated summary · Google Gemini · from 2 sources.

Sources [2]

arXiv cs.AI TIER_1 English(EN) · Samuele Pasini, Jinhan Kim, Paolo Tonella · 2026-05-22 04:00

Detecting Trojaned DNNs via Spectral Regression Analysis

arXiv:2605.21146v1 Announce Type: cross Abstract: Modern DNNs are repeatedly fine-tuned to incorporate new data and functionality. This evolutionary workflow introduces a security risk when updated data cannot be fully trusted, as adversaries may implant Trojans during fine-tunin…
arXiv cs.AI TIER_1 English(EN) · Paolo Tonella · 2026-05-20 13:19

Detecting Trojaned DNNs via Spectral Regression Analysis

Modern DNNs are repeatedly fine-tuned to incorporate new data and functionality. This evolutionary workflow introduces a security risk when updated data cannot be fully trusted, as adversaries may implant Trojans during fine-tuning. We present MIST, a Trojan detection approach th…