中文(ZH) 【ICML 2026】基于响应自举的LVLM安全微调框架 BYORn

New BYORn Framework Defends LVLMs Against Backdoor Attacks

By PulseAugur Editorial · [1 sources] · 2026-06-29 02:22

Researchers have developed a novel defense framework called BYORn (Bootstrap Your Own Responses) to protect Large Vision-Language Models (LVLMs) from backdoor attacks during supervised fine-tuning (SFT). This method leverages the inherent semantic understanding of pre-trained models to detect and replace maliciously altered responses with dynamically generated, semantically consistent ones. BYORn effectively neutralizes various backdoor attacks with minimal impact on the model's general performance, and in some cases, even enhances it through a regularization effect. AI

IMPACT This research offers a robust defense against data poisoning in LVLMs, potentially improving the security and reliability of AI systems in sensitive applications.

RANK_REASON The cluster describes a novel defense framework for LVLMs against backdoor attacks, presented in a research paper accepted to ICML 2026. [lever_c_demoted from research: ic=1 ai=1.0]

Read on 雷峰网 (Leiphone) →

paper
safety

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New BYORn Framework Defends LVLMs Against Backdoor Attacks

COVERAGE [1]

雷峰网 (Leiphone) TIER_1 中文(ZH) · 2026-06-29 02:22

[ICML 2026] BYORn: A Response Bootstrapping Framework for Safe Fine-tuning of LVLMs

<section style="padding: 0 10px; line-height: 1.6; text-align: left; font-family: Optima-Regular, Optima, PingFangSC-light, PingFangTC-light, 'PingFang SC', Cambria, Cochin, Georgia, Times, 'Times New Roman', serif; font-size: 15px; letter-spacing: 0.05em; color: #595959;"><p sty…

COVERAGE [1]

[ICML 2026] BYORn: A Response Bootstrapping Framework for Safe Fine-tuning of LVLMs

RELATED ENTITIES

RELATED TOPICS