New T-VSS method boosts VLM adversarial robustness efficiently

By PulseAugur Editorial · [1 sources] · 2026-06-22 10:21

Researchers have developed a new method called Test-Time Visual Subspace Steering (T-VSS) to improve the adversarial robustness of vision-language models (VLMs). This technique adapts the visual feature space directly at test time, rather than relying on indirect methods like prompt tuning or input-space optimization. T-VSS constructs a sample-specific low-rank subspace from feature residuals of an attacked image and then learns a feature correction within this subspace using reliability-weighted entropy minimization. Experiments demonstrate that T-VSS enhances adversarial robustness across various benchmarks while maintaining strong clean accuracy and offering improved efficiency compared to existing test-time adaptation methods. AI

IMPACT This research offers a more efficient method for enhancing the security of vision-language models against adversarial attacks.

RANK_REASON Academic paper detailing a new method for improving AI model robustness. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CV →

paper
safety

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New T-VSS method boosts VLM adversarial robustness efficiently

COVERAGE [1]

arXiv cs.CV TIER_1 English(EN) · Changick Kim · 2026-06-22 10:21

T-VSS: Test-Time Visual Subspace Steering for Adversarial Robustness of Vision-Language Models

Vision-language models (VLMs) achieve strong zero-shot recognition, but they remain highly vulnerable to adversarial perturbations. Recent test-time adaptations improve robustness without retraining, but they do not directly adapt the corrupted visual representation itself. Promp…

COVERAGE [1]

T-VSS: Test-Time Visual Subspace Steering for Adversarial Robustness of Vision-Language Models

RELATED ENTITIES

RELATED TOPICS