English(EN) When Context Sticks: Studying Interference in In-Context Learning

Transformer研究深入探讨安全漏洞、训练动态和上下文学习的局限性

作者 PulseAugur 编辑部 · [4 个来源] · 2026-04-28 04:00

研究人员发现了用于在推理过程中保护Transformer模型的混洗防御机制中的漏洞，并演示了一种通过对齐置换激活来提取模型权重的攻击。另一项研究深入探讨了Transformer训练的光谱动态，揭示了编码学习过程不同方面的瞬态压缩波和持久光谱梯度。此外，对上下文学习的调查表明，先前的示例会干扰模型适应新任务的能力，训练课程显著影响弹性，并且泛化能力取决于预训练任务是从子空间并集还是单个高斯分布中提取的。 AI

影响这些论文深入探讨了Transformer的安全漏洞、训练效率以及上下文学习背后的机制，可能为未来的模型开发和防御策略提供指导。

排序理由该集群包含多篇学术论文，探讨了Transformer模型的不同方面，包括安全性、训练动态和上下文学习。

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 4 个来源。我们如何撰写摘要 →

报道来源 [4]

arXiv cs.AI TIER_1 English(EN) · Jingwen Leng · 2026-05-06 13:31

Transformer安全推理中置乱防御的（不）安全性

For Transformer models, cryptographically secure inference ensures that the client learns only the final output, while the server learns nothing about the client's input. However, securely computing nonlinear layers remains a major efficiency bottleneck due to the substantial com…
arXiv cs.LG TIER_1 English(EN) · Yi Liu · 2026-04-28 04:00

Transformer 训练的光谱生命周期：瞬态压缩波、持久光谱梯度和 Q/K--V 不对称性

arXiv:2604.22778v1 Announce Type: new Abstract: We present the first systematic study of weight matrix singular value spectra \emph{during} transformer pretraining, tracking full SVD decompositions of every weight matrix at 25-step intervals across three model scales (30M--285M p…
arXiv cs.LG TIER_1 English(EN) · Hanna R{\o}d, Dagny Streit, Nils Valseth Selte, Justin Li · 2026-04-28 04:00

当上下文粘连：研究上下文学习中的干扰

arXiv:2604.23371v1 Announce Type: new Abstract: This paper investigates context stickiness in in-context learning (ICL), a phenomenon where earlier examples in a prompt interfere with a transformer's ability to adapt to later tasks. Using synthetic regression tasks over linear an…
arXiv stat.ML TIER_1 English(EN) · Soo Min Kwon, Alec S. Xu, Can Yaras, Laura Balzano, Qing Qu · 2026-04-30 04:00

上下文学习的分布外泛化：一个低维子空间视角

arXiv:2505.14808v2 Announce Type: replace Abstract: The transformer's remarkable ability to perform in-context learning (ICL) has sparked a wide range of studies designed to understand its strengths and limitations. However, a theoretical understanding of when ICL can and cannot …

报道来源 [4]

Transformer安全推理中置乱防御的（不）安全性

Transformer 训练的光谱生命周期：瞬态压缩波、持久光谱梯度和 Q/K--V 不对称性

当上下文粘连：研究上下文学习中的干扰

上下文学习的分布外泛化：一个低维子空间视角

相关实体

相关话题