English(EN) WARP: Weight-Space Analysis for Recovering Training Data Portfolios

新的WARP框架从权重推断基础模型训练数据

作者 PulseAugur 编辑部 · [1 个来源] · 2026-07-03 04:00

研究人员开发了一个名为WARP的新框架，可以直接从基础模型发布的权重中推断出其使用的训练数据组合。该方法绕过了对训练数据或训练轨迹的直接访问需求，而这些通常由模型开发者保密。WARP通过分析权重空间中训练数据的几何足迹来工作，能够以高精度近似领域比例，其表现优于现有的成员推断等方法。 AI

影响增强了基础模型训练的透明度，可能有助于复现性和偏差检测。

排序理由该集群包含一篇学术论文，详细介绍了用于分析基础模型的新研究框架。[lever_c_demoted from research: ic=1 ai=1.0]

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.LG TIER_1 English(EN) · Tzu-Heng Huang, Aditya Goyal, John Cooper, Frederic Sala · 2026-07-03 04:00

WARP: Weight-Space Analysis for Recovering Training Data Portfolios

arXiv:2607.01686v1 Announce Type: new Abstract: Foundation models are routinely released to the public, yet the data recipes used to train them -- such as domain mixture weights that determine how different sources are sampled -- are rarely disclosed. This creates an access asymm…