新框架解决了AI合作中的奖励分配问题

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-26 16:08

研究人员为AI合作中的奖励分配引入了一个新颖的框架，其中人类代理在不同的价值约束下贡献数据并参与模型更新。该系统被称为价值条件梯度过滤，仅奖励那些符合每个主体价值概况的更新。该方法利用在线边际贡献信号和遍历学习（TL）基底内的累积收入结算，提供了比FedAvg等传统联邦学习方法更精细的归因。 AI

影响这项研究可能带来更公平、更高效的去中心化AI系统协作模型。

排序理由该集群包含一篇详细介绍AI合作新框架的学术论文。

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.AI TIER_1 English(EN) · Young Yoon, Jimin Kim, Soyeon Park · 2026-06-29 04:00

迈向完全委托AI合作中的价值约束信用分配

arXiv:2606.28217v1 Announce Type: cross Abstract: We propose a framework for reward allocation in fully delegated AI cooperatives where humans are represented by agents that contribute data and participate in model updates under heterogeneous value constraints. The key idea is to…
arXiv cs.MA (Multiagent) TIER_1 English(EN) · Soyeon Park · 2026-06-26 16:08

迈向完全委托AI合作中的价值约束信用分配

We propose a framework for reward allocation in fully delegated AI cooperatives where humans are represented by agents that contribute data and participate in model updates under heterogeneous value constraints. The key idea is to credit only those updates that remain admissible …