Randomized Least Squares Value Iteration itself is Joint Differentially Private

RLSVI achieves joint differential privacy through its randomized exploration

By PulseAugur Editorial · [1 source] · 2026-06-02 04:00

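Researchers have developed a new privacy analysis for reinforcement learning algorithms, focusing on Randomized Least Squares Value Iteration (RLSVI). Their work shows that the noise RLSVI already injects for exploration also provides differential privacy protection. The study gives a mathematical characterization of this guarantee, showing that RLSVI is $(\varepsilon(\delta),\delta)$-joint differentially private in tabular Markov decision processes.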
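To make the role of the exploration noise concrete, here is a minimal sketch of one commonly described tabular form of RLSVI, in which each backward value-iteration step adds Gaussian noise whose variance shrinks with the visit count. It is an illustration under stated assumptions, not the paper's exact algorithm: the function name `rlsvi_q_values`, the array layout, and the noise scale `sigma` are hypothetical, and the sketch makes no claim about which `sigma` yields a given $(\varepsilon(\delta),\delta)$ guarantee.

```python
import numpy as np


def rlsvi_q_values(counts, reward_sums, next_state_counts, sigma, rng):
    """Sample one set of randomized Q-values for a tabular, finite-horizon MDP.

    Assumed (hypothetical) data layout, with H steps, S states, A actions:
      counts[h, s, a]                visits to (s, a) at step h
      reward_sums[h, s, a]           sum of rewards observed at (h, s, a)
      next_state_counts[h, s, a, s'] observed transitions (h, s, a) -> s'
    sigma is a free noise-scale parameter, not a calibrated privacy parameter.
    """
    H, S, A = counts.shape
    q = np.zeros((H + 1, S, A))  # q[H] = 0 is the terminal value
    for h in range(H - 1, -1, -1):
        v_next = q[h + 1].max(axis=1)                 # greedy value at step h + 1
        denom = counts[h] + 1.0                       # unit regularization
        targets = reward_sums[h] + next_state_counts[h] @ v_next
        mean = targets / denom                        # regularized least-squares fit
        noise = rng.normal(0.0, sigma / np.sqrt(denom))  # exploration noise
        q[h] = mean + noise
    return q[:H]
```

In an episode, the agent would act greedily with respect to the sampled `q[h]`. The same Gaussian perturbation that drives exploration here is the noise that the summarized analysis interprets as a privacy mechanism.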

Impact: By providing a formal privacy guarantee, this research could enable reinforcement learning to be applied in sensitive domains.

Ranking rationale: This cluster contains an academic paper detailing a new privacy analysis of a reinforcement learning algorithm.

AI-generated summary · Google Gemini · from 1 source.

Sources [1]

arXiv cs.LG · Haiyang Lu, Pratik Gajane, Shaojie Bai, Mohammad Sadegh Talebi · 2026-06-02 04:00

Randomized Least Squares Value Iteration itself is Joint Differentially Private

arXiv:2606.01952v1 Announce Type: new Abstract: As reinforcement learning (RL) increasingly applies to sensitive domains, such as health care and recommendation systems, privacy-preserving techniques have become essential to protect users' sensitive information. We investigate pr…
