PulseAugur
English(EN) Online learning with Erdős-Rényi side-observation graphs

New algorithms improve online learning with side-observation graphs

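Researchers have developed new algorithms for adversarial multi-armed bandit problems with partial loss information. The algorithms handle the case where each non-chosen arm reveals its loss with a fixed but unknown probability. The proposed methods achieve near-optimal regret bounds even without knowing the exact probability of a loss observation.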
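The feedback model described above can be simulated in a few lines. The following is a minimal sketch, not the paper's algorithm: it assumes the reveal probability (called `r` here, as in the abstract) is known to the learner, which is exactly the assumption the paper removes. All function names are hypothetical and chosen for illustration only.

```python
import random


def observe_losses(losses, chosen, r, rng):
    """Return {arm: loss} for the chosen arm plus every other arm
    that is revealed independently with probability r."""
    observed = {chosen: losses[chosen]}
    for arm, loss in enumerate(losses):
        if arm != chosen and rng.random() < r:
            observed[arm] = loss
    return observed


def estimate_losses(observed, probs, r, n_arms):
    """Importance-weighted loss estimates.

    ASSUMPTION: r is known here. An arm is observed either because it
    was chosen (probability p) or because it was not chosen but was
    revealed (probability (1 - p) * r), so dividing by the sum of the
    two makes the estimate unbiased.
    """
    estimates = [0.0] * n_arms
    for arm, loss in observed.items():
        obs_prob = probs[arm] + (1.0 - probs[arm]) * r
        estimates[arm] = loss / obs_prob
    return estimates


def average_estimates(losses, probs, r, rounds, seed=0):
    """Average the estimates over many rounds with fixed losses and a
    fixed sampling distribution, to check unbiasedness empirically."""
    rng = random.Random(seed)
    n_arms = len(losses)
    totals = [0.0] * n_arms
    for _ in range(rounds):
        chosen = rng.choices(range(n_arms), weights=probs)[0]
        observed = observe_losses(losses, chosen, r, rng)
        estimates = estimate_losses(observed, probs, r, n_arms)
        totals = [t + e for t, e in zip(totals, estimates)]
    return [t / rounds for t in totals]
```

With `r = 0` this reduces to ordinary bandit feedback, where only the chosen arm is seen, and with `r = 1` every loss is revealed each round. When `r` is unknown, the denominator in `estimate_losses` cannot be computed directly, which is the difficulty the paper's algorithms are reported to address.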

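Impact: Introduces novel algorithms for bandit problems with partial feedback, which could improve decision-making in online learning systems.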

Ranking rationale: An academic paper published on arXiv that details new algorithms for a specific machine learning problem.


AI-generated summary · Google Gemini · from 2 sources.


Sources [2]

  1. arXiv stat.ML TIER_1 English(EN) · Tomáš Kocák, Gergely Neu, Michal Valko ·

    Online learning with Erdős-Rényi side-observation graphs

    arXiv:2604.25271v1. Abstract: We consider adversarial multi-armed bandit problems where the learner is allowed to observe losses of a number of arms beside the arm that it actually chose. We study the case where all non-chosen arms reveal their loss with a fixed…

  2. arXiv stat.ML TIER_1 English(EN) · Michal Valko ·

    Online learning with Erdős-Rényi side-observation graphs

    We consider adversarial multi-armed bandit problems where the learner is allowed to observe losses of a number of arms beside the arm that it actually chose. We study the case where all non-chosen arms reveal their loss with a fixed but unknown probability $r$, independently of e…