PulseAugur
实时 23:04:24
English(EN) Online Convex Optimization with Sublinear Noisy Probes

新的OCO框架通过噪声探测改进遗憾

研究人员开发了一个新的在线凸优化(OCO)框架,即使在有限且有噪声的成对探测预算下,也能改进最坏情况下的遗憾。所提出的方法统一了次线性最佳专家查询和成对反馈,表明次线性的噪声探测预算可以被证明在全反馈OCO机制中增强遗憾。该分析通过方差缩减和连续指数权重(Continuous Exponential Weights)的二阶分析来量化探测的好处,从而得出严格的遗憾保证。 AI

排序理由 该集群包含一篇详细介绍在线凸优化新理论框架的学术论文。

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

报道来源 [2]

  1. arXiv cs.LG TIER_1 English(EN) · Simone Di Gregorio, Anupam Gupta, Stefano Leonardi, Matteo Russo ·

    Online Convex Optimization with Sublinear Noisy Probes

    arXiv:2606.14640v1 Announce Type: new Abstract: We study Online Convex Optimization (OCO) over a convex set $K\subseteq \mathbb R^d$, where in each round $t$ the learner selects $x_t\in K$ and then observes a convex loss $f_t:K\to[0,1]$, with the goal of minimizing regret to the …

  2. arXiv cs.LG TIER_1 English(EN) · Matteo Russo ·

    Online Convex Optimization with Sublinear Noisy Probes

    We study Online Convex Optimization (OCO) over a convex set $K\subseteq \mathbb R^d$, where in each round $t$ the learner selects $x_t\in K$ and then observes a convex loss $f_t:K\to[0,1]$, with the goal of minimizing regret to the best fixed decision in hindsight. We introduce a…