研究人员在约束线性二次调节器的安全学习控制中实现了近乎最优的遗憾

作者 PulseAugur 编辑部 · [2 个来源] · 2026-04-24 02:14

研究人员开发了一种用于约束随机线性二次调节器自适应控制的新算法。该算法实现了近乎最优的遗憾 $\tilde{O}(\sqrt{T})$ 并满足了机会约束，能够处理无界噪声。该方法包括使用半定规划选择一个乐观策略，然后将其缩减以确保安全，理论保证源于新颖的基于协方差的分析。 AI

影响引入了一种具有理论保证的约束系统新型控制算法，可能对机器人和自主系统产生影响。

排序理由这是一篇发表在 arXiv 上的研究论文，详细介绍了一种针对特定控制问题的新算法。

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.LG TIER_1 English(EN) · Spencer Hutchinson, Nanfei Jiang, Mahnoosh Alizadeh · 2026-04-27 04:00

Near-Optimal Regret for the Safe Learning-based Control of the Constrained Linear Quadratic Regulator

arXiv:2604.22158v1 Announce Type: cross Abstract: We study the problem of adaptive control of the stochastic linear quadratic regulator (LQR) with constraints that must be satisfied at every time step. Prior work on the multidimensional problem has shown $\tilde{O}(T^{2/3})$ regr…
arXiv cs.LG TIER_1 English(EN) · Mahnoosh Alizadeh · 2026-04-24 02:14

Near-Optimal Regret for the Safe Learning-based Control of the Constrained Linear Quadratic Regulator

We study the problem of adaptive control of the stochastic linear quadratic regulator (LQR) with constraints that must be satisfied at every time step. Prior work on the multidimensional problem has shown $\tilde{O}(T^{2/3})$ regret and satisfaction of robust constraints, leaving…