Rescaled ASGD 优化具有异构数据的分布式学习

作者 PulseAugur 编辑部 · [2 个来源] · 2026-05-13 12:27

研究人员推出了一种名为 Rescaled Asynchronous SGD (ASGD) 的新方法，用于在异构条件下优化分布式机器学习模型。该方法通过重新缩放特定工作节点的步长来解决标准 ASGD 中因较快的工作节点贡献更多更新而产生的偏差。该方法在理论上保证收敛到正确的全局目标，并在非凸设置中匹配已知的最小时间复杂度下界。 AI

影响引入了一种更有效的分布式人工智能训练优化方法，有可能提高在异构硬件上的性能。

排序理由详细介绍一种新优化方法的学术论文。

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.LG TIER_1 English(EN) · Peter Richtárik · 2026-05-13 12:27

Rescaled Asynchronous SGD: Optimal Distributed Optimization under Data and System Heterogeneity

Asynchronous stochastic gradient descent (ASGD) is a standard way to exploit heterogeneous compute resources in distributed learning: instead of forcing fast workers to wait for slow ones, the server updates the model whenever a gradient arrives. Vanilla ASGD applies each arrivin…
arXiv stat.ML TIER_1 English(EN) · Ammar Mahran, Artavazd Maranjyan, Peter Richt\'arik · 2026-05-14 04:00

Rescaled Asynchronous SGD: Optimal Distributed Optimization under Data and System Heterogeneity

arXiv:2605.13434v1 Announce Type: cross Abstract: Asynchronous stochastic gradient descent (ASGD) is a standard way to exploit heterogeneous compute resources in distributed learning: instead of forcing fast workers to wait for slow ones, the server updates the model whenever a g…

报道来源 [2]

Rescaled Asynchronous SGD: Optimal Distributed Optimization under Data and System Heterogeneity

Rescaled Asynchronous SGD: Optimal Distributed Optimization under Data and System Heterogeneity

相关实体

相关话题