English(EN) A Fully First-Order Layer for Differentiable Optimization

新方法探索用于神经网络的无梯度优化

作者 PulseAugur 编辑部 · [3 个来源] · 2026-06-13 02:51

研究人员正在探索用于优化神经网络的新颖方法，而不依赖于传统的基于梯度的方法。一篇论文介绍了一种用于可微优化的全一阶层，通过将问题重新表述为双层优化任务来避免计算量大的Hessian计算。另一项研究提出了一种在希尔伯特空间中进行无限维优化的无梯度方法，利用方向导数和自动微分，该方法在通过物理信息神经网络求解微分方程方面显示出潜力。在MNIST数据集上的实际演示成功地采用了一种无导数优化方法，在图像分类中取得了具有竞争力的准确率，并在高维参数空间中优于基线Adam优化器。 AI

影响这些无梯度优化技术可以为训练复杂模型提供新的途径，有可能降低计算成本，并在梯度难以计算的情况下实现优化。

排序理由该集群包含讨论机器学习模型新颖优化技术研究的学术论文和一篇Reddit帖子。

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 3 个来源。我们如何撰写摘要 →

报道来源 [3]

arXiv cs.LG TIER_1 English(EN) · Zihao Zhao, Kai-Chia Mo, Shing-Hei Ho, Brandon Amos, Kai Wang · 2026-06-16 04:00

A Fully First-Order Layer for Differentiable Optimization

arXiv:2512.02494v2 Announce Type: replace Abstract: Differentiable optimization layers enable learning systems to make decisions by solving embedded optimization problems. However, computing gradients via implicit differentiation requires solving a linear system with Hessian term…
arXiv stat.ML TIER_1 English(EN) · Caio Peixoto, Daniel Csillag, Bernardo F. P. da Costa, Yuri F. Saporito · 2026-06-16 04:00

Random Gradient-Free Optimization in Infinite Dimensional Spaces

arXiv:2512.20566v2 Announce Type: replace-cross Abstract: We propose a new gradient-free method for infinite-dimensional optimization in Hilbert spaces that requires only the computation of directional derivatives. Though functional optimization is often solved through finite-dim…
r/MachineLearning TIER_1 English(EN) · /u/Mis4318 · 2026-06-13 02:51

Derivative-Free Neural Network Optimization: MNIST Case [R]

<table> <tr><td> <a href="https://www.reddit.com/r/MachineLearning/comments/1u4fc16/derivativefree_neural_network_optimization_mnist/"> <img alt="Derivative-Free Neural Network Optimization: MNIST Case [R]" src="https://preview.redd.it/te5dm6f9sy6h1.png?width=140&height=106&a…

报道来源 [3]

A Fully First-Order Layer for Differentiable Optimization

Random Gradient-Free Optimization in Infinite Dimensional Spaces

Derivative-Free Neural Network Optimization: MNIST Case [R]

相关实体

相关话题