Nonlinearly Preconditioned Gradient Methods: Momentum and Stochastic Analysis

1 citation · #2497 of 5858 papers in NeurIPS 2025

Abstract

We study nonlinearly preconditioned gradient methods for smooth nonconvex optimization problems, focusing on sigmoid preconditioners that inherently perform a form of gradient clipping, akin to the widely used clipping heuristic. Building upon this idea, we introduce a novel heavy ball-type algorithm and provide convergence guarantees under a generalized smoothness condition that is less restrictive than traditional Lipschitz smoothness, thus covering a broader class of functions. Additionally, we develop a stochastic variant of the base method and study its convergence properties under different noise assumptions. We compare the proposed algorithms with baseline methods on diverse machine learning tasks, including neural network training.

Citation History

Jan 24, 2026: 0
Jan 26, 2026: 0
Jan 28, 2026: 0
Feb 13, 2026: 1