Gaussian smoothing gradient descent for minimizing functions (GSmoothGD)

Starnes, Andrew; Dereventsov, Anton; Webster, Clayton

Abstract:This work analyzes the convergence of a class of smoothing-based gradient descent methods when applied to optimization problems. In particular, Gaussian smoothing is employed to define a nonlocal gradient that reduces high-frequency noise, small variations, and rapid fluctuations in the computation of the descent directions while preserving the structure and features of the loss landscape. The resulting Gaussian smoothing gradient descent (GSmoothGD) approach can facilitate gradient descent in navigating away from and avoiding local minima with increased ease, thereby substantially enhancing its overall performance even when applied to non-convex optimization problems. This work also provides rigorous theoretical error estimates on the rate of convergence of GSmoothGD iterates. These estimates exemplify the impact of underlying function convexity, smoothness, input dimension, and the Gaussian smoothing radius. To combat the curse of dimensionality, we numerically approximate the GSmoothGD nonlocal gradient using Monte Carlo (MC) sampling and provide a theory in which the iterates converge regardless of the function smoothness and dimension. Finally, we present several strategies to update the smoothing parameter aimed at diminishing the impact of local minima, thereby rendering the attainment of global minima more achievable. Computational evidence complements the present theory and shows the effectiveness of the MC-GSmoothGD method compared to other smoothing-based algorithms, momentum-based approaches, and classical gradient-based algorithms from numerical optimization.

Comments:	29 pages, 2 figures, 2 tables
Subjects:	Optimization and Control (math.OC)
MSC classes:	35Q90, 65H20, 90C25, 90C30, 90C56
Cite as:	arXiv:2311.00521 [math.OC]
	(or arXiv:2311.00521v2 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2311.00521

Mathematics > Optimization and Control

Title:Gaussian smoothing gradient descent for minimizing functions (GSmoothGD)

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators