On the Performance Analysis of Momentum Method: A Frequency Domain Perspective

Li, Xianliang; Luo, Jun; Zheng, Zhiwei; Wang, Hanxiao; Luo, Li; Wen, Lingkun; Wu, Linlong; Xu, Sheng

Computer Science > Machine Learning

arXiv:2411.19671 (cs)

[Submitted on 29 Nov 2024 (v1), last revised 10 Mar 2025 (this version, v4)]

Title:On the Performance Analysis of Momentum Method: A Frequency Domain Perspective

Authors:Xianliang Li, Jun Luo, Zhiwei Zheng, Hanxiao Wang, Li Luo, Lingkun Wen, Linlong Wu, Sheng Xu

View PDF HTML (experimental)

Abstract:Momentum-based optimizers are widely adopted for training neural networks. However, the optimal selection of momentum coefficients remains elusive. This uncertainty impedes a clear understanding of the role of momentum in stochastic gradient methods. In this paper, we present a frequency domain analysis framework that interprets the momentum method as a time-variant filter for gradients, where adjustments to momentum coefficients modify the filter characteristics. Our experiments support this perspective and provide a deeper understanding of the mechanism involved. Moreover, our analysis reveals the following significant findings: high-frequency gradient components are undesired in the late stages of training; preserving the original gradient in the early stages, and gradually amplifying low-frequency gradient components during training both enhance performance. Based on these insights, we propose Frequency Stochastic Gradient Descent with Momentum (FSGDM), a heuristic optimizer that dynamically adjusts the momentum filtering characteristic with an empirically effective dynamic magnitude response. Experimental results demonstrate the superiority of FSGDM over conventional momentum optimizers.

Comments:	ICLR 2025. 22 pages, 14 figures. Keywords: Momentum Method, Stochastic Gradient Descent, Z-Transform, Frequency Domain Analysis, Deep Learning
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2411.19671 [cs.LG]
	(or arXiv:2411.19671v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2411.19671

Submission history

From: Xianliang Li [view email]
[v1] Fri, 29 Nov 2024 12:56:43 UTC (1,212 KB)
[v2] Wed, 12 Feb 2025 01:34:12 UTC (1,241 KB)
[v3] Thu, 27 Feb 2025 08:33:40 UTC (1,239 KB)
[v4] Mon, 10 Mar 2025 09:16:28 UTC (1,239 KB)

Computer Science > Machine Learning

Title:On the Performance Analysis of Momentum Method: A Frequency Domain Perspective

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:On the Performance Analysis of Momentum Method: A Frequency Domain Perspective

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators