A Simple Asymmetric Momentum Make SGD Greatest Again

Zhang, Gongyue; Zhang, Dinghuang; Zhao, Shuwen; Liu, Donghan; Toptan, Carrie M.; Liu, Honghai

Computer Science > Machine Learning

arXiv:2309.02130v1 (cs)

[Submitted on 5 Sep 2023 (this version), latest version 3 Oct 2023 (v2)]

Title:A Simple Asymmetric Momentum Make SGD Greatest Again

Authors:Gongyue Zhang, Dinghuang Zhang, Shuwen Zhao, Donghan Liu, Carrie M. Toptan, Honghai Liu

View PDF

Abstract:We propose the simplest SGD enhanced method ever, Loss-Controlled Asymmetric Momentum(LCAM), aimed directly at the Saddle Point problem. Compared to the traditional SGD with Momentum, there's no increase in computational demand, yet it outperforms all current optimizers. We use the concepts of weight conjugation and traction effect to explain this phenomenon. We designed experiments to rapidly reduce the learning rate at specified epochs to trap parameters more easily at saddle points. We selected WRN28-10 as the test network and chose cifar10 and cifar100 as test datasets, an identical group to the original paper of WRN and Cosine Annealing Scheduling(CAS). We compared the ability to bypass saddle points of Asymmetric Momentum with different priorities. Finally, using WRN28-10 on Cifar100, we achieved a peak average test accuracy of 80.78\% around 120 epoch. For comparison, the original WRN paper reported 80.75\%, while CAS was at 80.42\%, all at 200 epoch. This means that while potentially increasing accuracy, we use nearly half convergence time. Our demonstration code is available at\\ this https URL

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2309.02130 [cs.LG]
	(or arXiv:2309.02130v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2309.02130

Submission history

From: Gongyue Zhang [view email]
[v1] Tue, 5 Sep 2023 11:16:47 UTC (49 KB)
[v2] Tue, 3 Oct 2023 04:47:17 UTC (343 KB)

Computer Science > Machine Learning

Title:A Simple Asymmetric Momentum Make SGD Greatest Again

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A Simple Asymmetric Momentum Make SGD Greatest Again

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators