Explicit Eigenvalue Regularization Improves Sharpness-Aware Minimization

Luo, Haocheng; Truong, Tuan; Pham, Tung; Harandi, Mehrtash; Phung, Dinh; Le, Trung

arXiv:2501.12666 (cs)

[Submitted on 22 Jan 2025]

Abstract: Sharpness-Aware Minimization (SAM) has attracted significant attention for its effectiveness in improving generalization across various tasks. However, its underlying principles remain poorly understood. In this work, we analyze SAM's training dynamics using the maximum eigenvalue of the Hessian as a measure of sharpness, and propose a third-order stochastic differential equation (SDE), which reveals that the dynamics are driven by a complex mixture of second- and third-order terms. We show that alignment between the perturbation vector and the top eigenvector is crucial for SAM's effectiveness in regularizing sharpness, but find that this alignment is often inadequate in practice, limiting SAM's efficiency. Building on these insights, we introduce Eigen-SAM, an algorithm that explicitly aims to regularize the top Hessian eigenvalue by aligning the perturbation vector with the leading eigenvector. We validate the effectiveness of our theory and the practical advantages of our proposed approach through comprehensive experiments. Code is available at this https URL.
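
The two quantities the abstract relies on, the top Hessian eigenvalue (used as the sharpness measure) and the alignment between SAM's perturbation vector and the top eigenvector, can be illustrated on a toy problem. The sketch below is not the paper's Eigen-SAM algorithm or its released code. It assumes a hypothetical quadratic loss L(w) = 0.5 * w^T H w with a hand-picked matrix H, uses the standard SAM perturbation eps = rho * grad / ||grad||, and estimates the top eigenpair by power iteration on Hessian-vector products. All names and values (H, w, rho, top_eigenpair) are illustrative assumptions.

```python
import math

def matvec(H, v):
    """Dense matrix-vector product on plain Python lists."""
    return [sum(h * x for h, x in zip(row, v)) for row in H]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def norm(v):
    return math.sqrt(dot(v, v))

def normalize(v):
    n = norm(v)
    return [x / n for x in v]

def top_eigenpair(hvp, dim, iters=200):
    """Power iteration that only needs Hessian-vector products hvp(v)."""
    v = normalize([1.0] * dim)
    for _ in range(iters):
        v = normalize(hvp(v))
    lam = dot(v, hvp(v))  # Rayleigh quotient of the unit vector v
    return lam, v

# Assumed toy loss L(w) = 0.5 * w^T H w, so grad = H w and the Hessian is H.
H = [[10.0, 0.0],
     [0.0, 1.0]]
w = [0.1, 1.0]
rho = 0.05  # SAM perturbation radius (illustrative value)

grad = matvec(H, w)
eps = [rho * g for g in normalize(grad)]  # standard SAM perturbation

# Sharpness proxy: largest Hessian eigenvalue and its eigenvector.
lam, v_top = top_eigenpair(lambda u: matvec(H, u), dim=2)

# Alignment: |cosine| between the perturbation and the top eigenvector.
alignment = abs(dot(normalize(eps), v_top))

print(f"top eigenvalue ~ {lam:.4f}")
print(f"alignment |cos(eps, v_top)| ~ {alignment:.4f}")
```

In this toy setting the perturbation is only partially aligned with the sharpest direction, which is the kind of gap the abstract describes. For a real network the Hessian-vector product would come from automatic differentiation rather than an explicit matrix.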

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2501.12666 [cs.LG] (or arXiv:2501.12666v1 [cs.LG] for this version)
DOI: https://doi.org/10.48550/arXiv.2501.12666

Submission history

From: Haocheng Luo
[v1] Wed, 22 Jan 2025 06:03:16 UTC (323 KB)
