Towards Understanding the Optimization Mechanisms in Deep Learning

Qi, Binchuan; Gong, Wei; Li, Li

Computer Science > Machine Learning

arXiv:2503.23016 (cs)

[Submitted on 29 Mar 2025]

Title:Towards Understanding the Optimization Mechanisms in Deep Learning

Authors:Binchuan Qi, Wei Gong, Li Li

View PDF HTML (experimental)

Abstract:In this paper, we adopt a probability distribution estimation perspective to explore the optimization mechanisms of supervised classification using deep neural networks. We demonstrate that, when employing the Fenchel-Young loss, despite the non-convex nature of the fitting error with respect to the model's parameters, global optimal solutions can be approximated by simultaneously minimizing both the gradient norm and the structural error. The former can be controlled through gradient descent algorithms. For the latter, we prove that it can be managed by increasing the number of parameters and ensuring parameter independence, thereby providing theoretical insights into mechanisms such as over-parameterization and random initialization. Ultimately, the paper validates the key conclusions of the proposed method through empirical results, illustrating its practical effectiveness.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2503.23016 [cs.LG]
	(or arXiv:2503.23016v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2503.23016

Submission history

From: Binchuan Qi [view email]
[v1] Sat, 29 Mar 2025 08:46:13 UTC (940 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2025-03

Change to browse by:

cs
cs.AI

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:Towards Understanding the Optimization Mechanisms in Deep Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Towards Understanding the Optimization Mechanisms in Deep Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators