Convergence of mean-field Langevin dynamics: Time and space discretization, stochastic gradient, and variance reduction

Suzuki, Taiji; Wu, Denny; Nitanda, Atsushi

Computer Science > Machine Learning

arXiv:2306.07221 (cs)

[Submitted on 12 Jun 2023]

Title:Convergence of mean-field Langevin dynamics: Time and space discretization, stochastic gradient, and variance reduction

Authors:Taiji Suzuki, Denny Wu, Atsushi Nitanda

View PDF

Abstract:The mean-field Langevin dynamics (MFLD) is a nonlinear generalization of the Langevin dynamics that incorporates a distribution-dependent drift, and it naturally arises from the optimization of two-layer neural networks via (noisy) gradient descent. Recent works have shown that MFLD globally minimizes an entropy-regularized convex functional in the space of measures. However, all prior analyses assumed the infinite-particle or continuous-time limit, and cannot handle stochastic gradient updates. We provide an general framework to prove a uniform-in-time propagation of chaos for MFLD that takes into account the errors due to finite-particle approximation, time-discretization, and stochastic gradient approximation. To demonstrate the wide applicability of this framework, we establish quantitative convergence rate guarantees to the regularized global optimal solution under (i) a wide range of learning problems such as neural network in the mean-field regime and MMD minimization, and (ii) different gradient estimators including SGD and SVRG. Despite the generality of our results, we achieve an improved convergence rate in both the SGD and SVRG settings when specialized to the standard Langevin dynamics.

Comments:	37 pages
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2306.07221 [cs.LG]
	(or arXiv:2306.07221v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.07221

Submission history

From: Taiji Suzuki [view email]
[v1] Mon, 12 Jun 2023 16:28:11 UTC (100 KB)

Computer Science > Machine Learning

Title:Convergence of mean-field Langevin dynamics: Time and space discretization, stochastic gradient, and variance reduction

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Convergence of mean-field Langevin dynamics: Time and space discretization, stochastic gradient, and variance reduction

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators