Randomised Splitting Methods and Stochastic Gradient Descent

Shaw, Luke; Whalley, Peter A.

Mathematics > Optimization and Control

arXiv:2504.04274 (math)

[Submitted on 5 Apr 2025]

Title:Randomised Splitting Methods and Stochastic Gradient Descent

Authors:Luke Shaw, Peter A. Whalley

View PDF HTML (experimental)

Abstract:We explore an explicit link between stochastic gradient descent using common batching strategies and splitting methods for ordinary differential equations. From this perspective, we introduce a new minibatching strategy (called Symmetric Minibatching Strategy) for stochastic gradient optimisation which shows greatly reduced stochastic gradient bias (from $\mathcal{O}(h^2)$ to $\mathcal{O}(h^4)$ in the optimiser stepsize $h$), when combined with momentum-based optimisers. We justify why momentum is needed to obtain the improved performance using the theory of backward analysis for splitting integrators and provide a detailed analytic computation of the stochastic gradient bias on a simple example.
Further, we provide improved convergence guarantees for this new minibatching strategy using Lyapunov techniques that show reduced stochastic gradient bias for a fixed stepsize (or learning rate) over the class of strongly-convex and smooth objective functions. Via the same techniques we also improve the known results for the Random Reshuffling strategy for stochastic gradient descent methods with momentum. We argue that this also leads to a faster convergence rate when considering a decreasing stepsize schedule. Both the reduced bias and efficacy of decreasing stepsizes are demonstrated numerically on several motivating examples.

Comments:	34 pages, 3 figures
Subjects:	Optimization and Control (math.OC); Numerical Analysis (math.NA); Machine Learning (stat.ML)
MSC classes:	65L20, 90C25, 93C15
Cite as:	arXiv:2504.04274 [math.OC]
	(or arXiv:2504.04274v1 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2504.04274

Submission history

From: Peter Archibald Whalley [view email]
[v1] Sat, 5 Apr 2025 20:07:34 UTC (2,501 KB)

Mathematics > Optimization and Control

Title:Randomised Splitting Methods and Stochastic Gradient Descent

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:Randomised Splitting Methods and Stochastic Gradient Descent

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators