AlphaGrad: Non-Linear Gradient Normalization Optimizer

Sane, Soham

Computer Science > Machine Learning

arXiv:2504.16020 (cs)

[Submitted on 22 Apr 2025 (v1), last revised 23 Apr 2025 (this version, v2)]

Title:AlphaGrad: Non-Linear Gradient Normalization Optimizer

Authors:Soham Sane

View PDF HTML (experimental)

Abstract:We introduce AlphaGrad, a memory-efficient, conditionally stateless optimizer addressing the memory overhead and hyperparameter complexity of adaptive methods like Adam. AlphaGrad enforces scale invariance via tensor-wise L2 gradient normalization followed by a smooth hyperbolic tangent transformation, $g' = \tanh(\alpha \cdot \tilde{g})$, controlled by a single steepness parameter $\alpha$. Our contributions include: (1) the AlphaGrad algorithm formulation; (2) a formal non-convex convergence analysis guaranteeing stationarity; (3) extensive empirical evaluation on diverse RL benchmarks (DQN, TD3, PPO). Compared to Adam, AlphaGrad demonstrates a highly context-dependent performance profile. While exhibiting instability in off-policy DQN, it provides enhanced training stability with competitive results in TD3 (requiring careful $\alpha$ tuning) and achieves substantially superior performance in on-policy PPO. These results underscore the critical importance of empirical $\alpha$ selection, revealing strong interactions between the optimizer's dynamics and the underlying RL algorithm. AlphaGrad presents a compelling alternative optimizer for memory-constrained scenarios and shows significant promise for on-policy learning regimes where its stability and efficiency advantages can be particularly impactful.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
Cite as:	arXiv:2504.16020 [cs.LG]
	(or arXiv:2504.16020v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2504.16020

Submission history

From: Soham Sane [view email]
[v1] Tue, 22 Apr 2025 16:33:14 UTC (1,811 KB)
[v2] Wed, 23 Apr 2025 01:25:32 UTC (1,812 KB)

Computer Science > Machine Learning

Title:AlphaGrad: Non-Linear Gradient Normalization Optimizer

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:AlphaGrad: Non-Linear Gradient Normalization Optimizer

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators