Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics

Das, Indrashis; Safari, Mahmoud; Adriaensen, Steven; Hutter, Frank

Computer Science > Machine Learning

arXiv:2502.03654 (cs)

[Submitted on 5 Feb 2025]

Title:Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics

Authors:Indrashis Das, Mahmoud Safari, Steven Adriaensen, Frank Hutter

View PDF HTML (experimental)

Abstract:Activation functions are fundamental elements of deep learning architectures as they significantly influence training dynamics. ReLU, while widely used, is prone to the dying neuron problem, which has been mitigated by variants such as LeakyReLU, PReLU, and ELU that better handle negative neuron outputs. Recently, self-gated activations like GELU and Swish have emerged as state-of-the-art alternatives, leveraging their smoothness to ensure stable gradient flow and prevent neuron inactivity. In this work, we introduce the Gompertz Linear Unit (GoLU), a novel self-gated activation function defined as $\mathrm{GoLU}(x) = x \, \mathrm{Gompertz}(x)$, where $\mathrm{Gompertz}(x) = e^{-e^{-x}}$. The GoLU activation leverages the asymmetry in the Gompertz function to reduce variance in the latent space more effectively compared to GELU and Swish, while preserving robust gradient flow. Extensive experiments across diverse tasks, including Image Classification, Language Modeling, Semantic Segmentation, Object Detection, Instance Segmentation, and Diffusion, highlight GoLU's superior performance relative to state-of-the-art activation functions, establishing GoLU as a robust alternative to existing activation functions.

Comments:	8 pages, excluding references and appendix
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2502.03654 [cs.LG]
	(or arXiv:2502.03654v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2502.03654

Submission history

From: Mahmoud Safari [view email]
[v1] Wed, 5 Feb 2025 22:32:22 UTC (14,132 KB)

Computer Science > Machine Learning

Title:Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators