Generalized Activation via Multivariate Projection

Li, Jiayun; Cheng, Yuxiao; Xia, Zhuofan; Mo, Yilin; Huang, Gao

Computer Science > Machine Learning

arXiv:2309.17194v1 (cs)

[Submitted on 29 Sep 2023 (this version), latest version 27 Jan 2024 (v2)]

Title:Generalized Activation via Multivariate Projection

Authors:Jiayun Li, Yuxiao Cheng, Zhuofan Xia, Yilin Mo, Gao Huang

View PDF

Abstract:Activation functions are essential to introduce nonlinearity into neural networks, with the Rectified Linear Unit (ReLU) often favored for its simplicity and effectiveness. Motivated by the structural similarity between a shallow Feedforward Neural Network (FNN) and a single iteration of the Projected Gradient Descent (PGD) algorithm, a standard approach for solving constrained optimization problems, we consider ReLU as a projection from R onto the nonnegative half-line R+. Building on this interpretation, we extend ReLU by substituting it with a generalized projection operator onto a convex cone, such as the Second-Order Cone (SOC) projection, thereby naturally extending it to a Multivariate Projection Unit (MPU), an activation function with multiple inputs and multiple outputs. We further provide a mathematical proof establishing that FNNs activated by SOC projections outperform those utilizing ReLU in terms of expressive power. Experimental evaluations on widely-adopted architectures further corroborate MPU's effectiveness against a broader range of existing activation functions.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2309.17194 [cs.LG]
	(or arXiv:2309.17194v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2309.17194

Submission history

From: Yuxiao Cheng [view email]
[v1] Fri, 29 Sep 2023 12:44:27 UTC (976 KB)
[v2] Sat, 27 Jan 2024 09:50:08 UTC (1,086 KB)

Computer Science > Machine Learning

Title:Generalized Activation via Multivariate Projection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Generalized Activation via Multivariate Projection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators