Sparse deep neural networks for modeling aluminum electrolysis dynamics

Lundby, Erlend Torje Berg; Rasheed, Adil; Halvorsen, Ivar Johan; Gravdahl, Jan Tommy

doi:10.1016/j.asoc.2023.109989

arXiv:2209.05832 (physics)

[Submitted on 13 Sep 2022 (v1), last revised 13 Jan 2023 (this version, v2)]

Abstract: Deep neural networks have become very popular in modeling complex nonlinear processes due to their extraordinary ability to fit arbitrary nonlinear functions from data with minimal expert intervention. However, they are almost always overparameterized and challenging to interpret due to their internal complexity. Furthermore, the optimization that finds the model parameters can be unstable because it may get stuck in local minima. In this work, we demonstrate the value of sparse regularization techniques to significantly reduce the model complexity. We demonstrate this for the case of an aluminum extraction process, which is a highly nonlinear system with many interrelated subprocesses. We trained a densely connected deep neural network to model the process and then compared the effects of sparsity-promoting ℓ1 regularization on generalizability, interpretability, and training stability. We found that the regularization significantly reduces model complexity compared to a corresponding dense neural network. We argue that this makes the model more interpretable, and show that training an ensemble of sparse neural networks with different parameter initializations often converges to similar model structures with similar learned input features. Furthermore, the empirical study shows that the resulting sparse models generalize better from small training sets than their dense counterparts.
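
To illustrate how an ℓ1 penalty promotes sparsity, the following is a minimal sketch and not the authors' implementation. It assumes a three-input linear model standing in for the deep network, a tiny synthetic data set, and proximal gradient descent (ISTA) with soft-thresholding as the optimizer. All names (`soft_threshold`, `fit_l1`, `lam`) and the data are hypothetical and chosen only for illustration.

```python
# Illustrative sketch only: a linear model stands in for the deep network,
# and ISTA (proximal gradient descent) is an assumed choice of optimizer.

def soft_threshold(w, t):
    """Proximal operator of t * |w|: shrink w toward zero by t."""
    if w > t:
        return w - t
    if w < -t:
        return w + t
    return 0.0


def fit_l1(xs, ys, lam, lr=0.1, steps=500):
    """Minimize mean squared error + lam * sum(|w_j|) with ISTA."""
    n, d = len(xs), len(xs[0])
    w = [0.0] * d
    for _ in range(steps):
        # Residuals of the current linear prediction.
        residuals = [sum(wj * xj for wj, xj in zip(w, x)) - y
                     for x, y in zip(xs, ys)]
        # Gradient of the mean squared error term only.
        grad = [2.0 / n * sum(r * x[j] for r, x in zip(residuals, xs))
                for j in range(d)]
        # Gradient step, then the l1 proximal (shrinkage) step.
        w = [soft_threshold(wj - lr * gj, lr * lam)
             for wj, gj in zip(w, grad)]
    return w


# Synthetic data: the target depends strongly on input 0, weakly on input 1,
# and not at all on input 2.
xs = [[1.0, 1.0, 1.0],
      [1.0, -1.0, -1.0],
      [-1.0, 1.0, -1.0],
      [-1.0, -1.0, 1.0]]
ys = [2.0 * x[0] + 0.05 * x[1] for x in xs]

w_dense = fit_l1(xs, ys, lam=0.0)   # no penalty: the weak input keeps a weight
w_sparse = fit_l1(xs, ys, lam=0.2)  # l1 penalty: weak weights are set to zero

print("dense weights: ", w_dense)
print("sparse weights:", w_sparse)
```

In this toy setting the penalty shrinks the dominant weight slightly and drives the weak and irrelevant weights exactly to zero. This is the mechanism by which ℓ1 regularization reduces the number of active parameters.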

Comments:	32 pages, 18 figures
Subjects:	Chemical Physics (physics.chem-ph); Machine Learning (cs.LG)
Cite as:	arXiv:2209.05832 [physics.chem-ph]
	(or arXiv:2209.05832v2 [physics.chem-ph] for this version)
	https://doi.org/10.48550/arXiv.2209.05832
Related DOI:	https://doi.org/10.1016/j.asoc.2023.109989

Submission history

From: Erlend Lundby
[v1] Tue, 13 Sep 2022 09:11:50 UTC (2,712 KB)
[v2] Fri, 13 Jan 2023 09:26:43 UTC (3,227 KB)
