A Two-Stage Training Method for Modeling Constrained Systems With Neural Networks

Coelho, C.; Costa, M. Fernanda P.; Ferrás, L. L.

Computer Science > Machine Learning

arXiv:2403.02730 (cs)

[Submitted on 5 Mar 2024]

Title:A Two-Stage Training Method for Modeling Constrained Systems With Neural Networks

Authors:C. Coelho, M. Fernanda P. Costa, L.L. Ferrás

View PDF

Abstract:Real-world systems are often formulated as constrained optimization problems. Techniques to incorporate constraints into Neural Networks (NN), such as Neural Ordinary Differential Equations (Neural ODEs), have been used. However, these introduce hyperparameters that require manual tuning through trial and error, raising doubts about the successful incorporation of constraints into the generated model. This paper describes in detail the two-stage training method for Neural ODEs, a simple, effective, and penalty parameter-free approach to model constrained systems. In this approach the constrained optimization problem is rewritten as two unconstrained sub-problems that are solved in two stages. The first stage aims at finding feasible NN parameters by minimizing a measure of constraints violation. The second stage aims to find the optimal NN parameters by minimizing the loss function while keeping inside the feasible region. We experimentally demonstrate that our method produces models that satisfy the constraints and also improves their predictive performance. Thus, ensuring compliance with critical system properties and also contributing to reducing data quantity requirements. Furthermore, we show that the proposed method improves the convergence to an optimal solution and improves the explainability of Neural ODE models. Our proposed two-stage training method can be used with any NN architectures.

Subjects:	Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE); Optimization and Control (math.OC)
MSC classes:	35A01, 65L10, 65L12, 65L20, 65L70
ACM classes:	I.5.1; G.1.6
Cite as:	arXiv:2403.02730 [cs.LG]
	(or arXiv:2403.02730v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2403.02730

Submission history

From: Cecília Coelho [view email]
[v1] Tue, 5 Mar 2024 07:37:47 UTC (836 KB)

Computer Science > Machine Learning

Title:A Two-Stage Training Method for Modeling Constrained Systems With Neural Networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A Two-Stage Training Method for Modeling Constrained Systems With Neural Networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators