Towards Stable Machine Learning Model Retraining via Slowly Varying Sequences

Digalakis Jr, Vassilis; Ma, Yu; Paschalidis, Phevos; Bertsimas, Dimitris

Computer Science > Machine Learning

arXiv:2403.19871v1 (cs)

[Submitted on 28 Mar 2024 (this version), latest version 22 May 2024 (v4)]

Title:Towards Stable Machine Learning Model Retraining via Slowly Varying Sequences

Authors:Vassilis Digalakis Jr, Yu Ma, Phevos Paschalidis, Dimitris Bertsimas

View PDF HTML (experimental)

Abstract:Retraining machine learning models remains an important task for real-world machine learning model deployment. Existing methods focus largely on greedy approaches to find the best-performing model without considering the stability of trained model structures across different retraining evolutions. In this study, we develop a mixed integer optimization algorithm that holistically considers the problem of retraining machine learning models across different data batch updates. Our method focuses on retaining consistent analytical insights - which is important to model interpretability, ease of implementation, and fostering trust with users - by using custom-defined distance metrics that can be directly incorporated into the optimization problem. Importantly, our method shows stronger stability than greedily trained models with a small, controllable sacrifice in model performance in a real-world production case study. Finally, important analytical insights, as demonstrated using SHAP feature importance, are shown to be consistent across retraining iterations.

Comments:	For correspondence, contact Yu Ma, midsumer@mit.edu
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
Cite as:	arXiv:2403.19871 [cs.LG]
	(or arXiv:2403.19871v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2403.19871

Submission history

From: Yu Ma [view email]
[v1] Thu, 28 Mar 2024 22:45:38 UTC (3,484 KB)
[v2] Mon, 8 Apr 2024 21:52:11 UTC (3,484 KB)
[v3] Mon, 29 Apr 2024 15:12:25 UTC (3,503 KB)
[v4] Wed, 22 May 2024 19:15:23 UTC (3,543 KB)

Computer Science > Machine Learning

Title:Towards Stable Machine Learning Model Retraining via Slowly Varying Sequences

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Towards Stable Machine Learning Model Retraining via Slowly Varying Sequences

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators