When Code Smells Meet ML: On the Lifecycle of ML-specific Code Smells in ML-enabled Systems

Recupito, Gilberto; Giordano, Giammaria; Ferrucci, Filomena; Di Nucci, Dario; Palomba, Fabio


arXiv:2403.08311 (cs)

[Submitted on 13 Mar 2024]

Abstract: Context. The adoption of Machine Learning (ML)-enabled systems is steadily increasing. Nevertheless, there is a shortage of ML-specific quality assurance approaches, possibly because little is known about how quality-related concerns emerge and evolve in ML-enabled systems. Objective. We aim to investigate the emergence and evolution of specific types of quality-related concerns known as ML-specific code smells, i.e., sub-optimal implementation solutions applied to ML pipelines that may significantly decrease both the quality and maintainability of ML-enabled systems. More specifically, we present a plan to study ML-specific code smells by empirically analyzing (i) their prevalence in real ML-enabled systems, (ii) how they are introduced and removed, and (iii) their survivability. Method. We will conduct an exploratory study, mining a large dataset of ML-enabled systems and analyzing over 400k commits from 337 projects. We will track and inspect the introduction and evolution of ML smells through CodeSmile, a novel ML smell detector that we will build to enable our investigation and to detect ML-specific code smells.
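As a rough illustration of points (ii) and (iii), the sketch below derives introduction, removal, and survival from per-commit detection results. It is not CodeSmile's interface, which the abstract does not describe: the input shape, the `Lifecycle` record, the `smell_lifecycles` function, and the smell names are all hypothetical, and survival is measured here simply as the number of commits in which a smell instance was present.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical shapes: a smell instance is (file_path, smell_name), and a history
# is a chronologically ordered list of (commit_id, smells detected at that commit).
Smell = tuple[str, str]
History = list[tuple[str, set[Smell]]]


@dataclass
class Lifecycle:
    smell: Smell
    introduced_in: str
    removed_in: Optional[str]  # None if the smell is still present at the last commit
    survived_commits: int      # number of commits in which the smell was present


def smell_lifecycles(history: History) -> list[Lifecycle]:
    """Track when each smell instance first appears and when it disappears."""
    open_smells: dict[Smell, tuple[str, int]] = {}
    lifecycles: list[Lifecycle] = []

    for index, (commit_id, smells) in enumerate(history):
        # A tracked smell that is absent from this commit was removed by it.
        for smell in list(open_smells):
            if smell not in smells:
                intro_commit, intro_index = open_smells.pop(smell)
                lifecycles.append(
                    Lifecycle(smell, intro_commit, commit_id, index - intro_index)
                )
        # A smell that is not yet tracked was introduced by this commit.
        for smell in smells:
            if smell not in open_smells:
                open_smells[smell] = (commit_id, index)

    # Smells never removed survive through the last analyzed commit.
    last_index = len(history) - 1
    for smell, (intro_commit, intro_index) in open_smells.items():
        lifecycles.append(
            Lifecycle(smell, intro_commit, None, last_index - intro_index + 1)
        )
    return lifecycles
```

A smell that disappears and later reappears is counted as two separate lifecycles under this scheme; a real study would also need to handle file renames and branching histories, which this sketch ignores.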

Comments:	6 pages, 1 figure
Subjects:	Software Engineering (cs.SE)
ACM classes:	D.2.7
Cite as:	arXiv:2403.08311 [cs.SE]
	(or arXiv:2403.08311v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2403.08311

Submission history

From: Gilberto Recupito
[v1] Wed, 13 Mar 2024 07:43:45 UTC (1,355 KB)
