A Continual Development Methodology for Large-scale Multitask Dynamic ML Systems

Gesmundo, Andrea

arXiv:2209.07326 (cs)

[Submitted on 15 Sep 2022 (v1), last revised 6 Nov 2022 (this version, v3)]

Abstract: The traditional Machine Learning (ML) methodology requires fragmenting the development and experimental process into disconnected iterations whose feedback is used to guide design or tuning choices. This methodology has multiple efficiency and scalability disadvantages, such as spending significant resources on the creation of multiple trial models that do not contribute to the final solution. The presented work is based on the intuition that defining ML models as modular and extensible artefacts allows the introduction of a novel ML development methodology that integrates multiple design and evaluation iterations into the continuous enrichment of a single unbounded intelligent system. We define a novel method for the generation of dynamic multitask ML models as a sequence of extensions and generalizations. We first analyze the capabilities of the proposed method using the standard ML empirical evaluation methodology. Finally, we propose a novel continuous development methodology that allows a pre-existing large-scale multitask ML system to be dynamically extended while analyzing the properties of the proposed method extensions. This results in an ML model capable of jointly solving 124 image classification tasks, achieving state-of-the-art quality with improved size and compute cost.

Comments:	arXiv admin note: text overlap with arXiv:2205.12755
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2209.07326 [cs.LG]
	(or arXiv:2209.07326v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2209.07326

Submission history

From: Andrea Gesmundo
[v1] Thu, 15 Sep 2022 14:36:17 UTC (2,343 KB)
[v2] Fri, 30 Sep 2022 10:59:45 UTC (4,682 KB)
[v3] Sun, 6 Nov 2022 08:45:40 UTC (7,030 KB)
