Toward Data Efficient Model Merging between Different Datasets without Performance Degradation

Yamada, Masanori; Yamashita, Tomoya; Yamaguchi, Shin'ya; Chijiwa, Daiki

Computer Science > Machine Learning

arXiv:2306.05641 (cs)

[Submitted on 9 Jun 2023 (v1), last revised 20 Sep 2024 (this version, v2)]

Title:Toward Data Efficient Model Merging between Different Datasets without Performance Degradation

Authors:Masanori Yamada, Tomoya Yamashita, Shin'ya Yamaguchi, Daiki Chijiwa

View PDF HTML (experimental)

Abstract:Model merging is attracting attention as a novel method for creating a new model by combining the weights of different trained models. While previous studies reported that model merging works well for models trained on a single dataset with different random seeds, model merging between different datasets remains unsolved. In this paper, we attempt to reveal the difficulty in merging such models trained on different datasets and alleviate it. Our empirical analyses show that, in contrast to the single-dataset scenarios, dataset information needs to be accessed to achieve high accuracy when merging models trained on different datasets. However, the requirement to use full datasets not only incurs significant computational costs but also becomes a major limitation when integrating models developed and shared by others. To address this, we demonstrate that dataset reduction techniques, such as coreset selection and dataset condensation, effectively reduce the data requirement for model merging. In our experiments with SPLIT-CIFAR10 model merging, the accuracy is significantly improved by $31%$ when using the full dataset and $24%$ when using the sampled subset compared with not using the dataset.

Comments:	29 pages; comments are welcome, accepted at ACML 2024
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2306.05641 [cs.LG]
	(or arXiv:2306.05641v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.05641

Submission history

From: Masanori Yamada [view email]
[v1] Fri, 9 Jun 2023 03:00:34 UTC (3,608 KB)
[v2] Fri, 20 Sep 2024 08:27:13 UTC (3,667 KB)

Computer Science > Machine Learning

Title:Toward Data Efficient Model Merging between Different Datasets without Performance Degradation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Toward Data Efficient Model Merging between Different Datasets without Performance Degradation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators