Meta-ticket: Finding optimal subnetworks for few-shot learning within randomly initialized neural networks

Chijiwa, Daiki; Yamaguchi, Shin'ya; Kumagai, Atsutoshi; Ida, Yasutoshi

Computer Science > Machine Learning

arXiv:2205.15619 (cs)

[Submitted on 31 May 2022 (v1), last revised 9 Feb 2023 (this version, v2)]

Title:Meta-ticket: Finding optimal subnetworks for few-shot learning within randomly initialized neural networks

Authors:Daiki Chijiwa, Shin'ya Yamaguchi, Atsutoshi Kumagai, Yasutoshi Ida

View PDF

Abstract:Few-shot learning for neural networks (NNs) is an important problem that aims to train NNs with a few data. The main challenge is how to avoid overfitting since over-parameterized NNs can easily overfit to such small dataset. Previous work (e.g. MAML by Finn et al. 2017) tackles this challenge by meta-learning, which learns how to learn from a few data by using various tasks. On the other hand, one conventional approach to avoid overfitting is restricting hypothesis spaces by endowing sparse NN structures like convolution layers in computer vision. However, although such manually-designed sparse structures are sample-efficient for sufficiently large datasets, they are still insufficient for few-shot learning. Then the following questions naturally arise: (1) Can we find sparse structures effective for few-shot learning by meta-learning? (2) What benefits will it bring in terms of meta-generalization? In this work, we propose a novel meta-learning approach, called Meta-ticket, to find optimal sparse subnetworks for few-shot learning within randomly initialized NNs. We empirically validated that Meta-ticket successfully discover sparse subnetworks that can learn specialized features for each given task. Due to this task-wise adaptation ability, Meta-ticket achieves superior meta-generalization compared to MAML-based methods especially with large NNs. The code is available at: this https URL

Comments:	36th Conference on Neural Information Processing Systems (NeurIPS 2022)
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
Cite as:	arXiv:2205.15619 [cs.LG]
	(or arXiv:2205.15619v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2205.15619

Submission history

From: Daiki Chijiwa [view email]
[v1] Tue, 31 May 2022 09:03:57 UTC (1,062 KB)
[v2] Thu, 9 Feb 2023 08:48:18 UTC (2,803 KB)

Computer Science > Machine Learning

Title:Meta-ticket: Finding optimal subnetworks for few-shot learning within randomly initialized neural networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Meta-ticket: Finding optimal subnetworks for few-shot learning within randomly initialized neural networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators