Is Synthetic Image Useful for Transfer Learning? An Investigation into Data Generation, Volume, and Utilization

Li, Yuhang; Dong, Xin; Chen, Chen; Li, Jingtao; Wen, Yuxin; Spranger, Michael; Lyu, Lingjuan

arXiv:2403.19866 (cs)

[Submitted on 28 Mar 2024 (v1), last revised 2 Apr 2024 (this version, v2)]

Abstract: Synthetic image data generation represents a promising avenue for training deep learning models, particularly in the realm of transfer learning, where obtaining real images within a specific domain can be prohibitively expensive due to privacy and intellectual property considerations. This work delves into the generation and utilization of synthetic images derived from text-to-image generative models in facilitating transfer learning paradigms. Despite the high visual fidelity of the generated images, we observe that their naive incorporation into existing real-image datasets does not consistently enhance model performance due to the inherent distribution gap between synthetic and real images. To address this issue, we introduce a novel two-stage framework called bridged transfer, which initially employs synthetic images for fine-tuning a pre-trained model to improve its transferability and subsequently uses real data for rapid adaptation. Alongside, we propose a dataset style inversion strategy to improve the stylistic alignment between synthetic and real images. Our proposed methods are evaluated across 10 different datasets and 5 distinct models, demonstrating consistent improvements, with up to 30% accuracy increase on classification tasks. Intriguingly, we note that the enhancements were not yet saturated, indicating that the benefits may further increase with an expanded volume of synthetic data.
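
The two-stage ordering described in the abstract (fine-tune on synthetic data first, then adapt on real data) can be illustrated with a toy sketch. This is not the authors' implementation: a two-weight logistic regression stands in for a pre-trained deep network, Gaussian toy samples stand in for images, and every name here (`sgd_finetune`, `bridged_transfer`, `make_data`, the `shift` parameter, the epoch counts and learning rate) is a hypothetical choice made for illustration only.

```python
import math
import random


def sgd_finetune(weights, data, epochs, lr):
    """Fine-tune a logistic-regression model (a stand-in for a deep network) with SGD."""
    w = list(weights)
    for _ in range(epochs):
        for x, y in data:
            z = sum(wi * xi for wi, xi in zip(w, x))
            p = 1.0 / (1.0 + math.exp(-z))
            # Gradient of the log loss with respect to weight i is (p - y) * x_i.
            w = [wi - lr * (p - y) * xi for wi, xi in zip(w, x)]
    return w


def bridged_transfer(pretrained, synthetic, real, epochs_syn=5, epochs_real=5, lr=0.1):
    """Stage 1: fine-tune on synthetic data. Stage 2: adapt on real data."""
    bridged = sgd_finetune(pretrained, synthetic, epochs_syn, lr)
    return sgd_finetune(bridged, real, epochs_real, lr)


def accuracy(w, data):
    """Fraction of samples whose predicted class (score > 0) matches the label."""
    hits = sum((sum(wi * xi for wi, xi in zip(w, x)) > 0) == bool(y) for x, y in data)
    return hits / len(data)


def make_data(n, shift, rng):
    """Toy two-class data; `shift` mimics a synthetic-vs-real distribution gap (assumption)."""
    data = []
    for _ in range(n):
        y = rng.randint(0, 1)
        x1 = rng.gauss(2.0 if y else -2.0, 1.0) + shift
        data.append(([x1, 1.0], y))  # second feature is a constant bias input
    return data


rng = random.Random(0)
synthetic = make_data(200, shift=0.5, rng=rng)  # plentiful, slightly off-distribution
real = make_data(20, shift=0.0, rng=rng)        # scarce, in-distribution
test = make_data(200, shift=0.0, rng=rng)

weights = bridged_transfer([0.0, 0.0], synthetic, real)
print(accuracy(weights, test))
```

The sketch captures only the staging: the synthetic set is used to move the starting weights before the small real set is seen, rather than being mixed into the real set. It does not model the dataset style inversion step or any other detail of the paper's training recipe.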

Comments:	ICLR24 Score 6865 this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2403.19866 [cs.CV]
	(or arXiv:2403.19866v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.19866

Submission history

From: Jingtao Li
[v1] Thu, 28 Mar 2024 22:25:05 UTC (3,720 KB)
[v2] Tue, 2 Apr 2024 22:41:53 UTC (3,720 KB)
