Mixing Natural and Synthetic Images for Robust Self-Supervised Representations

Bafghi, Reza Akbarian; Harilal, Nidhin; Monteleoni, Claire; Raissi, Maziar

Computer Science > Computer Vision and Pattern Recognition

arXiv:2406.12368 (cs)

[Submitted on 18 Jun 2024]

Title:Mixing Natural and Synthetic Images for Robust Self-Supervised Representations

Authors:Reza Akbarian Bafghi, Nidhin Harilal, Claire Monteleoni, Maziar Raissi

View PDF HTML (experimental)

Abstract:This paper introduces DiffMix, a new self-supervised learning (SSL) pre-training framework that combines real and synthetic images. Unlike traditional SSL methods that predominantly use real images, DiffMix uses a variant of Stable Diffusion to replace an augmented instance of a real image, facilitating the learning of cross real-synthetic image representations. The key insight is that while SSL methods trained solely on synthetic images underperform compared to those trained on real images, a blended training approach using both real and synthetic images leads to more robust and adaptable representations. Experiments demonstrate that DiffMix enhances the SSL methods SimCLR, BarlowTwins, and DINO, across various robustness datasets and domain transfer tasks. DiffMix boosts SimCLR's accuracy on ImageNet-1K by 4.56\%. These results challenge the notion that high-quality real images are crucial for SSL pre-training by showing that lower quality synthetic images can also produce strong representations. DiffMix also reduces the need for image augmentations in SSL, offering new optimization strategies.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2406.12368 [cs.CV]
	(or arXiv:2406.12368v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2406.12368

Submission history

From: Nidhin Harilal [view email]
[v1] Tue, 18 Jun 2024 07:49:11 UTC (10,537 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Mixing Natural and Synthetic Images for Robust Self-Supervised Representations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Mixing Natural and Synthetic Images for Robust Self-Supervised Representations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators