Fill-Up: Balancing Long-Tailed Data with Generative Models

Shin, Joonghyuk; Kang, Minguk; Park, Jaesik

Computer Science > Computer Vision and Pattern Recognition

arXiv:2306.07200 (cs)

[Submitted on 12 Jun 2023]

Title:Fill-Up: Balancing Long-Tailed Data with Generative Models

Authors:Joonghyuk Shin, Minguk Kang, Jaesik Park

View PDF

Abstract:Modern text-to-image synthesis models have achieved an exceptional level of photorealism, generating high-quality images from arbitrary text descriptions. In light of the impressive synthesis ability, several studies have exhibited promising results in exploiting generated data for image recognition. However, directly supplementing data-hungry situations in the real-world (e.g. few-shot or long-tailed scenarios) with existing approaches result in marginal performance gains, as they suffer to thoroughly reflect the distribution of the real data. Through extensive experiments, this paper proposes a new image synthesis pipeline for long-tailed situations using Textual Inversion. The study demonstrates that generated images from textual-inverted text tokens effectively aligns with the real domain, significantly enhancing the recognition ability of a standard ResNet50 backbone. We also show that real-world data imbalance scenarios can be successfully mitigated by filling up the imbalanced data with synthetic images. In conjunction with techniques in the area of long-tailed recognition, our method achieves state-of-the-art results on standard long-tailed benchmarks when trained from scratch.

Comments:	32 pages, 19 Figures, and 10 Tables. Project webpage at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2306.07200 [cs.CV]
	(or arXiv:2306.07200v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.07200

Submission history

From: Joonghyuk Shin [view email]
[v1] Mon, 12 Jun 2023 16:01:20 UTC (33,734 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Fill-Up: Balancing Long-Tailed Data with Generative Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Fill-Up: Balancing Long-Tailed Data with Generative Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators