Missingness Augmentation: A General Approach for Improving Generative Imputation Models

Wang, Yufeng; Li, Dan; Xu, Cong; Yang, Min

Computer Science > Machine Learning

arXiv:2108.02566 (cs)

[Submitted on 31 Jul 2021 (v1), last revised 6 Apr 2023 (this version, v2)]

Title:Missingness Augmentation: A General Approach for Improving Generative Imputation Models

Authors:Yufeng Wang, Dan Li, Cong Xu, Min Yang

View PDF

Abstract:Missing data imputation is a fundamental problem in data analysis, and many studies have been conducted to improve its performance by exploring model structures and learning procedures. However, data augmentation, as a simple yet effective method, has not received enough attention in this area. In this paper, we propose a novel data augmentation method called Missingness Augmentation (MisA) for generative imputation models. Our approach dynamically produces incomplete samples at each epoch by utilizing the generator's output, constraining the augmented samples using a simple reconstruction loss, and combining this loss with the original loss to form the final optimization objective. As a general augmentation technique, MisA can be easily integrated into generative imputation frameworks, providing a simple yet effective way to enhance their performance. Experimental results demonstrate that MisA significantly improves the performance of many recently proposed generative imputation models on a variety of tabular and image datasets. The code is available at \url{this https URL}.

Comments:	20 pages
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2108.02566 [cs.LG]
	(or arXiv:2108.02566v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2108.02566

Submission history

From: Min Yang [view email]
[v1] Sat, 31 Jul 2021 08:51:46 UTC (3,764 KB)
[v2] Thu, 6 Apr 2023 06:05:14 UTC (3,630 KB)

Computer Science > Machine Learning

Title:Missingness Augmentation: A General Approach for Improving Generative Imputation Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Missingness Augmentation: A General Approach for Improving Generative Imputation Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators