Label Denoising through Cross-Model Agreement

Wang, Yu; Xin, Xin; Meng, Zaiqiao; Jose, Joemon; Feng, Fuli

arXiv:2308.13976 (cs)

[Submitted on 27 Aug 2023 (v1), last revised 19 Dec 2023 (this version, v3)]

Abstract: Learning from corrupted labels is very common in real-world machine-learning applications. Memorizing such noisy labels can harm model learning and lead to sub-optimal performance. In this work, we propose a novel framework for learning robust machine-learning models from noisy labels. Through an empirical study, we find that different models make relatively similar predictions on clean examples, while their predictions on noisy examples vary much more. Motivated by this observation, we propose denoising with cross-model agreement (DeCA), which aims to minimize the KL-divergence between the true label distributions parameterized by two machine-learning models while maximizing the likelihood of the observed data. We apply DeCA to both the binary-label scenario and the multiple-label scenario. For the binary-label scenario, we select implicit feedback recommendation as the downstream task and conduct experiments with four state-of-the-art recommendation models on four datasets. For the multiple-label scenario, the downstream application is image classification on two benchmark datasets. Experimental results demonstrate that the proposed methods significantly improve model performance compared with normal training and other denoising methods in both the binary-label and multiple-label scenarios.
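
The observation in the abstract, that two models agree on clean examples and diverge on noisy ones, can be illustrated with a small sketch. This is not the DeCA objective itself, which also includes a data-likelihood term and is defined in the paper. It only computes a symmetrised KL divergence between two predicted label distributions as a per-example disagreement score. The function names and the probability values are hypothetical and chosen for illustration only.

```python
import math


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions given as lists of probabilities."""
    # eps guards against log(0) when q assigns zero mass to a class.
    return sum(
        pi * math.log((pi + eps) / (qi + eps))
        for pi, qi in zip(p, q)
        if pi > 0
    )


def disagreement(p, q):
    """Symmetrised KL divergence, used here as a disagreement score (an assumption)."""
    return 0.5 * (kl_divergence(p, q) + kl_divergence(q, p))


# Hypothetical predicted label distributions from two different models.
clean_a, clean_b = [0.90, 0.10], [0.85, 0.15]  # models roughly agree
noisy_a, noisy_b = [0.80, 0.20], [0.30, 0.70]  # models disagree

clean_score = disagreement(clean_a, clean_b)
noisy_score = disagreement(noisy_a, noisy_b)

print(f"clean example disagreement: {clean_score:.4f}")
print(f"noisy example disagreement: {noisy_score:.4f}")
```

Under these made-up numbers the second pair scores much higher than the first, which is the kind of signal the abstract describes using to separate noisy labels from clean ones.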

Comments:	arXiv admin note: substantial text overlap with arXiv:2105.09605
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2308.13976 [cs.LG]
	(or arXiv:2308.13976v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2308.13976

Submission history

From: Yu Wang [view email]
[v1] Sun, 27 Aug 2023 00:31:04 UTC (13,757 KB)
[v2] Fri, 1 Sep 2023 07:38:59 UTC (13,757 KB)
[v3] Tue, 19 Dec 2023 04:44:35 UTC (13,758 KB)
