Uncovering bias in the PlantVillage dataset

Noyan, Mehmet Alican

arXiv:2206.04374 (cs)

[Submitted on 9 Jun 2022]

Abstract: We report our investigation into the use of the popular PlantVillage dataset for training deep-learning-based plant disease detection models. We trained a machine learning model using only 8 pixels from the PlantVillage image backgrounds. The model achieved 49.0% accuracy on the held-out test set, well above the random-guessing accuracy of 2.6%. This result indicates that the PlantVillage dataset contains noise correlated with the labels, and that deep learning models can easily exploit this bias to make predictions. We discuss possible approaches to alleviating this problem.
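
To make the experiment in the abstract concrete, the sketch below shows one way 8 background pixels could be turned into a feature vector for a classifier. The abstract does not say which 8 pixels were used or which model was trained, so the pixel locations here (4 corners and 4 edge midpoints) and the function names are illustrative assumptions, not the author's method. The class count of 38 is also an assumption, chosen because it is consistent with the stated 2.6% random-guessing accuracy (1/38 ≈ 2.63%).

```python
# Hypothetical sketch: function names and pixel locations are assumptions,
# not taken from the paper.

def background_pixel_features(image):
    """Flatten the RGB values of 8 border pixels into a 24-value list.

    `image` is a height x width grid (list of rows) of (r, g, b) tuples.
    The sampled locations (4 corners + 4 edge midpoints) are assumed.
    """
    height, width = len(image), len(image[0])
    bottom, right = height - 1, width - 1
    mid_row, mid_col = height // 2, width // 2
    locations = [
        (0, 0), (0, mid_col), (0, right),                  # top edge
        (mid_row, 0), (mid_row, right),                    # left/right midpoints
        (bottom, 0), (bottom, mid_col), (bottom, right),   # bottom edge
    ]
    features = []
    for row, col in locations:
        features.extend(image[row][col])  # append r, g, b in order
    return features


def random_guess_accuracy(num_classes):
    """Expected accuracy of uniform random guessing over balanced classes."""
    return 1.0 / num_classes


if __name__ == "__main__":
    # Toy 5x5 "image" whose pixel values encode their own coordinates.
    toy = [[(row, col, 0) for col in range(5)] for row in range(5)]
    print(len(background_pixel_features(toy)))   # 8 pixels x 3 channels
    print(round(100 * random_guess_accuracy(38), 1))
```

A feature vector like this contains no leaf pixels at all when the leaf is centered in the frame, so any classifier that beats the random-guessing baseline on it must be relying on background information correlated with the labels.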

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2206.04374 [cs.CV]
	(or arXiv:2206.04374v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2206.04374

Submission history

From: Mehmet Alican Noyan
[v1] Thu, 9 Jun 2022 09:32:35 UTC (4,536 KB)
