Statistical Challenges with Dataset Construction: Why You Will Never Have Enough Images

Goldman, Josh; Tsotsos, John K.

Computer Science > Computer Vision and Pattern Recognition

arXiv:2408.11160 (cs)

[Submitted on 20 Aug 2024]

Title:Statistical Challenges with Dataset Construction: Why You Will Never Have Enough Images

Authors:Josh Goldman, John K. Tsotsos

View PDF HTML (experimental)

Abstract:Deep neural networks have achieved impressive performance on many computer vision benchmarks in recent years. However, can we be confident that impressive performance on benchmarks will translate to strong performance in real-world environments? Many environments in the real world are safety critical, and even slight model failures can be catastrophic. Therefore, it is crucial to test models rigorously before deployment. We argue, through both statistical theory and empirical evidence, that selecting representative image datasets for testing a model is likely implausible in many domains. Furthermore, performance statistics calculated with non-representative image datasets are highly unreliable. As a consequence, we cannot guarantee that models which perform well on withheld test images will also perform well in the real world. Creating larger and larger datasets will not help, and bias aware datasets cannot solve this problem either. Ultimately, there is little statistical foundation for evaluating models using withheld test sets. We recommend that future evaluation methodologies focus on assessing a model's decision-making process, rather than metrics such as accuracy.

Comments:	13 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
Cite as:	arXiv:2408.11160 [cs.CV]
	(or arXiv:2408.11160v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2408.11160

Submission history

From: Josh Goldman [view email]
[v1] Tue, 20 Aug 2024 19:33:24 UTC (230 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Statistical Challenges with Dataset Construction: Why You Will Never Have Enough Images

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Statistical Challenges with Dataset Construction: Why You Will Never Have Enough Images

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators