Benchmarking Deep Learning Classifiers: Beyond Accuracy

Dai, Wei; Berleant, Daniel

Computer Science > Machine Learning

arXiv:2103.03102v1 (cs)

A newer version of this paper has been withdrawn by Daniel Berleant

[Submitted on 2 Mar 2021 (this version), latest version 26 Jun 2023 (v5)]

Title:Benchmarking Deep Learning Classifiers: Beyond Accuracy

Authors:Wei Dai, Daniel Berleant

View PDF

Abstract:Previous research evaluating deep learning (DL) classifiers has often used top-1/top-5 accuracy. However, the accuracy of DL classifiers is unstable in that it often changes significantly when retested on imperfect or adversarial images. This paper adds to the small but fundamental body of work on benchmarking the robustness of DL classifiers on imperfect images by proposing a two-dimensional metric, consisting of mean accuracy and coefficient of variation, to measure the robustness of DL classifiers. Spearman's rank correlation coefficient and Pearson's correlation coefficient are used and their independence evaluated. A statistical plot we call mCV is presented which aims to help visualize the robustness of the performance of DL classifiers across varying amounts of imperfection in tested images. Finally, we demonstrate that defective images corrupted by two-factor corruption could be used to improve the robustness of DL classifiers. All source codes and related image sets are shared on a website (this http URL) to support future research projects.

Comments:	7 pages, 6 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF)
Cite as:	arXiv:2103.03102 [cs.LG]
	(or arXiv:2103.03102v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2103.03102

Submission history

From: Wei Dai [view email]
[v1] Tue, 2 Mar 2021 02:10:54 UTC (755 KB)
[v2] Sat, 31 Jul 2021 20:04:25 UTC (1,584 KB)
[v3] Tue, 7 Sep 2021 16:38:56 UTC (1,483 KB)
[v4] Sun, 21 Nov 2021 22:00:46 UTC (863 KB)
[v5] Mon, 26 Jun 2023 17:50:00 UTC (1 KB) (withdrawn)

Computer Science > Machine Learning

Title:Benchmarking Deep Learning Classifiers: Beyond Accuracy

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Benchmarking Deep Learning Classifiers: Beyond Accuracy

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators