Self-Supervised Radiograph Anatomical Region Classification -- How Clean Is Your Real-World Data?

Langer, Simon; Ritter, Jessica; Braren, Rickmer; Rueckert, Daniel; Hager, Paul

Computer Science > Computer Vision and Pattern Recognition

arXiv:2412.15967 (cs)

[Submitted on 20 Dec 2024]

Title:Self-Supervised Radiograph Anatomical Region Classification -- How Clean Is Your Real-World Data?

Authors:Simon Langer, Jessica Ritter, Rickmer Braren, Daniel Rueckert, Paul Hager

View PDF HTML (experimental)

Abstract:Modern deep learning-based clinical imaging workflows rely on accurate labels of the examined anatomical region. Knowing the anatomical region is required to select applicable downstream models and to effectively generate cohorts of high quality data for future medical and machine learning research efforts. However, this information may not be available in externally sourced data or generally contain data entry errors. To address this problem, we show the effectiveness of self-supervised methods such as SimCLR and BYOL as well as supervised contrastive deep learning methods in assigning one of 14 anatomical region classes in our in-house dataset of 48,434 skeletal radiographs. We achieve a strong linear evaluation accuracy of 96.6% with a single model and 97.7% using an ensemble approach. Furthermore, only a few labeled instances (1% of the training set) suffice to achieve an accuracy of 92.2%, enabling usage in low-label and thus low-resource scenarios. Our model can be used to correct data entry mistakes: a follow-up analysis of the test set errors of our best-performing single model by an expert radiologist identified 35% incorrect labels and 11% out-of-domain images. When accounted for, the radiograph anatomical region labelling performance increased -- without and with an ensemble, respectively -- to a theoretical accuracy of 98.0% and 98.8%.

Comments:	12 pages, 4 figures, 2 supplementary figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2412.15967 [cs.CV]
	(or arXiv:2412.15967v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2412.15967

Submission history

From: Paul Hager [view email]
[v1] Fri, 20 Dec 2024 15:07:55 UTC (1,955 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Self-Supervised Radiograph Anatomical Region Classification -- How Clean Is Your Real-World Data?

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Self-Supervised Radiograph Anatomical Region Classification -- How Clean Is Your Real-World Data?

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators