MedYOLO: A Medical Image Object Detection Framework

Sobek, Joseph; Inojosa, Jose R. Medina; Inojosa, Betsy J. Medina; Rassoulinejad-Mousavi, S. M.; Conte, Gian Marco; Lopez-Jimenez, Francisco; Erickson, Bradley J.

doi:10.1007/s10278-024-01138-2

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2312.07729 (eess)

[Submitted on 12 Dec 2023 (v1), last revised 7 Jun 2024 (this version, v2)]

Title:MedYOLO: A Medical Image Object Detection Framework

Authors:Joseph Sobek, Jose R. Medina Inojosa, Betsy J. Medina Inojosa, S. M. Rassoulinejad-Mousavi, Gian Marco Conte, Francisco Lopez-Jimenez, Bradley J. Erickson

View PDF

Abstract:Artificial intelligence-enhanced identification of organs, lesions, and other structures in medical imaging is typically done using convolutional neural networks (CNNs) designed to make voxel-accurate segmentations of the region of interest. However, the labels required to train these CNNs are time-consuming to generate and require attention from subject matter experts to ensure quality. For tasks where voxel-level precision is not required, object detection models offer a viable alternative that can reduce annotation effort. Despite this potential application, there are few options for general purpose object detection frameworks available for 3-D medical imaging. We report on MedYOLO, a 3-D object detection framework using the one-shot detection method of the YOLO family of models and designed for use with medical imaging. We tested this model on four different datasets: BRaTS, LIDC, an abdominal organ Computed Tomography (CT) dataset, and an ECG-gated heart CT dataset. We found our models achieve high performance on commonly present medium and large-sized structures such as the heart, liver, and pancreas even without hyperparameter tuning. However, the models struggle with very small or rarely present structures.

Comments:	J Digit Imaging. Inform. med. (2024)
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2312.07729 [eess.IV]
	(or arXiv:2312.07729v2 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2312.07729
Related DOI:	https://doi.org/10.1007/s10278-024-01138-2

Submission history

From: Joseph Sobek [view email]
[v1] Tue, 12 Dec 2023 20:46:14 UTC (1,152 KB)
[v2] Fri, 7 Jun 2024 16:53:15 UTC (629 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:MedYOLO: A Medical Image Object Detection Framework

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:MedYOLO: A Medical Image Object Detection Framework

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators