OdontoAI: A human-in-the-loop labeled data set and an online platform to boost research on dental panoramic radiographs

Silva, Bernardo; Pinheiro, Laís; Sobrinho, Brenda; Lima, Fernanda; Sobrinho, Bruna; Abdalla, Kalyf; Pithon, Matheus; Cury, Patrícia; Oliveira, Luciano

Computer Science > Computer Vision and Pattern Recognition

arXiv:2203.15856 (cs)

[Submitted on 29 Mar 2022]

Title:OdontoAI: A human-in-the-loop labeled data set and an online platform to boost research on dental panoramic radiographs

Authors:Bernardo Silva, Laís Pinheiro, Brenda Sobrinho, Fernanda Lima, Bruna Sobrinho, Kalyf Abdalla, Matheus Pithon, Patrícia Cury, Luciano Oliveira

View PDF

Abstract:Deep learning has remarkably advanced in the last few years, supported by large labeled data sets. These data sets are precious yet scarce because of the time-consuming labeling procedures, discouraging researchers from producing them. This scarcity is especially true in dentistry, where deep learning applications are still in an embryonic stage. Motivated by this background, we address in this study the construction of a public data set of dental panoramic radiographs. Our objects of interest are the teeth, which are segmented and numbered, as they are the primary targets for dentists when screening a panoramic radiograph. We benefited from the human-in-the-loop (HITL) concept to expedite the labeling procedure, using predictions from deep neural networks as provisional labels, later verified by human annotators. All the gathering and labeling procedures of this novel data set is thoroughly analyzed. The results were consistent and behaved as expected: At each HITL iteration, the model predictions improved. Our results demonstrated a 51% labeling time reduction using HITL, saving us more than 390 continuous working hours. In a novel online platform, called OdontoAI, created to work as task central for this novel data set, we released 4,000 images, from which 2,000 have their labels publicly available for model fitting. The labels of the other 2,000 images are private and used for model evaluation considering instance and semantic segmentation and numbering. To the best of our knowledge, this is the largest-scale publicly available data set for panoramic radiographs, and the OdontoAI is the first platform of its kind in dentistry.

Comments:	45 pages, 11 figures, journal preprint
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2203.15856 [cs.CV]
	(or arXiv:2203.15856v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2203.15856

Submission history

From: Luciano Oliveira [view email]
[v1] Tue, 29 Mar 2022 18:57:23 UTC (17,828 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:OdontoAI: A human-in-the-loop labeled data set and an online platform to boost research on dental panoramic radiographs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:OdontoAI: A human-in-the-loop labeled data set and an online platform to boost research on dental panoramic radiographs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators