Exploring Optical Flow Inclusion into nnU-Net Framework for Surgical Instrument Segmentation

Fernández-Rodríguez, Marcos; Silva, Bruno; Queirós, Sandro; Torres, Helena R.; Oliveira, Bruno; Morais, Pedro; Buschle, Lukas R.; Correia-Pinto, Jorge; Lima, Estevão; Vilaça, João L.

doi:10.1117/12.3006855

Computer Science > Computer Vision and Pattern Recognition

arXiv:2403.10216 (cs)

[Submitted on 15 Mar 2024]

Title:Exploring Optical Flow Inclusion into nnU-Net Framework for Surgical Instrument Segmentation

Authors:Marcos Fernández-Rodríguez, Bruno Silva, Sandro Queirós, Helena R. Torres, Bruno Oliveira, Pedro Morais, Lukas R. Buschle, Jorge Correia-Pinto, Estevão Lima, João L. Vilaça

View PDF HTML (experimental)

Abstract:Surgical instrument segmentation in laparoscopy is essential for computer-assisted surgical systems. Despite the Deep Learning progress in recent years, the dynamic setting of laparoscopic surgery still presents challenges for precise segmentation. The nnU-Net framework excelled in semantic segmentation analyzing single frames without temporal information. The framework's ease of use, including its ability to be automatically configured, and its low expertise requirements, have made it a popular base framework for comparisons. Optical flow (OF) is a tool commonly used in video tasks to estimate motion and represent it in a single frame, containing temporal information. This work seeks to employ OF maps as an additional input to the nnU-Net architecture to improve its performance in the surgical instrument segmentation task, taking advantage of the fact that instruments are the main moving objects in the surgical field. With this new input, the temporal component would be indirectly added without modifying the architecture. Using CholecSeg8k dataset, three different representations of movement were estimated and used as new inputs, comparing them with a baseline model. Results showed that the use of OF maps improves the detection of classes with high movement, even when these are scarce in the dataset. To further improve performance, future work may focus on implementing other OF-preserving augmentations.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2403.10216 [cs.CV]
	(or arXiv:2403.10216v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.10216
Journal reference:	Proceedings Volume 12928, Medical Imaging 2024: Image-Guided Procedures, Robotic Interventions, and Modeling; 1292827 (2024)
Related DOI:	https://doi.org/10.1117/12.3006855

Submission history

From: Marcos Fernández-Rodríguez [view email]
[v1] Fri, 15 Mar 2024 11:36:26 UTC (7,853 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Exploring Optical Flow Inclusion into nnU-Net Framework for Surgical Instrument Segmentation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Exploring Optical Flow Inclusion into nnU-Net Framework for Surgical Instrument Segmentation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators