Divide and Conquer: Grounding a Bleeding Areas in Gastrointestinal Image with Two-Stage Model

Lin, Yu-Fan; Qiu, Bo-Cheng; Lee, Chia-Ming; Hsu, Chih-Chung

Computer Science > Computer Vision and Pattern Recognition

arXiv:2412.16723 (cs)

[Submitted on 21 Dec 2024]

Title:Divide and Conquer: Grounding a Bleeding Areas in Gastrointestinal Image with Two-Stage Model

Authors:Yu-Fan Lin, Bo-Cheng Qiu, Chia-Ming Lee, Chih-Chung Hsu

View PDF HTML (experimental)

Abstract:Accurate detection and segmentation of gastrointestinal bleeding are critical for diagnosing diseases such as peptic ulcers and colorectal cancer. This study proposes a two-stage framework that decouples classification and grounding to address the inherent challenges posed by traditional Multi-Task Learning models, which jointly optimizes classification and segmentation. Our approach separates these tasks to achieve targeted optimization for each. The model first classifies images as bleeding or non-bleeding, thereby isolating subsequent grounding from inter-task interference and label heterogeneity. To further enhance performance, we incorporate Stochastic Weight Averaging and Test-Time Augmentation, which improve model robustness against domain shifts and annotation inconsistencies. Our method is validated on the Auto-WCEBleedGen Challenge V2 Challenge dataset and achieving second place. Experimental results demonstrate significant improvements in classification accuracy and segmentation precision, especially on sequential datasets with consistent visual patterns. This study highlights the practical benefits of a two-stage strategy for medical image analysis and sets a new standard for GI bleeding detection and segmentation. Our code is publicly available at this GitHub repository.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2412.16723 [cs.CV]
	(or arXiv:2412.16723v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2412.16723

Submission history

From: Bo-Cheng Qiu [view email]
[v1] Sat, 21 Dec 2024 18:18:12 UTC (1,428 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Divide and Conquer: Grounding a Bleeding Areas in Gastrointestinal Image with Two-Stage Model

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Divide and Conquer: Grounding a Bleeding Areas in Gastrointestinal Image with Two-Stage Model

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators