Enhancing Free-hand 3D Photoacoustic and Ultrasound Reconstruction using Deep Learning

Lee, SiYeoul; Kim, SeonHo; Seo, Minkyung; Park, SeongKyu; Imrus, Salehin; Ashok, Kambaluru; Lee, DongEon; Park, Chunsu; Lee, SeonYeong; Kim, Jiye; Yoo, Jae-Heung; Kim, MinWoo

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2502.03505 (eess)

[Submitted on 5 Feb 2025]

Title:Enhancing Free-hand 3D Photoacoustic and Ultrasound Reconstruction using Deep Learning

Authors:SiYeoul Lee, SeonHo Kim, Minkyung Seo, SeongKyu Park, Salehin Imrus, Kambaluru Ashok, DongEon Lee, Chunsu Park, SeonYeong Lee, Jiye Kim, Jae-Heung Yoo, MinWoo Kim

View PDF HTML (experimental)

Abstract:This study introduces a motion-based learning network with a global-local self-attention module (MoGLo-Net) to enhance 3D reconstruction in handheld photoacoustic and ultrasound (PAUS) imaging. Standard PAUS imaging is often limited by a narrow field of view and the inability to effectively visualize complex 3D structures. The 3D freehand technique, which aligns sequential 2D images for 3D reconstruction, faces significant challenges in accurate motion estimation without relying on external positional sensors. MoGLo-Net addresses these limitations through an innovative adaptation of the self-attention mechanism, which effectively exploits the critical regions, such as fully-developed speckle area or high-echogenic tissue area within successive ultrasound images to accurately estimate motion parameters. This facilitates the extraction of intricate features from individual frames. Additionally, we designed a patch-wise correlation operation to generate a correlation volume that is highly correlated with the scanning motion. A custom loss function was also developed to ensure robust learning with minimized bias, leveraging the characteristics of the motion parameters. Experimental evaluations demonstrated that MoGLo-Net surpasses current state-of-the-art methods in both quantitative and qualitative performance metrics. Furthermore, we expanded the application of 3D reconstruction technology beyond simple B-mode ultrasound volumes to incorporate Doppler ultrasound and photoacoustic imaging, enabling 3D visualization of vasculature. The source code for this study is publicly available at: this https URL

Subjects:	Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2502.03505 [eess.IV]
	(or arXiv:2502.03505v1 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2502.03505

Submission history

From: SiYeoul Lee [view email]
[v1] Wed, 5 Feb 2025 11:59:23 UTC (24,883 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Enhancing Free-hand 3D Photoacoustic and Ultrasound Reconstruction using Deep Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Enhancing Free-hand 3D Photoacoustic and Ultrasound Reconstruction using Deep Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators