Swin-X2S: Reconstructing 3D Shape from 2D Biplanar X-ray with Swin Transformers

Liu, Kuan; Ying, Zongyuan; Jin, Jie; Li, Dongyan; Huang, Ping; Wu, Wenjian; Chen, Zhe; Qi, Jin; Lu, Yong; Deng, Lianfu; Chen, Bo

Computer Science > Computer Vision and Pattern Recognition

arXiv:2501.05961 (cs)

[Submitted on 10 Jan 2025]

Title:Swin-X2S: Reconstructing 3D Shape from 2D Biplanar X-ray with Swin Transformers

Authors:Kuan Liu, Zongyuan Ying, Jie Jin, Dongyan Li, Ping Huang, Wenjian Wu, Zhe Chen, Jin Qi, Yong Lu, Lianfu Deng, Bo Chen

View PDF HTML (experimental)

Abstract:The conversion from 2D X-ray to 3D shape holds significant potential for improving diagnostic efficiency and safety. However, existing reconstruction methods often rely on hand-crafted features, manual intervention, and prior knowledge, resulting in unstable shape errors and additional processing costs. In this paper, we introduce Swin-X2S, an end-to-end deep learning method for directly reconstructing 3D segmentation and labeling from 2D biplanar orthogonal X-ray images. Swin-X2S employs an encoder-decoder architecture: the encoder leverages 2D Swin Transformer for X-ray information extraction, while the decoder employs 3D convolution with cross-attention to integrate structural features from orthogonal views. A dimension-expanding module is introduced to bridge the encoder and decoder, ensuring a smooth conversion from 2D pixels to 3D voxels. We evaluate proposed method through extensive qualitative and quantitative experiments across nine publicly available datasets covering four anatomies (femur, hip, spine, and rib), with a total of 54 categories. Significant improvements over previous methods have been observed not only in the segmentation and labeling metrics but also in the clinically relevant parameters that are of primary concern in practical applications, which demonstrates the promise of Swin-X2S to provide an effective option for anatomical shape reconstruction in clinical scenarios. Code implementation is available at: \url{this https URL}.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
Cite as:	arXiv:2501.05961 [cs.CV]
	(or arXiv:2501.05961v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2501.05961

Submission history

From: Kuan Liu [view email]
[v1] Fri, 10 Jan 2025 13:41:10 UTC (9,564 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Swin-X2S: Reconstructing 3D Shape from 2D Biplanar X-ray with Swin Transformers

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Swin-X2S: Reconstructing 3D Shape from 2D Biplanar X-ray with Swin Transformers

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators