GenerateCT: Text-Guided 3D Chest CT Generation

Hamamci, Ibrahim Ethem; Er, Sezgin; Simsar, Enis; Tezcan, Alperen; Simsek, Ayse Gulnihan; Almas, Furkan; Esirgun, Sevval Nil; Reynaud, Hadrien; Pati, Sarthak; Bluethgen, Christian; Menze, Bjoern

arXiv:2305.16037v2 (cs)

[Submitted on 25 May 2023 (v1), revised 26 May 2023 (this version, v2), latest version 12 Jul 2024 (v5)]

Abstract: Generative modeling has experienced substantial progress in recent years, particularly in text-to-image and text-to-video synthesis. However, the medical field has not yet fully exploited the potential of large-scale foundational models for synthetic data generation. In this paper, we introduce GenerateCT, the first method for text-conditional computed tomography (CT) generation, addressing the limitations in 3D medical imaging research and making our entire framework open-source. GenerateCT consists of a pre-trained large language model, a transformer-based text-conditional 3D chest CT generation architecture, and a text-conditional spatial super-resolution diffusion model. We also propose CT-ViT, which efficiently compresses CT volumes while preserving autoregressiveness in depth, enabling the generation of 3D CT volumes with variable numbers of axial slices. Our experiments demonstrate that GenerateCT can produce realistic, high-resolution, and high-fidelity 3D chest CT volumes consistent with medical language text prompts. We further investigate the potential of GenerateCT by training a model using generated CT volumes for multi-abnormality classification of chest CT volumes. Our contributions provide a valuable foundation for future research in text-conditional 3D medical image generation and have the potential to accelerate advancements in medical imaging research. Our code, pre-trained models, and generated data are available at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2305.16037 [cs.CV]
	(or arXiv:2305.16037v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2305.16037

Submission history

From: Ibrahim Hamamci
[v1] Thu, 25 May 2023 13:16:39 UTC (2,776 KB)
[v2] Fri, 26 May 2023 08:47:50 UTC (2,775 KB)
[v3] Sun, 26 Nov 2023 20:59:44 UTC (7,194 KB)
[v4] Mon, 11 Mar 2024 14:37:26 UTC (9,079 KB)
[v5] Fri, 12 Jul 2024 11:28:05 UTC (9,078 KB)
