HC-LLM: Historical-Constrained Large Language Models for Radiology Report Generation

Liu, Tengfei; Wang, Jiapu; Hu, Yongli; Li, Mingjie; Yi, Junfei; Chang, Xiaojun; Gao, Junbin; Yin, Baocai

Computer Science > Computer Vision and Pattern Recognition

arXiv:2412.11070 (cs)

[Submitted on 15 Dec 2024]

Title:HC-LLM: Historical-Constrained Large Language Models for Radiology Report Generation

Authors:Tengfei Liu, Jiapu Wang, Yongli Hu, Mingjie Li, Junfei Yi, Xiaojun Chang, Junbin Gao, Baocai Yin

View PDF HTML (experimental)

Abstract:Radiology report generation (RRG) models typically focus on individual exams, often overlooking the integration of historical visual or textual data, which is crucial for patient follow-ups. Traditional methods usually struggle with long sequence dependencies when incorporating historical information, but large language models (LLMs) excel at in-context learning, making them well-suited for analyzing longitudinal medical data. In light of this, we propose a novel Historical-Constrained Large Language Models (HC-LLM) framework for RRG, empowering LLMs with longitudinal report generation capabilities by constraining the consistency and differences between longitudinal images and their corresponding reports. Specifically, our approach extracts both time-shared and time-specific features from longitudinal chest X-rays and diagnostic reports to capture disease progression. Then, we ensure consistent representation by applying intra-modality similarity constraints and aligning various features across modalities with multimodal contrastive and structural constraints. These combined constraints effectively guide the LLMs in generating diagnostic reports that accurately reflect the progression of the disease, achieving state-of-the-art results on the Longitudinal-MIMIC dataset. Notably, our approach performs well even without historical data during testing and can be easily adapted to other multimodal large models, enhancing its versatility.

Comments:	Accepted by AAAI2025
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2412.11070 [cs.CV]
	(or arXiv:2412.11070v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2412.11070

Submission history

From: Tengfei Liu [view email]
[v1] Sun, 15 Dec 2024 06:04:16 UTC (2,443 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:HC-LLM: Historical-Constrained Large Language Models for Radiology Report Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:HC-LLM: Historical-Constrained Large Language Models for Radiology Report Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators