MEDFuse: Multimodal EHR Data Fusion with Masked Lab-Test Modeling and Large Language Models

Phan, Thao Minh Nguyen; Dao, Cong-Tinh; Wu, Chenwei; Wang, Jian-Zhe; Liu, Shun; Ding, Jun-En; Restrepo, David; Liu, Feng; Hung, Fang-Ming; Peng, Wen-Chih

arXiv:2407.12309 (cs)

[Submitted on 17 Jul 2024]

Abstract: Electronic health records (EHRs) are multimodal by nature, consisting of structured tabular features such as lab tests and unstructured clinical notes. In real-life clinical practice, doctors use complementary multimodal EHR data sources to get a clearer picture of patients' health and to support clinical decision-making. However, most EHR predictive models do not reflect these procedures, as they either focus on a single modality or overlook inter-modality interactions and redundancy. In this work, we propose MEDFuse, a Multimodal EHR Data Fusion framework that incorporates masked lab-test modeling and large language models (LLMs) to effectively integrate structured and unstructured medical data. MEDFuse leverages multimodal embeddings extracted from two sources: LLMs fine-tuned on free clinical text and masked tabular transformers trained on structured lab test results. We design a disentangled transformer module, optimized by a mutual information loss, to 1) decouple modality-specific and modality-shared information and 2) extract a useful joint representation from the noise and redundancy present in clinical notes. Through comprehensive validation on the public MIMIC-III dataset and the in-house FEMH dataset, MEDFuse demonstrates great potential in advancing clinical predictions, achieving an F1 score of over 90% in the 10-disease multi-label classification task.
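The abstract does not spell out how masked lab-test modeling is set up, so the following is only a minimal sketch of the general idea: hide a random subset of a patient's lab values and score a reconstruction on the hidden positions alone. The function names, the 30% mask ratio, the sample values, and the mean-fill stand-in for a tabular transformer are all assumptions made for illustration, not details taken from the paper.

```python
import random


def mask_lab_values(values, mask_ratio=0.3, mask_token=None, seed=0):
    """Hide a random subset of lab values (hypothetical helper).

    Returns the masked copy and the sorted indices that were hidden.
    """
    rng = random.Random(seed)
    n_masked = max(1, round(len(values) * mask_ratio))
    masked_idx = sorted(rng.sample(range(len(values)), n_masked))
    masked = list(values)
    for i in masked_idx:
        masked[i] = mask_token
    return masked, masked_idx


def masked_mse(predicted, target, masked_idx):
    """Mean squared error computed only over the hidden positions."""
    return sum((predicted[i] - target[i]) ** 2 for i in masked_idx) / len(masked_idx)


# Made-up, already-normalized lab values for one patient (illustrative only).
labs = [0.12, -0.40, 1.30, 0.05, -0.90, 0.70, 0.33, -0.15, 0.88, -1.10]

masked, idx = mask_lab_values(labs, mask_ratio=0.3)

# Stand-in "model": fill every hidden slot with the mean of the visible values.
# A real system would use a learned encoder here; this is an assumption.
visible = [v for v in masked if v is not None]
mean_visible = sum(visible) / len(visible)
reconstructed = [mean_visible if v is None else v for v in masked]

loss = masked_mse(reconstructed, labs, idx)
print(f"masked positions: {idx}, reconstruction loss: {loss:.4f}")
```

Restricting the loss to the hidden positions is what makes the objective a reconstruction task: the visible values are copied through unchanged and would otherwise dilute the error signal.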

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2407.12309 [cs.CL]
	(or arXiv:2407.12309v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2407.12309

Submission history

From: Jun-En Ding
[v1] Wed, 17 Jul 2024 04:17:09 UTC (3,790 KB)
