Multimodal AI on Wound Images and Clinical Notes for Home Patient Referral

Fard, Reza Saadati; Agu, Emmanuel; Busaranuvong, Palawat; Kumar, Deepak; Gautam, Shefalika; Tulu, Bengisu; Strong, Diane

Computer Science > Machine Learning

arXiv:2501.13247 (cs)

[Submitted on 22 Jan 2025]


Abstract: Chronic wounds affect 8.5 million Americans, particularly the elderly and patients with diabetes. These wounds can take up to nine months to heal, making regular care essential to ensure healing and prevent severe outcomes like limb amputations. Many patients receive care at home from visiting nurses with varying levels of wound expertise, leading to inconsistent care. Problematic, non-healing wounds should be referred to wound specialists, but referral decisions in non-clinical settings are often erroneous, delayed, or unnecessary.
This paper introduces the Deep Multimodal Wound Assessment Tool (DM-WAT), a machine learning framework designed to assist visiting nurses in deciding whether to refer chronic wound patients. DM-WAT analyzes smartphone-captured wound images and clinical notes from Electronic Health Records (EHRs). It uses DeiT-Base-Distilled, a Vision Transformer (ViT), to extract visual features from images and DeBERTa-base to extract text features from clinical notes. DM-WAT combines visual and text features using an intermediate fusion approach. To address challenges posed by a small and imbalanced dataset, it integrates image and text augmentation with transfer learning to achieve high performance. In evaluations, DM-WAT achieved an accuracy of 77% (standard deviation 3%) and an F1 score of 70% (standard deviation 2%), outperforming prior approaches. Score-CAM and Captum interpretation algorithms provide insights into the specific parts of image and text inputs that influence recommendations, enhancing interpretability and trust.
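The intermediate fusion step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the feature vectors are random stand-ins for the DeiT-Base-Distilled and DeBERTa-base outputs, the 768-dimensional sizes are assumed from the public base-size model configurations, and `NUM_CLASSES`, `init_linear`, and `fuse_and_classify` are hypothetical names chosen for illustration. The abstract does not state the number of referral classes or the layers that follow fusion, so a single linear layer with two classes is assumed here.

```python
import math
import random

IMAGE_DIM = 768   # assumed embedding size of the image encoder
TEXT_DIM = 768    # assumed embedding size of the text encoder
NUM_CLASSES = 2   # hypothetical: the abstract does not give the class count


def init_linear(in_dim, out_dim, rng):
    """Return (weights, bias) for a dense layer with small random weights."""
    scale = 1.0 / math.sqrt(in_dim)
    weights = [[rng.uniform(-scale, scale) for _ in range(in_dim)]
               for _ in range(out_dim)]
    bias = [0.0] * out_dim
    return weights, bias


def linear(x, weights, bias):
    """Apply y = Wx + b to a single feature vector."""
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_and_classify(image_features, text_features, weights, bias):
    """Intermediate fusion: concatenate both embeddings, then classify."""
    fused = list(image_features) + list(text_features)
    return softmax(linear(fused, weights, bias))


rng = random.Random(0)
# Random stand-ins for the encoder outputs of one wound image and one note.
image_features = [rng.gauss(0, 1) for _ in range(IMAGE_DIM)]
text_features = [rng.gauss(0, 1) for _ in range(TEXT_DIM)]
weights, bias = init_linear(IMAGE_DIM + TEXT_DIM, NUM_CLASSES, rng)

probs = fuse_and_classify(image_features, text_features, weights, bias)
print(probs)  # untrained class probabilities, for shape checking only
```

Concatenating at the embedding level, rather than averaging per-modality predictions, lets the classifier learn interactions between visual and textual cues. In practice the fused vector would feed a trained classification head rather than the untrained layer used here.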

Comments:	arXiv admin note: text overlap with arXiv:2208.05051 by other authors
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
Cite as:	arXiv:2501.13247 [cs.LG]
	(or arXiv:2501.13247v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2501.13247

Submission history

From: Reza Saadati Fard
[v1] Wed, 22 Jan 2025 21:58:04 UTC (4,445 KB)
