MedRAT: Unpaired Medical Report Generation via Auxiliary Tasks

Hirsch, Elad; Dawidowicz, Gefen; Tal, Ayellet

Computer Science > Computer Vision and Pattern Recognition

arXiv:2407.03919 (cs)

[Submitted on 4 Jul 2024 (v1), last revised 22 Jul 2024 (this version, v2)]

Title:MedRAT: Unpaired Medical Report Generation via Auxiliary Tasks

Authors:Elad Hirsch, Gefen Dawidowicz, Ayellet Tal

View PDF HTML (experimental)

Abstract:Medical report generation from X-ray images is a challenging task, particularly in an unpaired setting where paired image-report data is unavailable for training. To address this challenge, we propose a novel model that leverages the available information in two distinct datasets, one comprising reports and the other consisting of images. The core idea of our model revolves around the notion that combining auto-encoding report generation with multi-modal (report-image) alignment can offer a solution. However, the challenge persists regarding how to achieve this alignment when pair correspondence is absent. Our proposed solution involves the use of auxiliary tasks, particularly contrastive learning and classification, to position related images and reports in close proximity to each other. This approach differs from previous methods that rely on pre-processing steps, such as using external information stored in a knowledge graph. Our model, named MedRAT, surpasses previous state-of-the-art methods, demonstrating the feasibility of generating comprehensive medical reports without the need for paired data or external tools.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.03919 [cs.CV]
	(or arXiv:2407.03919v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.03919

Submission history

From: Elad Hirsch [view email]
[v1] Thu, 4 Jul 2024 13:31:47 UTC (1,437 KB)
[v2] Mon, 22 Jul 2024 07:49:34 UTC (1,091 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:MedRAT: Unpaired Medical Report Generation via Auxiliary Tasks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:MedRAT: Unpaired Medical Report Generation via Auxiliary Tasks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators