Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection

Niizumi, Daisuke; Takeuchi, Daiki; Ohishi, Yasunori; Harada, Noboru; Kashino, Kunio

Electrical Engineering and Systems Science > Audio and Speech Processing

arXiv:2404.17107 (eess)

[Submitted on 26 Apr 2024]

Title:Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection

Authors:Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino

View PDF HTML (experimental)

Abstract:To reduce the need for skilled clinicians in heart sound interpretation, recent studies on automating cardiac auscultation have explored deep learning approaches. However, despite the demands for large data for deep learning, the size of the heart sound datasets is limited, and no pre-trained model is available. On the contrary, many pre-trained models for general audio tasks are available as general-purpose audio representations. This study explores the potential of general-purpose audio representations pre-trained on large-scale datasets for transfer learning in heart murmur detection. Experiments on the CirCor DigiScope heart sound dataset show that the recent self-supervised learning Masked Modeling Duo (M2D) outperforms previous methods with the results of a weighted accuracy of 0.832 and an unweighted average recall of 0.713. Experiments further confirm improved performance by ensembling M2D with other models. These results demonstrate the effectiveness of general-purpose audio representation in processing heart sounds and open the way for further applications. Our code is available online which runs on a 24 GB consumer GPU at this https URL

Comments:	4 pages, 1 figure, and 4 tables. Accepted by IEEE EMBC 2024
Subjects:	Audio and Speech Processing (eess.AS); Sound (cs.SD)
MSC classes:	68T07
Cite as:	arXiv:2404.17107 [eess.AS]
	(or arXiv:2404.17107v1 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2404.17107

Submission history

From: Daisuke Niizumi [view email]
[v1] Fri, 26 Apr 2024 01:52:50 UTC (442 KB)

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators