Hierarchical Label Propagation: A Model-Size-Dependent Performance Booster for AudioSet Tagging

Tuncay, Ludovic; Labbé, Etienne; Pellegrini, Thomas

doi:10.1109/ICASSP49660.2025.10888798

Computer Science > Sound

arXiv:2503.21826 (cs)

[Submitted on 26 Mar 2025]

Title:Hierarchical Label Propagation: A Model-Size-Dependent Performance Booster for AudioSet Tagging

Authors:Ludovic Tuncay (IRIT-SAMoVA), Etienne Labbé (IRIT-SAMoVA), Thomas Pellegrini (IRIT-SAMoVA, UT3)

View PDF

Abstract:AudioSet is one of the most used and largest datasets in audio tagging, containing about 2 million audio samples that are manually labeled with 527 event categories organized into an ontology. However, the annotations contain inconsistencies, particularly where categories that should be labeled as positive according to the ontology are frequently mislabeled as negative. To address this issue, we apply Hierarchical Label Propagation (HLP), which propagates labels up the ontology hierarchy, resulting in a mean increase in positive labels per audio clip from 1.98 to 2.39 and affecting 109 out of the 527 classes. Our results demonstrate that HLP provides performance benefits across various model architectures, including convolutional neural networks (PANN's CNN6 and ConvNeXT) and transformers (PaSST), with smaller models showing more improvements. Finally, on FSD50K, another widely used dataset, models trained on AudioSet with HLP consistently outperformed those trained without HLP. Our source code will be made available on GitHub.

Subjects:	Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2503.21826 [cs.SD]
	(or arXiv:2503.21826v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2503.21826
Journal reference:	ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2025, Hyderabad, India. pp.1-5
Related DOI:	https://doi.org/10.1109/ICASSP49660.2025.10888798

Submission history

From: Ludovic Tuncay [view email] [via CCSD proxy]
[v1] Wed, 26 Mar 2025 08:45:43 UTC (105 KB)

Computer Science > Sound

Title:Hierarchical Label Propagation: A Model-Size-Dependent Performance Booster for AudioSet Tagging

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:Hierarchical Label Propagation: A Model-Size-Dependent Performance Booster for AudioSet Tagging

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators