Speech Robust Bench: A Robustness Benchmark For Speech Recognition

Shah, Muhammad A.; Noguero, David Solans; Heikkila, Mikko A.; Raj, Bhiksha; Kourtellis, Nicolas

arXiv:2403.07937 (eess)

[Submitted on 8 Mar 2024 (v1), last revised 25 Sep 2024 (this version, v2)]

Abstract: As Automatic Speech Recognition (ASR) models become ever more pervasive, it is important to ensure that they make reliable predictions under corruptions present in the physical and digital world. We propose Speech Robust Bench (SRB), a comprehensive benchmark for evaluating the robustness of ASR models to diverse corruptions. SRB is composed of 114 input perturbations that simulate a heterogeneous range of corruptions that ASR models may encounter when deployed in the wild. We use SRB to evaluate the robustness of several state-of-the-art ASR models and observe that model size and certain modeling choices, such as the use of discrete representations or self-training, appear to be conducive to robustness. We extend this analysis to measure the robustness of ASR models on data from various demographic subgroups, namely English and Spanish speakers, and males and females. Our results reveal noticeable disparities in the models' robustness across subgroups. We believe that SRB will significantly facilitate future research towards robust ASR models by making it easier to conduct comprehensive and comparable robustness evaluations.

Comments:	Submitted to NeurIPS Datasets and Benchmarks Track 2025
Subjects:	Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
Cite as:	arXiv:2403.07937 [eess.AS]
	(or arXiv:2403.07937v2 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2403.07937

Submission history

From: Muhammad Shah
[v1] Fri, 8 Mar 2024 08:10:29 UTC (5,276 KB)
[v2] Wed, 25 Sep 2024 00:28:55 UTC (7,130 KB)
