Complexity Scaling for Speech Denoising

Chen, Hangting; Yu, Jianwei; Weng, Chao

arXiv:2309.07757 (eess)

[Submitted on 14 Sep 2023]

Abstract: Computational complexity is critical when deploying deep learning-based speech denoising models in on-device applications. Most prior research has focused on optimizing model architectures to meet specific computational cost constraints, often producing a distinct neural network architecture for each complexity budget. This study conducts complexity scaling for speech denoising, aiming to consolidate models of various complexities into a unified architecture. We present a Multi-Path Transform-based (MPT) architecture that handles both low- and high-complexity scenarios. A series of MPT networks achieves high performance across a wide range of computational complexities on the DNS challenge dataset. Moreover, inspired by scaling experiments in natural language processing, we explore the empirical relationship between model performance and computational cost on the denoising task. As the complexity, measured in multiply-accumulate operations (MACs), is scaled from 50M/s to 15G/s on MPT networks, we observe that PESQ-WB and SI-SNR increase linearly with the logarithm of MACs, which might contribute to the understanding and application of complexity scaling in speech denoising tasks.
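
The log-linear relationship described in the abstract can be illustrated with a small fitting sketch. This is not the authors' code: the function names `fit_log_linear` and `predict` are hypothetical, and the intercept, slope, and score values are made-up placeholders used only to exercise the fit, not results from the paper. It assumes a model of the form score ≈ a + b · log10(MACs/s) and estimates a and b by ordinary least squares.

```python
import math


def fit_log_linear(macs_per_second, scores):
    """Least-squares fit of score ~= a + b * log10(MACs/s).

    Hypothetical helper for illustration; returns (a, b).
    """
    xs = [math.log10(m) for m in macs_per_second]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(scores) / n
    # Closed-form simple linear regression on the log-transformed cost.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, scores))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b


def predict(a, b, macs_per_second):
    """Evaluate the fitted line at a given compute budget (MACs/s)."""
    return a + b * math.log10(macs_per_second)


# Placeholder coefficients (assumptions, NOT values reported in the paper).
TRUE_A, TRUE_B = -1.0, 0.4

# Compute budgets spanning the 50M/s to 15G/s range named in the abstract.
macs = [50e6, 200e6, 1e9, 4e9, 15e9]

# Synthetic scores generated exactly on the assumed line.
scores = [TRUE_A + TRUE_B * math.log10(m) for m in macs]

a, b = fit_log_linear(macs, scores)

# Under this model, doubling MACs adds a constant b * log10(2) to the score.
gain_per_doubling = predict(a, b, 2e9) - predict(a, b, 1e9)
print(f"a={a:.3f}, b={b:.3f}, gain per doubling={gain_per_doubling:.3f}")
```

One consequence of such a fit, if the relationship holds, is that each doubling of compute buys the same fixed metric gain, so improvements become progressively more expensive in absolute MACs.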

Comments:	Submitted to ICASSP2024
Subjects:	Audio and Speech Processing (eess.AS); Sound (cs.SD)
Cite as:	arXiv:2309.07757 [eess.AS]
	(or arXiv:2309.07757v1 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2309.07757

Submission history

From: Hangting Chen
[v1] Thu, 14 Sep 2023 14:45:17 UTC (263 KB)
