Noise-aware Speech Separation with Contrastive Learning

Zhang, Zizheng; Chen, Chen; Liu, Xiang; Hu, Yuchen; Chng, Eng Siong

Computer Science > Sound

arXiv:2305.10761v1 (cs)

[Submitted on 18 May 2023 (this version), latest version 8 Jan 2024 (v3)]

Title:Noise-aware Speech Separation with Contrastive Learning

Authors:Zizheng Zhang, Chen Chen, Xiang Liu, Yuchen Hu, Eng Siong Chng

View PDF

Abstract:Recently, speech separation (SS) task has achieved remarkable progress driven by deep learning technique. However, it is still challenging to separate target signals from noisy mixture, as neural model is vulnerable to assign background noise to each speaker. In this paper, we propose a noise-aware SS method called NASS, which aims to improve the speech quality of separated signals in noisy conditions. Specifically, NASS views background noise as an independent speaker and predicts it with other speakers in a mask-based manner. Then we conduct patch-wise contrastive learning on feature level to minimize the mutual information between the predicted noise-speaker and other speakers, which suppresses the noise information in separated signals. The experimental results show that NASS effectively improves the noise-robustness for different mask-based separation backbones with less than 0.1M parameter increase. Furthermore, SI-SNRi results demonstrate that NASS achieves state-of-the-art performance on WHAM! dataset.

Subjects:	Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2305.10761 [cs.SD]
	(or arXiv:2305.10761v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2305.10761

Submission history

From: Zizheng Zhang [view email]
[v1] Thu, 18 May 2023 07:06:15 UTC (7,434 KB)
[v2] Mon, 11 Sep 2023 14:23:27 UTC (5,497 KB)
[v3] Mon, 8 Jan 2024 05:24:51 UTC (5,498 KB)

Computer Science > Sound

Title:Noise-aware Speech Separation with Contrastive Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:Noise-aware Speech Separation with Contrastive Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators