Spatial Loss for Unsupervised Multi-channel Source Separation

Saijo, Kohei; Scheibler, Robin

arXiv:2204.00210 (eess)

[Submitted on 1 Apr 2022]

Abstract: We propose a spatial loss for unsupervised multi-channel source separation. The proposed loss exploits the duality of direction of arrival (DOA) and beamforming: the steering and beamforming vectors should be aligned for the target source, but orthogonal for interfering ones. The spatial loss encourages consistency between the mixing system obtained from a classic DOA estimator and the demixing system obtained from a neural separator. With the proposed loss, we train neural separators based on minimum variance distortionless response (MVDR) beamforming and independent vector analysis (IVA). We also investigate the effectiveness of combining our spatial loss with a signal loss, which uses the outputs of blind source separation as the reference. We evaluate the proposed method on synthetic and recorded (LibriCSS) mixtures. We find that the spatial loss is most effective for training IVA-based separators. For the neural MVDR beamformer, it performs best when combined with a signal loss. On synthetic mixtures, the proposed unsupervised loss leads to the same performance as a supervised loss in terms of word error rate. On LibriCSS, we obtain close to state-of-the-art performance without any labeled training data.
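
The alignment/orthogonality idea in the abstract can be illustrated with a small NumPy sketch. The abstract does not state the exact form of the loss, so everything below is an assumption made for illustration: the uniform linear array geometry, the function names `steering_vector` and `spatial_consistency_loss`, and the negative-log form of the penalty are hypothetical and are not taken from the paper. The sketch only shows that a demixing matrix whose rows match the steering vectors of their own source, and cancel the others, scores lower than one whose rows are assigned to the wrong sources.

```python
import numpy as np


def steering_vector(doa_rad, n_mics, spacing_m, freq_hz, c=343.0):
    """Far-field steering vector of a uniform linear array (assumed geometry)."""
    delays = np.arange(n_mics) * spacing_m * np.cos(doa_rad) / c
    return np.exp(-2j * np.pi * freq_hz * delays)


def spatial_consistency_loss(W, A, eps=1e-12):
    """Illustrative penalty, not the paper's formula.

    W: (n_src, n_mics) demixing matrix, row k acts as the beamformer w_k^H.
    A: (n_mics, n_src) mixing matrix, column j is the steering vector a_j.
    G[k, j] = |w_k^H a_j|^2 should be large for j == k and near zero otherwise.
    """
    G = np.abs(W @ A) ** 2
    # Normalise each row so it sums to one, then penalise mass off the diagonal.
    P = G / (G.sum(axis=1, keepdims=True) + eps)
    return float(-np.mean(np.log(np.diag(P) + eps)))


# Two sources at 60 and 120 degrees, four microphones 5 cm apart, one 1 kHz bin.
doas = np.deg2rad([60.0, 120.0])
A = np.stack([steering_vector(d, 4, 0.05, 1000.0) for d in doas], axis=1)

W_good = np.linalg.pinv(A)   # W_good @ A is the identity: aligned and orthogonal
W_bad = W_good[::-1]         # rows swapped: each beamformer targets the wrong source

loss_good = spatial_consistency_loss(W_good, A)
loss_bad = spatial_consistency_loss(W_bad, A)
print(loss_good, loss_bad)
```

In an actual training setup, `A` would come from a DOA estimator and `W` from the neural separator, with the penalty averaged over frequency bins and written in a differentiable framework; those details are left out of this sketch.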

Comments:	Submitted to INTERSPEECH 2022
Subjects:	Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2204.00210 [eess.AS]
	(or arXiv:2204.00210v1 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2204.00210

Submission history

From: Kohei Saijo
[v1] Fri, 1 Apr 2022 05:13:17 UTC (79 KB)
