Complex-valued Spatial Autoencoders for Multichannel Speech Enhancement

Halimeh, Mhd Modar; Kellermann, Walter

Electrical Engineering and Systems Science > Audio and Speech Processing

arXiv:2108.03130 (eess)

[Submitted on 6 Aug 2021]

Title:Complex-valued Spatial Autoencoders for Multichannel Speech Enhancement

Authors:Mhd Modar Halimeh, Walter Kellermann

View PDF

Abstract:In this contribution, we present a novel online approach to multichannel speech enhancement. The proposed method estimates the enhanced signal through a filter-and-sum framework. More specifically, complex-valued masks are estimated by a deep complex-valued neural network, termed the complex-valued spatial autoencoder. The proposed network is capable of exploiting as well as manipulating both the phase and the amplitude of the microphone signals. As shown by the experimental results, the proposed approach is able to exploit both spatial and spectral characteristics of the desired source signal resulting in a physically plausible spatial selectivity and superior speech quality compared to other baseline methods.

Subjects:	Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2108.03130 [eess.AS]
	(or arXiv:2108.03130v1 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2108.03130

Submission history

From: Mhd Modar Halimeh [view email]
[v1] Fri, 6 Aug 2021 14:03:20 UTC (944 KB)

Full-text links:

Access Paper:

view license

Current browse context:

eess.AS

< prev | next >

new | recent | 2021-08

Change to browse by:

eess

References & Citations

export BibTeX citation

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Complex-valued Spatial Autoencoders for Multichannel Speech Enhancement

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Complex-valued Spatial Autoencoders for Multichannel Speech Enhancement

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators