Configurable DOA Estimation using Incremental Learning

Xiao, Yang; Das, Rohan Kumar

Electrical Engineering and Systems Science > Audio and Speech Processing

arXiv:2407.03661 (eess)

[Submitted on 4 Jul 2024 (v1), last revised 26 Aug 2024 (this version, v2)]

Title:Configurable DOA Estimation using Incremental Learning

Authors:Yang Xiao, Rohan Kumar Das

View PDF HTML (experimental)

Abstract:This study introduces a progressive neural network (PNN) model for direction of arrival (DOA) estimation, DOA-PNN, addressing the challenge due to catastrophic forgetting in adapting dynamic acoustic environments. While traditional methods such as GCC, MUSIC, and SRP-PHAT are effective in static settings, they perform worse in noisy, reverberant conditions. Deep learning models, particularly CNNs, offer improvements but struggle with a mismatch configuration between the training and inference phases. The proposed DOA-PNN overcomes these limitations by incorporating task incremental learning of continual learning, allowing for adaptation across varying acoustic scenarios with less forgetting of previously learned knowledge. Featuring task-specific sub-networks and a scaling mechanism, DOA-PNN efficiently manages parameter growth, ensuring high performance across incremental microphone configurations. We study DOA-PNN on a simulated data under various mic distance based microphone settings. The studies reveal its capability to maintain performance with minimal parameter increase, presenting an efficient solution for DOA estimation.

Comments:	Submitted to ICASSP 2025
Subjects:	Audio and Speech Processing (eess.AS); Sound (cs.SD)
Cite as:	arXiv:2407.03661 [eess.AS]
	(or arXiv:2407.03661v2 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2407.03661

Submission history

From: Yang Xiao [view email]
[v1] Thu, 4 Jul 2024 06:02:52 UTC (748 KB)
[v2] Mon, 26 Aug 2024 16:38:30 UTC (592 KB)

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Configurable DOA Estimation using Incremental Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Configurable DOA Estimation using Incremental Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators