Improved Robust ASR for Social Robots in Public Spaces

Jankowski, Charles; Mruthyunjaya, Vishwas; Lin, Ruixi

arXiv:2001.04619 (eess)

[Submitted on 14 Jan 2020]

Abstract: Social robots deployed in public spaces present a challenging task for ASR because of a variety of factors, including noise at signal-to-noise ratios (SNRs) of 20 dB down to 5 dB. Existing ASR models perform well at the higher SNRs in this range but degrade considerably as noise increases. This work explores methods for improving ASR performance in such conditions. We use the AiShell-1 Chinese speech corpus and the Kaldi ASR toolkit for evaluations. We were able to exceed state-of-the-art ASR performance at SNRs lower than 20 dB, demonstrating the feasibility of achieving relatively high-performing ASR with open-source toolkits and hundreds of hours of training data, which is commonly available.
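
For readers unfamiliar with the SNR figures quoted in the abstract, the following is a minimal sketch of how a noisy test signal at a chosen SNR might be constructed. It relies only on the standard definition SNR_dB = 10 log10(P_speech / P_noise). The function names `power`, `mix_at_snr`, and `measure_snr_db`, and the synthetic sine-plus-Gaussian signals, are illustrative assumptions. They are not taken from the paper, and this is not a description of the authors' data preparation.

```python
import math
import random


def power(samples):
    """Mean squared amplitude of a sequence of samples."""
    return sum(x * x for x in samples) / len(samples)


def mix_at_snr(speech, noise, target_snr_db):
    """Scale `noise` to hit `target_snr_db` relative to `speech`, then add it.

    Returns (mixed, scaled_noise). Hypothetical helper for illustration only.
    """
    if len(speech) != len(noise):
        raise ValueError("speech and noise must have the same length")
    noise_power = power(noise)
    if noise_power == 0:
        raise ValueError("noise must not be silent")
    # SNR_dB = 10 * log10(P_speech / P_noise)
    # => required P_noise = P_speech / 10^(SNR_dB / 10)
    target_noise_power = power(speech) / (10 ** (target_snr_db / 10))
    gain = math.sqrt(target_noise_power / noise_power)
    scaled = [gain * n for n in noise]
    mixed = [s + n for s, n in zip(speech, scaled)]
    return mixed, scaled


def measure_snr_db(speech, noise):
    """SNR in dB between a clean signal and a noise signal."""
    return 10 * math.log10(power(speech) / power(noise))


if __name__ == "__main__":
    rng = random.Random(0)
    # Stand-ins for real audio: one second of a 440 Hz tone and white noise at 16 kHz.
    speech = [math.sin(2 * math.pi * 440 * t / 16000) for t in range(16000)]
    noise = [rng.gauss(0.0, 1.0) for _ in range(16000)]
    for target in (20, 15, 10, 5):  # the SNR range named in the abstract
        _, scaled = mix_at_snr(speech, noise, target)
        print(target, round(measure_snr_db(speech, scaled), 2))
```

Each 10 dB drop in SNR corresponds to a tenfold increase in noise power relative to the speech, so the 5 dB condition carries roughly 32 times more noise power than the 20 dB condition.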

Subjects:	Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
Cite as:	arXiv:2001.04619 [eess.AS]
	(or arXiv:2001.04619v1 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2001.04619

Submission history

From: Ruixi Lin
[v1] Tue, 14 Jan 2020 04:21:18 UTC (572 KB)
