Learning To Detect Keyword Parts And Whole By Smoothed Max Pooling

Park, Hyun-Jin; Violette, Patrick; Subrahmanya, Niranjan

Computer Science > Computation and Language

arXiv:2001.09246 (cs)

[Submitted on 25 Jan 2020]

Title:Learning To Detect Keyword Parts And Whole By Smoothed Max Pooling

Authors:Hyun-Jin Park, Patrick Violette, Niranjan Subrahmanya

View PDF

Abstract:We propose smoothed max pooling loss and its application to keyword spotting systems. The proposed approach jointly trains an encoder (to detect keyword parts) and a decoder (to detect whole keyword) in a semi-supervised manner. The proposed new loss function allows training a model to detect parts and whole of a keyword, without strictly depending on frame-level labeling from LVCSR (Large vocabulary continuous speech recognition), making further optimization possible. The proposed system outperforms the baseline keyword spotting model in [1] due to increased optimizability. Further, it can be more easily adapted for on-device learning applications due to reduced dependency on LVCSR.

Comments:	Accepted in International Conference on Acoustics, Speech, and Signal Processing 2020
Subjects:	Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2001.09246 [cs.CL]
	(or arXiv:2001.09246v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2001.09246

Submission history

From: Hyun-Jin Park [view email]
[v1] Sat, 25 Jan 2020 01:19:19 UTC (554 KB)

Full-text links:

Access Paper:

view license

Current browse context:

eess.AS

< prev | next >

new | recent | 2020-01

Change to browse by:

cs
cs.CL
cs.SD
eess

References & Citations

DBLP - CS Bibliography

listing | bibtex

Hyun-Jin Park

export BibTeX citation

Computer Science > Computation and Language

Title:Learning To Detect Keyword Parts And Whole By Smoothed Max Pooling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Learning To Detect Keyword Parts And Whole By Smoothed Max Pooling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators