Federated Learning for Keyword Spotting

Leroy, David; Coucke, Alice; Lavril, Thibaut; Gisselbrecht, Thibault; Dureau, Joseph

Electrical Engineering and Systems Science > Audio and Speech Processing

arXiv:1810.05512 (eess)

[Submitted on 9 Oct 2018 (v1), last revised 18 Feb 2019 (this version, v4)]

Title:Federated Learning for Keyword Spotting

Authors:David Leroy, Alice Coucke, Thibaut Lavril, Thibault Gisselbrecht, Joseph Dureau

View PDF

Abstract:We propose a practical approach based on federated learning to solve out-of-domain issues with continuously running embedded speech-based models such as wake word detectors. We conduct an extensive empirical study of the federated averaging algorithm for the "Hey Snips" wake word based on a crowdsourced dataset that mimics a federation of wake word users. We empirically demonstrate that using an adaptive averaging strategy inspired from Adam in place of standard weighted model averaging highly reduces the number of communication rounds required to reach our target performance. The associated upstream communication costs per user are estimated at 8 MB, which is a reasonable in the context of smart home voice assistants. Additionally, the dataset used for these experiments is being open sourced with the aim of fostering further transparent research in the application of federated learning to speech data.

Comments:	Accepted for publication to ICASSP 2019
Subjects:	Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
Cite as:	arXiv:1810.05512 [eess.AS]
	(or arXiv:1810.05512v4 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.1810.05512

Submission history

From: David Leroy [view email]
[v1] Tue, 9 Oct 2018 09:41:15 UTC (290 KB)
[v2] Wed, 31 Oct 2018 10:07:18 UTC (75 KB)
[v3] Tue, 18 Dec 2018 09:31:52 UTC (75 KB)
[v4] Mon, 18 Feb 2019 18:41:00 UTC (75 KB)

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Federated Learning for Keyword Spotting

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Federated Learning for Keyword Spotting

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators