Few-shot learning with attention-based sequence-to-sequence models

Higy, Bertrand; Bell, Peter

Computer Science > Computation and Language

arXiv:1811.03519 (cs)

[Submitted on 8 Nov 2018 (v1), last revised 22 Mar 2019 (this version, v2)]

Title:Few-shot learning with attention-based sequence-to-sequence models

Authors:Bertrand Higy, Peter Bell

View PDF

Abstract:End-to-end approaches have recently become popular as a means of simplifying the training and deployment of speech recognition systems. However, they often require large amounts of data to perform well on large vocabulary tasks. With the aim of making end-to-end approaches usable by a broader range of researchers, we explore the potential to use end-to-end methods in small vocabulary contexts where smaller datasets may be used. A significant drawback of small-vocabulary systems is the difficulty of expanding the vocabulary beyond the original training samples -- therefore we also study strategies to extend the vocabulary with only few examples per new class (few-shot learning).
Our results show that an attention-based encoder-decoder can be competitive against a strong baseline on a small vocabulary keyword classification task, reaching 97.5% of accuracy on Tensorflow's Speech Commands dataset. It also shows promising results on the few-shot learning problem where a simple strategy achieved 68.8\% of accuracy on new keywords with only 10 examples for each new class. This score goes up to 88.4\% with a larger set of 100 examples.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1811.03519 [cs.CL]
	(or arXiv:1811.03519v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1811.03519

Submission history

From: Bertrand Higy [view email]
[v1] Thu, 8 Nov 2018 16:05:50 UTC (61 KB)
[v2] Fri, 22 Mar 2019 11:15:52 UTC (80 KB)

Computer Science > Computation and Language

Title:Few-shot learning with attention-based sequence-to-sequence models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Few-shot learning with attention-based sequence-to-sequence models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators