Trace norm regularization and faster inference for embedded speech recognition RNNs

Kliegl, Markus; Goyal, Siddharth; Zhao, Kexin; Srinet, Kavya; Shoeybi, Mohammad

arXiv:1710.09026 (cs)

[Submitted on 25 Oct 2017 (v1), last revised 6 Feb 2018 (this version, v2)]

Abstract: We propose and evaluate new techniques for compressing and speeding up dense matrix multiplications as found in the fully connected and recurrent layers of neural networks for embedded large vocabulary continuous speech recognition (LVCSR). For compression, we introduce and study a trace norm regularization technique for training low rank factored versions of matrix multiplications. Compared to standard low rank training, we show that our method leads to good trade-offs between accuracy and number of parameters and can be used to speed up training of large models. For speedup, we enable faster inference on ARM processors through new open-sourced kernels optimized for small batch sizes, resulting in 3x to 7x speedups over the widely used gemmlowp library. Beyond LVCSR, we expect our techniques and kernels to be more generally applicable to embedded neural networks with large fully connected or recurrent layers.
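To make the compression idea in the abstract concrete, the following is a minimal NumPy sketch, not the authors' implementation. It illustrates two standard facts: the trace (nuclear) norm of a matrix is the sum of its singular values, and a balanced rank-r factorization W = UV obtained from a truncated SVD satisfies ½(‖U‖²_F + ‖V‖²_F) = ‖W‖_* when W has rank at most r. The function names `trace_norm` and `factor_low_rank`, the matrix sizes, and the use of a squared Frobenius penalty on the factors as the surrogate are illustrative assumptions rather than details stated in the abstract.

```python
import numpy as np


def trace_norm(W):
    """Trace (nuclear) norm: the sum of the singular values of W."""
    return np.linalg.svd(W, compute_uv=False).sum()


def factor_low_rank(W, rank):
    """Split W (m x n) into U (m x rank) and V (rank x n) via truncated SVD.

    The singular values are shared evenly between the two factors
    (a "balanced" factorization), which is an illustrative choice.
    """
    P, s, Qt = np.linalg.svd(W, full_matrices=False)
    root = np.sqrt(s[:rank])
    U = P[:, :rank] * root            # scale each kept column
    V = root[:, None] * Qt[:rank]     # scale each kept row
    return U, V


# Hypothetical example: a 64 x 48 weight matrix of exact rank 4.
rng = np.random.default_rng(0)
m, n, true_rank = 64, 48, 4
W = rng.standard_normal((m, true_rank)) @ rng.standard_normal((true_rank, n))

U, V = factor_low_rank(W, true_rank)

dense_params = m * n                  # parameters in the dense layer
factored_params = U.size + V.size     # parameters in the two factors
rel_error = np.linalg.norm(W - U @ V) / np.linalg.norm(W)

# Frobenius penalty on the factors; equals the trace norm of W here
# because the factorization is balanced and W has rank <= true_rank.
surrogate = 0.5 * (np.sum(U ** 2) + np.sum(V ** 2))

print(dense_params, factored_params, rel_error)
print(trace_norm(W), surrogate)
```

In this toy setting the factored form stores m·r + r·n values instead of m·n, which is where the parameter savings of a low rank layer come from. Penalizing the trace norm during training encourages weight matrices whose singular values decay quickly, so that a truncation like the one above loses little accuracy.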

Comments:	Our optimized inference kernels are available at: this https URL (Note: This paper was submitted to, but rejected from, ICLR 2018. We believe it may still be of value to others. Please see the discussion here: this https URL)
Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
Cite as:	arXiv:1710.09026 [cs.LG]
	(or arXiv:1710.09026v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1710.09026

Submission history

From: Markus Kliegl
[v1] Wed, 25 Oct 2017 00:20:55 UTC (1,584 KB)
[v2] Tue, 6 Feb 2018 10:00:10 UTC (1,586 KB)
