Learned Step Size Quantization

Esser, Steven K.; McKinstry, Jeffrey L.; Bablani, Deepika; Appuswamy, Rathinakumar; Modha, Dharmendra S.

arXiv:1902.08153v1 (cs)

[Submitted on 21 Feb 2019 (this version), latest version 7 May 2020 (v3)]

Abstract: We present Learned Step Size Quantization, a method for training deep networks so that they can run at inference time using low-precision integer matrix multipliers, which offer power and space advantages over high-precision alternatives. The essence of our approach is to learn the step size parameter of a uniform quantizer by backpropagation of the training loss, applying a scaling factor to its learning rate and computing its associated loss gradient by ignoring the discontinuity present in the quantizer. This quantization approach can be applied to activations or weights, using different levels of precision as needed for a given system, and requires only a simple modification of existing training code. As demonstrated on the ImageNet dataset, our approach achieves better accuracy than all previously published methods for creating quantized networks on several ResNet network architectures at 2, 3, and 4 bits of precision.
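
The mechanism summarized in the abstract can be illustrated with a small scalar sketch. It is not the authors' implementation: the function names, the integer bounds `q_n` and `q_p`, and the `grad_scale` parameter are hypothetical, and the abstract does not state what value the scaling factor takes. The step-size gradient below is what follows from treating the rounding operation as the identity during backpropagation, which is one way to read "ignoring the discontinuity present in the quantizer".

```python
def lsq_quantize(v, s, q_n, q_p):
    """Uniform quantizer: divide by step size s, clip to [-q_n, q_p], round, rescale.

    q_n and q_p are assumed (hypothetical) integer bounds, e.g. 4 and 3 for a
    signed 3-bit code. Python's round() sends exact .5 ties to the even integer.
    """
    x = max(-q_n, min(q_p, v / s))
    return round(x) * s


def lsq_step_size_grad(v, s, q_n, q_p, grad_scale=1.0):
    """Gradient of the quantized value with respect to s.

    Rounding is treated as the identity in the backward pass, so inside the
    clipping range d(s * round(v / s)) / ds = round(v / s) - v / s, and in the
    clipped regions the output is a constant multiple of s.
    grad_scale is a placeholder for the scaling factor mentioned in the
    abstract; its value here is an assumption, not taken from the paper.
    """
    x = v / s
    if x <= -q_n:
        g = -q_n
    elif x >= q_p:
        g = q_p
    else:
        g = round(x) - x
    return grad_scale * g


def sgd_step_on_s(v, s, upstream_grad, lr, q_n, q_p, grad_scale=1.0):
    """One plain gradient-descent update of s, given dLoss/d(quantized value)."""
    return s - lr * upstream_grad * lsq_step_size_grad(v, s, q_n, q_p, grad_scale)


if __name__ == "__main__":
    s = 0.5
    for v in (-5.0, 0.8, 5.0):
        print(v, lsq_quantize(v, s, 4, 3), lsq_step_size_grad(v, s, 4, 3))
```

In a real training setup the same forward and backward rules would typically be expressed as a custom autograd operation over tensors, with the per-element step-size gradients summed over the layer.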

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1902.08153 [cs.LG]
	(or arXiv:1902.08153v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1902.08153

Submission history

From: Steven Esser
[v1] Thu, 21 Feb 2019 17:31:32 UTC (73 KB)
[v2] Wed, 25 Sep 2019 21:18:07 UTC (80 KB)
[v3] Thu, 7 May 2020 03:30:49 UTC (241 KB)
