CUTIE: Beyond PetaOp/s/W Ternary DNN Inference Acceleration with Better-than-Binary Energy Efficiency

Scherer, Moritz; Rutishauser, Georg; Cavigelli, Lukas; Benini, Luca

Computer Science > Hardware Architecture

arXiv:2011.01713 (cs)

[Submitted on 3 Nov 2020 (v1), last revised 4 Feb 2021 (this version, v2)]

Title:CUTIE: Beyond PetaOp/s/W Ternary DNN Inference Acceleration with Better-than-Binary Energy Efficiency

Authors:Moritz Scherer, Georg Rutishauser, Lukas Cavigelli, Luca Benini

View PDF

Abstract:We present a 3.1 POp/s/W fully digital hardware accelerator for ternary neural networks. CUTIE, the Completely Unrolled Ternary Inference Engine, focuses on minimizing non-computational energy and switching activity so that dynamic power spent on storing (locally or globally) intermediate results is minimized. This is achieved by 1) a data path architecture completely unrolled in the feature map and filter dimensions to reduce switching activity by favoring silencing over iterative computation and maximizing data re-use, 2) targeting ternary neural networks which, in contrast to binary NNs, allow for sparse weights which reduce switching activity, and 3) introducing an optimized training method for higher sparsity of the filter weights, resulting in a further reduction of the switching activity. Compared with state-of-the-art accelerators, CUTIE achieves greater or equal accuracy while decreasing the overall core inference energy cost by a factor of 4.8x-21x.

Subjects:	Hardware Architecture (cs.AR)
Cite as:	arXiv:2011.01713 [cs.AR]
	(or arXiv:2011.01713v2 [cs.AR] for this version)
	https://doi.org/10.48550/arXiv.2011.01713

Submission history

From: Moritz Scherer [view email]
[v1] Tue, 3 Nov 2020 14:00:55 UTC (5,860 KB)
[v2] Thu, 4 Feb 2021 09:30:51 UTC (8,672 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AR

< prev | next >

new | recent | 2020-11

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Lukas Cavigelli
Luca Benini

export BibTeX citation

Computer Science > Hardware Architecture

Title:CUTIE: Beyond PetaOp/s/W Ternary DNN Inference Acceleration with Better-than-Binary Energy Efficiency

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Hardware Architecture

Title:CUTIE: Beyond PetaOp/s/W Ternary DNN Inference Acceleration with Better-than-Binary Energy Efficiency

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators