Dynamically Sacrificing Accuracy for Reduced Computation: Cascaded Inference Based on Softmax Confidence

Berestizshevsky, Konstantin; Even, Guy

doi:10.1007/978-3-030-30484-3_26

Computer Science > Machine Learning

arXiv:1805.10982 (cs)

[Submitted on 28 May 2018 (v1), last revised 11 Nov 2020 (this version, v2)]

Title:Dynamically Sacrificing Accuracy for Reduced Computation: Cascaded Inference Based on Softmax Confidence

Authors:Konstantin Berestizshevsky, Guy Even

View PDF

Abstract:We study the tradeoff between computational effort and classification accuracy in a cascade of deep neural networks. During inference, the user sets the acceptable accuracy degradation which then automatically determines confidence thresholds for the intermediate classifiers. As soon as the confidence threshold is met, inference terminates immediately without having to compute the output of the complete network. Confidence levels are derived directly from the softmax outputs of intermediate classifiers, as we do not train special decision functions. We show that using a softmax output as a confidence measure in a cascade of deep neural networks leads to a reduction of 15%-50% in the number of MAC operations while degrading the classification accuracy by roughly 1%. Our method can be easily incorporated into pre-trained non-cascaded architectures, as we exemplify on ResNet. Our main contribution is a method that dynamically adjusts the tradeoff between accuracy and computation without retraining the model.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1805.10982 [cs.LG]
	(or arXiv:1805.10982v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1805.10982
Related DOI:	https://doi.org/10.1007/978-3-030-30484-3_26

Submission history

From: Konstantin Berestizshevsky [view email]
[v1] Mon, 28 May 2018 15:44:13 UTC (210 KB)
[v2] Wed, 11 Nov 2020 13:04:31 UTC (175 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2018-05

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Konstantin Berestizshevsky
Guy Even

export BibTeX citation

Computer Science > Machine Learning

Title:Dynamically Sacrificing Accuracy for Reduced Computation: Cascaded Inference Based on Softmax Confidence

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Dynamically Sacrificing Accuracy for Reduced Computation: Cascaded Inference Based on Softmax Confidence

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators