Efficient On-device Training via Gradient Filtering

Yang, Yuedong; Li, Guihong; Marculescu, Radu

Computer Science > Computer Vision and Pattern Recognition

arXiv:2301.00330 (cs)

[Submitted on 1 Jan 2023 (v1), last revised 25 Mar 2023 (this version, v2)]

Title:Efficient On-device Training via Gradient Filtering

Authors:Yuedong Yang, Guihong Li, Radu Marculescu

View PDF

Abstract:Despite its importance for federated learning, continuous learning and many other applications, on-device training remains an open problem for EdgeAI. The problem stems from the large number of operations (e.g., floating point multiplications and additions) and memory consumption required during training by the back-propagation algorithm. Consequently, in this paper, we propose a new gradient filtering approach which enables on-device CNN model training. More precisely, our approach creates a special structure with fewer unique elements in the gradient map, thus significantly reducing the computational complexity and memory consumption of back propagation during training. Extensive experiments on image classification and semantic segmentation with multiple CNN models (e.g., MobileNet, DeepLabV3, UPerNet) and devices (e.g., Raspberry Pi and Jetson Nano) demonstrate the effectiveness and wide applicability of our approach. For example, compared to SOTA, we achieve up to 19$\times$ speedup and 77.1% memory savings on ImageNet classification with only 0.1% accuracy loss. Finally, our method is easy to implement and deploy; over 20$\times$ speedup and 90% energy savings have been observed compared to highly optimized baselines in MKLDNN and CUDNN on NVIDIA Jetson Nano. Consequently, our approach opens up a new direction of research with a huge potential for on-device training.

Comments:	CVPR2023, 19 pages, 13 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2301.00330 [cs.CV]
	(or arXiv:2301.00330v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2301.00330

Submission history

From: Yuedong Yang [view email]
[v1] Sun, 1 Jan 2023 02:33:03 UTC (1,190 KB)
[v2] Sat, 25 Mar 2023 02:12:09 UTC (1,260 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Efficient On-device Training via Gradient Filtering

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Efficient On-device Training via Gradient Filtering

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators