PowerYOLO: Mixed Precision Model for Hardware Efficient Object Detection with Event Data

Przewlocka-Rus, Dominika; Kryjak, Tomasz; Gorgon, Marek

Computer Science > Computer Vision and Pattern Recognition

arXiv:2407.08272 (cs)

[Submitted on 11 Jul 2024]

Title:PowerYOLO: Mixed Precision Model for Hardware Efficient Object Detection with Event Data

Authors:Dominika Przewlocka-Rus, Tomasz Kryjak, Marek Gorgon

View PDF HTML (experimental)

Abstract:The performance of object detection systems in automotive solutions must be as high as possible, with minimal response time and, due to the often battery-powered operation, low energy consumption. When designing such solutions, we therefore face challenges typical for embedded vision systems: the problem of fitting algorithms of high memory and computational complexity into small low-power devices. In this paper we propose PowerYOLO - a mixed precision solution, which targets three essential elements of such application. First, we propose a system based on a Dynamic Vision Sensor (DVS), a novel sensor, that offers low power requirements and operates well in conditions with variable illumination. It is these features that may make event cameras a preferential choice over frame cameras in some applications. Second, to ensure high accuracy and low memory and computational complexity, we propose to use 4-bit width Powers-of-Two (PoT) quantisation for convolution weights of the YOLO detector, with all other parameters quantised linearly. Finally, we embrace from PoT scheme and replace multiplication with bit-shifting to increase the efficiency of hardware acceleration of such solution, with a special convolution-batch normalisation fusion scheme. The use of specific sensor with PoT quantisation and special batch normalisation fusion leads to a unique system with almost 8x reduction in memory complexity and vast computational simplifications, with relation to a standard approach. This efficient system achieves high accuracy of mAP 0.301 on the GEN1 DVS dataset, marking the new state-of-the-art for such compressed model.

Comments:	The paper has been accepted for the 27th Euromicro Conference Series on Digital System Design (DSD) 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
Cite as:	arXiv:2407.08272 [cs.CV]
	(or arXiv:2407.08272v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.08272

Submission history

From: Tomasz Kryjak [view email]
[v1] Thu, 11 Jul 2024 08:17:35 UTC (1,536 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:PowerYOLO: Mixed Precision Model for Hardware Efficient Object Detection with Event Data

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:PowerYOLO: Mixed Precision Model for Hardware Efficient Object Detection with Event Data

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators