Deep Learning Techniques for Visual Counting

Ciampi, Luca

arXiv:2206.03033 (cs)

[Submitted on 7 Jun 2022 (v1), last revised 8 Jun 2022 (this version, v2)]

Abstract: In this dissertation, we investigated and enhanced Deep Learning (DL) techniques for counting objects, such as pedestrians, cells, or vehicles, in still images or video frames. In particular, we tackled the challenge posed by the lack of data needed to train current DL-based solutions. Because the budget for labeling is limited, data scarcity remains an open problem: it prevents existing solutions based on the supervised learning of neural networks from scaling, and it is responsible for a significant drop in performance at inference time when these algorithms are presented with new scenarios. We introduced solutions that address this issue from several complementary sides: collecting automatically labeled datasets from virtual environments, proposing Domain Adaptation strategies that aim to mitigate the domain gap between the training and test data distributions, and presenting a counting strategy for a weakly labeled data scenario, i.e., in the presence of non-negligible disagreement between multiple annotators. Moreover, we tackled the non-trivial engineering challenges that arise when adopting Convolutional Neural Network-based techniques in environments with limited power resources, introducing solutions for counting vehicles and pedestrians directly onboard embedded vision systems, i.e., devices with constrained computational capabilities that can capture images and process them.

Comments:	Version with high-quality images can be found at this https URL. arXiv admin note: text overlap with arXiv:1802.03601, arXiv:1707.01202, arXiv:1809.02165, arXiv:1901.06026, arXiv:1808.01244 by other authors
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2206.03033 [cs.CV]
	(or arXiv:2206.03033v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2206.03033

Submission history

From: Luca Ciampi
[v1] Tue, 7 Jun 2022 06:20:40 UTC (48,314 KB)
[v2] Wed, 8 Jun 2022 16:29:22 UTC (48,314 KB)
