DISCount: Counting in Large Image Collections with Detector-Based Importance Sampling

Perez, Gustavo; Maji, Subhransu; Sheldon, Daniel

Computer Science > Computer Vision and Pattern Recognition

arXiv:2306.03151 (cs)

[Submitted on 5 Jun 2023]

Title:DISCount: Counting in Large Image Collections with Detector-Based Importance Sampling

Authors:Gustavo Perez, Subhransu Maji, Daniel Sheldon

View PDF

Abstract:Many modern applications use computer vision to detect and count objects in massive image collections. However, when the detection task is very difficult or in the presence of domain shifts, the counts may be inaccurate even with significant investments in training data and model development. We propose DISCount -- a detector-based importance sampling framework for counting in large image collections that integrates an imperfect detector with human-in-the-loop screening to produce unbiased estimates of counts. We propose techniques for solving counting problems over multiple spatial or temporal regions using a small number of screened samples and estimate confidence intervals. This enables end-users to stop screening when estimates are sufficiently accurate, which is often the goal in a scientific study. On the technical side we develop variance reduction techniques based on control variates and prove the (conditional) unbiasedness of the estimators. DISCount leads to a 9-12x reduction in the labeling costs over naive screening for tasks we consider, such as counting birds in radar imagery or estimating damaged buildings in satellite imagery, and also surpasses alternative covariate-based screening approaches in efficiency.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2306.03151 [cs.CV]
	(or arXiv:2306.03151v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.03151

Submission history

From: Gustavo Perez [view email]
[v1] Mon, 5 Jun 2023 18:04:57 UTC (1,685 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:DISCount: Counting in Large Image Collections with Detector-Based Importance Sampling

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:DISCount: Counting in Large Image Collections with Detector-Based Importance Sampling

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators