"ScatSpotter" 2024 -- A Distributed Dog Poop Detection Dataset

Crall, Jon

Computer Science > Computer Vision and Pattern Recognition

arXiv:2412.16473 (cs)

[Submitted on 21 Dec 2024]

Title:"ScatSpotter" 2024 -- A Distributed Dog Poop Detection Dataset

Authors:Jon Crall

View PDF HTML (experimental)

Abstract:We introduce a new -- currently 42 gigabyte -- ``living'' dataset of phone images of dog feces, annotated with manually drawn or AI-assisted polygon labels. There are 6k full resolution images and 4k detailed polygon annotations. The collection and annotation of images started in late 2020 and the dataset grows by roughly 1GB a month. We train VIT and MaskRCNN baseline models to explore the difficulty of the dataset. The best model achieves a pixelwise average precision of 0.858 on a 691-image validation set and 0.847 on a small independently captured 30-image contributor test set. The most recent snapshot of dataset is made publicly available through three different distribution methods: one centralized (Girder) and two decentralized (IPFS and BitTorrent). We study of the trade-offs between distribution methods and discuss the feasibility of each with respect to reliably sharing open scientific data. The code to reproduce the experiments is hosted on GitHub, and the data is published under the Creative Commons Attribution 4.0 International license. Model weights are made publicly available with the dataset. Experimental hardware, time, energy, and emissions are quantified.

Comments:	dataset paper, unreviewed
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2412.16473 [cs.CV]
	(or arXiv:2412.16473v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2412.16473

Submission history

From: Jonathan Crall [view email]
[v1] Sat, 21 Dec 2024 04:05:29 UTC (31,206 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:"ScatSpotter" 2024 -- A Distributed Dog Poop Detection Dataset

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:"ScatSpotter" 2024 -- A Distributed Dog Poop Detection Dataset

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators