Taste More, Taste Better: Diverse Data and Strong Model Boost Semi-Supervised Crowd Counting

Yang, Maochen; Li, Zekun; Zhang, Jian; Qi, Lei; Shi, Yinghuan

Computer Science > Computer Vision and Pattern Recognition

arXiv:2503.17984 (cs)

[Submitted on 23 Mar 2025]

Title:Taste More, Taste Better: Diverse Data and Strong Model Boost Semi-Supervised Crowd Counting

Authors:Maochen Yang, Zekun Li, Jian Zhang, Lei Qi, Yinghuan Shi

View PDF HTML (experimental)

Abstract:Semi-supervised crowd counting is crucial for addressing the high annotation costs of densely populated scenes. Although several methods based on pseudo-labeling have been proposed, it remains challenging to effectively and accurately utilize unlabeled data. In this paper, we propose a novel framework called Taste More Taste Better (TMTB), which emphasizes both data and model aspects. Firstly, we explore a data augmentation technique well-suited for the crowd counting task. By inpainting the background regions, this technique can effectively enhance data diversity while preserving the fidelity of the entire scenes. Secondly, we introduce the Visual State Space Model as backbone to capture the global context information from crowd scenes, which is crucial for extremely crowded, low-light, and adverse weather scenarios. In addition to the traditional regression head for exact prediction, we employ an Anti-Noise classification head to provide less exact but more accurate supervision, since the regression head is sensitive to noise in manual annotations. We conduct extensive experiments on four benchmark datasets and show that our method outperforms state-of-the-art methods by a large margin. Code is publicly available on this https URL.

Comments:	Accepted by CVPR 2025
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2503.17984 [cs.CV]
	(or arXiv:2503.17984v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2503.17984

Submission history

From: Maochen Yang [view email]
[v1] Sun, 23 Mar 2025 08:38:01 UTC (8,766 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Taste More, Taste Better: Diverse Data and Strong Model Boost Semi-Supervised Crowd Counting

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Taste More, Taste Better: Diverse Data and Strong Model Boost Semi-Supervised Crowd Counting

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators