OccludeNet: A Causal Journey into Mixed-View Actor-Centric Video Action Recognition under Occlusions

Zhou, Guanyu; Liu, Wenxuan; Huang, Wenxin; Jia, Xuemei; Zhong, Xian; Lin, Chia-Wen

arXiv:2411.15729 (cs)

[Submitted on 24 Nov 2024]

Abstract: The lack of occlusion data in commonly used action recognition video datasets limits model robustness and impedes sustained performance improvements. We construct OccludeNet, a large-scale occluded video dataset that includes both real-world and synthetic occlusion scene videos under various natural environments. OccludeNet features dynamic tracking occlusion, static scene occlusion, and multi-view interactive occlusion, addressing existing gaps in data. Our analysis reveals that occlusion impacts action classes differently, with actions involving low scene relevance and partial body visibility experiencing greater accuracy degradation. To overcome the limitations of current occlusion-focused approaches, we propose a structural causal model for occluded scenes and introduce the Causal Action Recognition (CAR) framework, which employs backdoor adjustment and counterfactual reasoning. This framework enhances key actor information, improving model robustness to occlusion. We anticipate that the challenges posed by OccludeNet will stimulate further exploration of causal relations in occlusion scenarios and encourage a reevaluation of class correlations, ultimately promoting sustainable performance improvements. The code and full dataset will be released soon.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2411.15729 [cs.CV]
	(or arXiv:2411.15729v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2411.15729

Submission history

From: Guanyu Zhou
[v1] Sun, 24 Nov 2024 06:10:05 UTC (19,831 KB)
