SMART-Vision: Survey of Modern Action Recognition Techniques in Vision

AlShami, Ali K.; Rabinowitz, Ryan; Lam, Khang; Shleibik, Yousra; Mersha, Melkamu; Boult, Terrance; Kalita, Jugal

doi:10.1007/s11042-024-20484-5

Computer Science > Computer Vision and Pattern Recognition

arXiv:2501.13066 (cs)

[Submitted on 22 Jan 2025]

Title:SMART-Vision: Survey of Modern Action Recognition Techniques in Vision

Authors:Ali K. AlShami, Ryan Rabinowitz, Khang Lam, Yousra Shleibik, Melkamu Mersha, Terrance Boult, Jugal Kalita

View PDF HTML (experimental)

Abstract:Human Action Recognition (HAR) is a challenging domain in computer vision, involving recognizing complex patterns by analyzing the spatiotemporal dynamics of individuals' movements in videos. These patterns arise in sequential data, such as video frames, which are often essential to accurately distinguish actions that would be ambiguous in a single image. HAR has garnered considerable interest due to its broad applicability, ranging from robotics and surveillance systems to sports motion analysis, healthcare, and the burgeoning field of autonomous vehicles. While several taxonomies have been proposed to categorize HAR approaches in surveys, they often overlook hybrid methodologies and fail to demonstrate how different models incorporate various architectures and modalities. In this comprehensive survey, we present the novel SMART-Vision taxonomy, which illustrates how innovations in deep learning for HAR complement one another, leading to hybrid approaches beyond traditional categories. Our survey provides a clear roadmap from foundational HAR works to current state-of-the-art systems, highlighting emerging research directions and addressing unresolved challenges in discussion sections for architectures within the HAR domain. We provide details of the research datasets that various approaches used to measure and compare goodness HAR approaches. We also explore the rapidly emerging field of Open-HAR systems, which challenges HAR systems by presenting samples from unknown, novel classes during test time.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2501.13066 [cs.CV]
	(or arXiv:2501.13066v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2501.13066
Journal reference:	Multimedia Tools and Applications, Springer, 2024, pp. 1-72
Related DOI:	https://doi.org/10.1007/s11042-024-20484-5

Submission history

From: Ali K. AlShami [view email]
[v1] Wed, 22 Jan 2025 18:21:55 UTC (8,623 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:SMART-Vision: Survey of Modern Action Recognition Techniques in Vision

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:SMART-Vision: Survey of Modern Action Recognition Techniques in Vision

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators