About Time: Advances, Challenges, and Outlooks of Action Understanding

Stergiou, Alexandros; Poppe, Ronald

Computer Science > Computer Vision and Pattern Recognition

arXiv:2411.15106 (cs)

[Submitted on 22 Nov 2024]

Title:About Time: Advances, Challenges, and Outlooks of Action Understanding

Authors:Alexandros Stergiou, Ronald Poppe

View PDF HTML (experimental)

Abstract:We have witnessed impressive advances in video action understanding. Increased dataset sizes, variability, and computation availability have enabled leaps in performance and task diversification. Current systems can provide coarse- and fine-grained descriptions of video scenes, extract segments corresponding to queries, synthesize unobserved parts of videos, and predict context. This survey comprehensively reviews advances in uni- and multi-modal action understanding across a range of tasks. We focus on prevalent challenges, overview widely adopted datasets, and survey seminal works with an emphasis on recent advances. We broadly distinguish between three temporal scopes: (1) recognition tasks of actions observed in full, (2) prediction tasks for ongoing partially observed actions, and (3) forecasting tasks for subsequent unobserved action. This division allows us to identify specific action modeling and video representation challenges. Finally, we outline future directions to address current shortcomings.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2411.15106 [cs.CV]
	(or arXiv:2411.15106v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2411.15106

Submission history

From: Alexandros Stergiou [view email]
[v1] Fri, 22 Nov 2024 18:09:27 UTC (17,828 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:About Time: Advances, Challenges, and Outlooks of Action Understanding

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:About Time: Advances, Challenges, and Outlooks of Action Understanding

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators