RGB Video Based Tennis Action Recognition Using a Deep Historical Long Short-Term Memory

Cai, Jiaxin; Tang, Xin

Computer Science > Computer Vision and Pattern Recognition

arXiv:1808.00845 (cs)

[Submitted on 2 Aug 2018 (v1), last revised 25 Sep 2018 (this version, v2)]

Title:RGB Video Based Tennis Action Recognition Using a Deep Historical Long Short-Term Memory

Authors:Jiaxin Cai, Xin Tang

View PDF

Abstract:Action recognition has attracted increasing attention from RGB input in computer vision partially due to potential applications on somatic simulation and statistics of sport such as virtual tennis game and tennis techniques and tactics analysis by video. Recently, deep learning based methods have achieved promising performance for action recognition. In this paper, we propose weighted Long Short-Term Memory adopted with convolutional neural network representations for three dimensional tennis shots recognition. First, the local two-dimensional convolutional neural network spatial representations are extracted from each video frame individually using a pre-trained Inception network. Then, a weighted Long Short-Term Memory decoder is introduced to take the output state at time t and the historical embedding feature at time t-1 to generate feature vector using a score weighting scheme. Finally, we use the adopted CNN and weighted LSTM to map the original visual features into a vector space to generate the spatial-temporal semantical description of visual sequences and classify the action video content. Experiments on the benchmark demonstrate that our method using only simple raw RGB video can achieve better performance than the state-of-the-art baselines for tennis shot recognition.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1808.00845 [cs.CV]
	(or arXiv:1808.00845v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1808.00845

Submission history

From: Jiaxin Cai [view email]
[v1] Thu, 2 Aug 2018 14:58:51 UTC (15 KB)
[v2] Tue, 25 Sep 2018 14:28:06 UTC (192 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:RGB Video Based Tennis Action Recognition Using a Deep Historical Long Short-Term Memory

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:RGB Video Based Tennis Action Recognition Using a Deep Historical Long Short-Term Memory

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators