Action Machine: Rethinking Action Recognition in Trimmed Videos

Zhu, Jiagang; Zou, Wei; Xu, Liang; Hu, Yiming; Zhu, Zheng; Chang, Manyu; Huang, Junjie; Huang, Guan; Du, Dalong

Computer Science > Computer Vision and Pattern Recognition

arXiv:1812.05770 (cs)

[Submitted on 14 Dec 2018 (v1), last revised 17 Dec 2018 (this version, v2)]

Title:Action Machine: Rethinking Action Recognition in Trimmed Videos

Authors:Jiagang Zhu, Wei Zou, Liang Xu, Yiming Hu, Zheng Zhu, Manyu Chang, Junjie Huang, Guan Huang, Dalong Du

View PDF

Abstract:Existing methods in video action recognition mostly do not distinguish human body from the environment and easily overfit the scenes and objects. In this work, we present a conceptually simple, general and high-performance framework for action recognition in trimmed videos, aiming at person-centric modeling. The method, called Action Machine, takes as inputs the videos cropped by person bounding boxes. It extends the Inflated 3D ConvNet (I3D) by adding a branch for human pose estimation and a 2D CNN for pose-based action recognition, being fast to train and test. Action Machine can benefit from the multi-task training of action recognition and pose estimation, the fusion of predictions from RGB images and poses. On NTU RGB-D, Action Machine achieves the state-of-the-art performance with top-1 accuracies of 97.2% and 94.3% on cross-view and cross-subject respectively. Action Machine also achieves competitive performance on another three smaller action recognition datasets: Northwestern UCLA Multiview Action3D, MSR Daily Activity3D and UTD-MHAD. Code will be made available.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1812.05770 [cs.CV]
	(or arXiv:1812.05770v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1812.05770

Submission history

From: Jiagang Zhu [view email]
[v1] Fri, 14 Dec 2018 03:43:54 UTC (3,401 KB)
[v2] Mon, 17 Dec 2018 08:12:06 UTC (3,401 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2018-12

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Jiagang Zhu
Wei Zou
Liang Xu
Yiming Hu
Zheng Zhu

…

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Action Machine: Rethinking Action Recognition in Trimmed Videos

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Action Machine: Rethinking Action Recognition in Trimmed Videos

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators