Weakly-supervised Action Localization with Background Modeling

Nguyen, Phuc Xuan; Ramanan, Deva; Fowlkes, Charless C.

arXiv:1908.06552 (cs)

[Submitted on 19 Aug 2019]

Abstract: We describe a latent approach that learns to detect actions in long sequences given training videos with only whole-video class labels. Our approach makes use of two innovations to attention modeling in weakly-supervised learning. First, and most notably, our framework uses an attention model to extract both foreground and background frames whose appearance is explicitly modeled. Most prior works ignore the background, but we show that modeling it allows our system to learn a richer notion of actions and their temporal extents. Second, we combine bottom-up, class-agnostic attention modules with top-down, class-specific activation maps, using the latter as a form of self-supervision for the former. Doing so allows our model to learn a more accurate model of attention without explicit temporal supervision. These modifications lead to a 10% AP@IoU=0.5 improvement over existing systems on THUMOS14. Our proposed weakly-supervised system outperforms recent state-of-the-art methods by at least 4.3% AP@IoU=0.5. Finally, we demonstrate that weakly-supervised learning can be used to aggressively scale up learning to in-the-wild, uncurated Instagram videos. The addition of these videos significantly improves the localization performance of our weakly-supervised model.
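
The first idea in the abstract, using attention to pull out both foreground and background frames, can be illustrated with a small sketch. This is not the authors' implementation: the function names, the sigmoid attention, and the use of the complement (1 - attention) as background weights are all assumptions made for illustration. It only shows how one set of per-frame attention scores could yield two pooled video-level features, one for the action and one for the background.

```python
import math


def sigmoid(x):
    """Map a raw score to an attention weight in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def pool_foreground_background(features, attention_logits):
    """Attention-weighted pooling into foreground and background features.

    features: list of T per-frame feature vectors (each a list of D floats).
    attention_logits: list of T raw, class-agnostic attention scores.

    Assumption for this sketch: background weights are the complement
    (1 - a_t) of the foreground attention weights a_t.
    """
    attn = [sigmoid(z) for z in attention_logits]
    dim = len(features[0])
    eps = 1e-8  # guards against division by zero when all weights vanish
    fg_norm = sum(attn) + eps
    bg_norm = sum(1.0 - a for a in attn) + eps
    foreground = [
        sum(a * f[d] for a, f in zip(attn, features)) / fg_norm
        for d in range(dim)
    ]
    background = [
        sum((1.0 - a) * f[d] for a, f in zip(attn, features)) / bg_norm
        for d in range(dim)
    ]
    return foreground, background


# Two frames with hypothetical 2-D features: the first is attended, the second is not.
fg, bg = pool_foreground_background([[1.0, 0.0], [0.0, 1.0]], [10.0, -10.0])
```

In this toy input the foreground feature is dominated by the first frame and the background feature by the second. In a weakly-supervised setup, each pooled feature would then feed a video-level classifier, with the background feature assigned to a separate background class.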

Comments:	To appear at ICCV 2019
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1908.06552 [cs.CV]
	(or arXiv:1908.06552v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1908.06552

Submission history

From: Phuc Nguyen X
[v1] Mon, 19 Aug 2019 01:33:14 UTC (2,365 KB)
