Tracking People with 3D Representations

Rajasegaran, Jathushan; Pavlakos, Georgios; Kanazawa, Angjoo; Malik, Jitendra

Computer Science > Computer Vision and Pattern Recognition

arXiv:2111.07868 (cs)

[Submitted on 15 Nov 2021]

Title:Tracking People with 3D Representations

Authors:Jathushan Rajasegaran, Georgios Pavlakos, Angjoo Kanazawa, Jitendra Malik

View PDF

Abstract:We present a novel approach for tracking multiple people in video. Unlike past approaches which employ 2D representations, we focus on using 3D representations of people, located in three-dimensional space. To this end, we develop a method, Human Mesh and Appearance Recovery (HMAR) which in addition to extracting the 3D geometry of the person as a SMPL mesh, also extracts appearance as a texture map on the triangles of the mesh. This serves as a 3D representation for appearance that is robust to viewpoint and pose changes. Given a video clip, we first detect bounding boxes corresponding to people, and for each one, we extract 3D appearance, pose, and location information using HMAR. These embedding vectors are then sent to a transformer, which performs spatio-temporal aggregation of the representations over the duration of the sequence. The similarity of the resulting representations is used to solve for associations that assigns each person to a tracklet. We evaluate our approach on the Posetrack, MuPoTs and AVA datasets. We find that 3D representations are more effective than 2D representations for tracking in these settings, and we obtain state-of-the-art performance. Code and results are available at: this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2111.07868 [cs.CV]
	(or arXiv:2111.07868v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2111.07868

Submission history

From: Jathushan Rajasegaran [view email]
[v1] Mon, 15 Nov 2021 16:15:21 UTC (73,199 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Tracking People with 3D Representations

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Tracking People with 3D Representations

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators