This paper proposes a temporal tracking algorithm based on Random Forest that uses depth images to estimate and track the 3D pose of a rigid object in real-time. Compared to the state of the art aimed at the same goal, our algorithm holds important attributes such as high robustness against holes and occlusion, low computational cost of both learning and tracking stages, and low memory consumption. These are obtained (a) by a novel formulation of the learning strategy, based on a dense sampling of the camera viewpoints and learning independent trees from a single image for each camera view, as well as, (b) by an insightful occlusion handling strategy that enforces the forest to recognize the object's local and global structures. Due to these attributes, we report state-of-the-art tracking accuracy on benchmark datasets, and accomplish remarkable scalability with the number of targets, being able to simultaneously track the pose of over a hundred objects at 30~fps with an off-the-shelf CPU. In addition, the fast learning time enables us to extend our algorithm as a robust online tracker for model-free 3D objects under different viewpoints and appearance changes as demonstrated by the experiments.

A versatile learning-based 3d temporal tracker: Scalable, robust, online / Tan, David Joseph; Tombari, Federico; Ilic, Slobodan; Navab, Nassir. - ELETTRONICO. - 11-18-:(2015), pp. 7410443.693-7410443.701. (Intervento presentato al convegno 15th IEEE International Conference on Computer Vision, ICCV 2015 tenutosi a Santiago, Chile nel 2015) [10.1109/ICCV.2015.86].

A versatile learning-based 3d temporal tracker: Scalable, robust, online

TOMBARI, FEDERICO;
2015

Abstract

This paper proposes a temporal tracking algorithm based on Random Forest that uses depth images to estimate and track the 3D pose of a rigid object in real-time. Compared to the state of the art aimed at the same goal, our algorithm holds important attributes such as high robustness against holes and occlusion, low computational cost of both learning and tracking stages, and low memory consumption. These are obtained (a) by a novel formulation of the learning strategy, based on a dense sampling of the camera viewpoints and learning independent trees from a single image for each camera view, as well as, (b) by an insightful occlusion handling strategy that enforces the forest to recognize the object's local and global structures. Due to these attributes, we report state-of-the-art tracking accuracy on benchmark datasets, and accomplish remarkable scalability with the number of targets, being able to simultaneously track the pose of over a hundred objects at 30~fps with an off-the-shelf CPU. In addition, the fast learning time enables us to extend our algorithm as a robust online tracker for model-free 3D objects under different viewpoints and appearance changes as demonstrated by the experiments.
2015
Proceedings of the IEEE International Conference on Computer Vision
693
701
A versatile learning-based 3d temporal tracker: Scalable, robust, online / Tan, David Joseph; Tombari, Federico; Ilic, Slobodan; Navab, Nassir. - ELETTRONICO. - 11-18-:(2015), pp. 7410443.693-7410443.701. (Intervento presentato al convegno 15th IEEE International Conference on Computer Vision, ICCV 2015 tenutosi a Santiago, Chile nel 2015) [10.1109/ICCV.2015.86].
Tan, David Joseph; Tombari, Federico; Ilic, Slobodan; Navab, Nassir
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/553972
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 39
  • ???jsp.display-item.citation.isi??? 26
social impact