A versatile learning-based 3D temporal tracker: Scalable, robust, online

Tan, David Joseph; Tombari, Federico; Ilic, Slobodan; Navab, Nassir

doi:10.1109/ICCV.2015.86

This paper proposes a temporal tracking algorithm based on Random Forest that uses depth images to estimate and track the 3D pose of a rigid object in real time. Compared to state-of-the-art methods with the same goal, our algorithm offers high robustness against holes and occlusion, low computational cost in both the learning and tracking stages, and low memory consumption. These properties are obtained (a) by a novel formulation of the learning strategy, based on dense sampling of the camera viewpoints and learning independent trees from a single image for each camera view, and (b) by an occlusion handling strategy that forces the forest to recognize the object's local and global structures. As a result, we report state-of-the-art tracking accuracy on benchmark datasets and achieve remarkable scalability with the number of targets, simultaneously tracking the pose of over a hundred objects at 30 fps on an off-the-shelf CPU. In addition, the fast learning time lets us extend the algorithm into a robust online tracker for model-free 3D objects under different viewpoints and appearance changes, as demonstrated by the experiments.

A versatile learning-based 3D temporal tracker: Scalable, robust, online / Tan, David Joseph; Tombari, Federico; Ilic, Slobodan; Navab, Nassir. - ELECTRONIC. - 11-18-:(2015), pp. 7410443.693-7410443.701. (Paper presented at the 15th IEEE International Conference on Computer Vision, ICCV 2015, held in Santiago, Chile, in 2015) [10.1109/ICCV.2015.86].

Year: 2015
Volume title: Proceedings of the IEEE International Conference on Computer Vision
First page: 693
Last page: 701
Journal: PROCEEDINGS IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION
DOI: https://dx.doi.org/10.1109/ICCV.2015.86
Appears in types: 4.01 Contribution in conference proceedings

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/553972
