CRIS Current Research Information System

The correct estimation of the head pose is a problem of the great importance for many applications. For instance, it is an enabling technology in automotive for driver attention monitoring. In this paper, we tackle the pose estimation problem through a deep learning network working in regression manner. Traditional methods usually rely on visual facial features, such as facial landmarks or nose tip position. In contrast, we exploit a Convolutional Neural Network (CNN) to perform head pose estimation directly from depth data. We exploit a Siamese architecture and we propose a novel loss function to improve the learning of the regression network layer. The system has been tested on two public datasets, Biwi Kinect Head Pose and ICT-3DHP database. The reported results demonstrate the improvement in accuracy with respect to current state-of-the-art approaches and the real time capabilities of the overall framework.

Venturelli, M., BORGHI, G., VEZZANI, R., CUCCHIARA, R. (2017). From Depth Data to Head Pose Estimation: a Siamese approach. SciTePress [10.5220/0006104501940201].

From Depth Data to Head Pose Estimation: a Siamese approach

Venturelli, Marco;BORGHI, GUIDO;VEZZANI, Roberto;CUCCHIARA, Rita

2017

Abstract

The correct estimation of the head pose is a problem of the great importance for many applications. For instance, it is an enabling technology in automotive for driver attention monitoring. In this paper, we tackle the pose estimation problem through a deep learning network working in regression manner. Traditional methods usually rely on visual facial features, such as facial landmarks or nose tip position. In contrast, we exploit a Convolutional Neural Network (CNN) to perform head pose estimation directly from depth data. We exploit a Siamese architecture and we propose a novel loss function to improve the learning of the regression network layer. The system has been tested on two public datasets, Biwi Kinect Head Pose and ICT-3DHP database. The reported results demonstrate the improvement in accuracy with respect to current state-of-the-art approaches and the real time capabilities of the overall framework.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2017
			
	Titolo del volume
	
				Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISAPP)
			
	Pagina iniziale
	
				194
			
	Pagina finale
	
				201
			
	Codice DOI
	
				https://dx.doi.org/10.5220/0006104501940201
			
	Citazione
	
				Venturelli, M., BORGHI, G., VEZZANI, R., CUCCHIARA, R. (2017). From Depth Data to Head Pose Estimation: a Siamese approach. SciTePress [10.5220/0006104501940201].
			
	Tutti gli autori
	
						Venturelli, Marco; BORGHI, GUIDO; VEZZANI, Roberto; CUCCHIARA, Rita
					
	Appare nelle tipologie:
	
				4.01 Contributo in Atti di convegno

File in questo prodotto:

Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/859617

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

14

13

social impact