RefiNet: 3D Human Pose Refinement with Depth Maps

Andrea D’Eusanio;Stefano Pini;Guido Borghi;Roberto Vezzani;Rita Cucchiara

2020

Abstract

Human Pose Estimation is a fundamental task for many Computer Vision applications and has been widely investigated in the 2D domain, i.e., on intensity images. As a result, most available methods rely on 2D Convolutional Neural Networks and huge manually annotated RGB datasets, achieving stunning results. In this paper, we propose RefiNet, a multi-stage framework that regresses an extremely precise 3D human pose estimate from a given 2D pose and a depth map. The framework consists of three modules, each specialized in a particular refinement and data representation, i.e., depth patches, 3D skeletons, and point clouds. Moreover, we present a new dataset, called Baracca, acquired with RGB, depth, and thermal cameras and specifically created for the automotive context. Experimental results confirm the quality of the refinement procedure, which largely improves the human pose estimates of off-the-shelf 2D methods.
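
The abstract does not state how the 2D pose and the depth map are combined before refinement. One common way to obtain an initial 3D pose that a refinement stage could then correct is to back-project each 2D joint through a pinhole camera model. The sketch below illustrates that idea only; it is an assumption for illustration, not the RefiNet implementation. The function name `backproject_pose`, the intrinsics `fx`, `fy`, `cx`, `cy`, and the convention that a depth of 0 marks an invalid pixel are all hypothetical.

```python
from typing import List, Optional, Sequence, Tuple

Point2D = Tuple[float, float]
Point3D = Tuple[float, float, float]


def backproject_pose(
    joints_2d: Sequence[Point2D],
    depth_map: Sequence[Sequence[float]],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
) -> List[Optional[Point3D]]:
    """Lift 2D joints (u, v), given in pixels, to camera-space 3D points.

    Assumptions (not taken from the paper): depth_map[v][u] holds metric
    depth along the optical axis, and a value <= 0 marks an invalid reading.
    Joints outside the map or on an invalid pixel are returned as None.
    """
    height = len(depth_map)
    width = len(depth_map[0]) if height else 0
    pose_3d: List[Optional[Point3D]] = []
    for u, v in joints_2d:
        # Nearest pixel to the (possibly sub-pixel) joint location.
        col, row = int(round(u)), int(round(v))
        if not (0 <= row < height and 0 <= col < width):
            pose_3d.append(None)
            continue
        z = depth_map[row][col]
        if z <= 0:
            pose_3d.append(None)
            continue
        # Pinhole model: x = (u - cx) * z / fx, y = (v - cy) * z / fy.
        pose_3d.append(((u - cx) * z / fx, (v - cy) * z / fy, z))
    return pose_3d
```

Joints that land on missing or noisy depth readings come back as `None` or with a wrong depth, which is the kind of error a depth-based refinement step would be expected to address.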

Year	2020
Volume title	2020 25th International Conference on Pattern Recognition (ICPR)
First page	2320
Last page	2327
Series	INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION
DOI	https://dx.doi.org/10.1109/ICPR48806.2021.9412451
Citation	Andrea D’Eusanio, Stefano Pini, Guido Borghi, Roberto Vezzani, Rita Cucchiara (2020). RefiNet: 3D Human Pose Refinement with Depth Maps [10.1109/ICPR48806.2021.9412451].
All authors	Andrea D’Eusanio; Stefano Pini; Guido Borghi; Roberto Vezzani; Rita Cucchiara
Publication type	4.01 Contribution in conference proceedings

Files in this item:

File	Size	Format
ICPR_2020_Human_Pose_Estimation_compressed.pdf (open access; type: Postprint / Author's Accepted Manuscript (AAM), version accepted for publication after peer review; license: free open-access license)	930.56 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/859639
