CRIS Current Research Information System

Recent ground-breaking works have shown that deep neural networks can be trained end-to-end to regress dense disparity maps directly from image pairs. Computer generated imagery is deployed to gather the large data corpus required to train such networks, an additional fine-tuning allowing to adapt the model to work well also on real and possibly diverse environments. Yet, besides a few public datasets such as Kitti, the ground-truth needed to adapt the network to a new scenario is hardly available in practice. In this paper we propose a novel unsupervised adaptation approach that enables to fine-tune a deep learning stereo model without any ground-truth information. We rely on off-the-shelf stereo algorithms together with state-of-the-art confidence measures, the latter able to ascertain upon correctness of the measurements yielded by former. Thus, we train the network based on a novel loss-function that penalizes predictions disagreeing with the highly confident disparities provided by the algorithm and enforces a smoothness constraint. Experiments on popular datasets (KITTI 2012, KITTI 2015 and Middlebury 2014) and other challenging test images demonstrate the effectiveness of our proposal.

Tonioni, A., Poggi, M., Mattoccia, S., Luigi Di, S. (2017). Unsupervised Adaptation for Deep Stereo. IEEE [10.1109/ICCV.2017.178].

Unsupervised Adaptation for Deep Stereo

Tonioni, Alessio^{Membro del Collaboration Group};Poggi, Matteo^{Membro del Collaboration Group};Mattoccia, Stefano^{Membro del Collaboration Group};Stefano, Luigi Di^{Membro del Collaboration Group}

2017

Abstract

Recent ground-breaking works have shown that deep neural networks can be trained end-to-end to regress dense disparity maps directly from image pairs. Computer generated imagery is deployed to gather the large data corpus required to train such networks, an additional fine-tuning allowing to adapt the model to work well also on real and possibly diverse environments. Yet, besides a few public datasets such as Kitti, the ground-truth needed to adapt the network to a new scenario is hardly available in practice. In this paper we propose a novel unsupervised adaptation approach that enables to fine-tune a deep learning stereo model without any ground-truth information. We rely on off-the-shelf stereo algorithms together with state-of-the-art confidence measures, the latter able to ascertain upon correctness of the measurements yielded by former. Thus, we train the network based on a novel loss-function that penalizes predictions disagreeing with the highly confident disparities provided by the algorithm and enforces a smoothness constraint. Experiments on popular datasets (KITTI 2012, KITTI 2015 and Middlebury 2014) and other challenging test images demonstrate the effectiveness of our proposal.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2017
			
	Titolo del volume
	
				Proceedings 16th edition of the IEEE International Conference on Computer Vision
			
	Pagina iniziale
	
				1614
			
	Pagina finale
	
				1622
			
	Codice DOI
	
				https://dx.doi.org/10.1109/ICCV.2017.178
			
	Citazione
	
				Tonioni, A., Poggi, M., Mattoccia, S., Luigi Di, S. (2017). Unsupervised Adaptation for Deep Stereo. IEEE [10.1109/ICCV.2017.178].
			
	Tutti gli autori
	
						Tonioni, Alessio; Poggi, Matteo; Mattoccia, Stefano; Luigi Di, Stefano
					
	Appare nelle tipologie:
	
				4.01 Contributo in Atti di convegno

File in questo prodotto:

Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/619379

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

100

87

ND

social impact