Recent ground-breaking works have shown that deep neural networks can be trained end-to-end to regress dense disparity maps directly from image pairs. Computer generated imagery is deployed to gather the large data corpus required to train such networks, an additional fine-tuning allowing to adapt the model to work well also on real and possibly diverse environments. Yet, besides a few public datasets such as Kitti, the ground-truth needed to adapt the network to a new scenario is hardly available in practice. In this paper we propose a novel unsupervised adaptation approach that enables to fine-tune a deep learning stereo model without any ground-truth information. We rely on off-the-shelf stereo algorithms together with state-of-the-art confidence measures, the latter able to ascertain upon correctness of the measurements yielded by former. Thus, we train the network based on a novel loss-function that penalizes predictions disagreeing with the highly confident disparities provided by the algorithm and enforces a smoothness constraint. Experiments on popular datasets (KITTI 2012, KITTI 2015 and Middlebury 2014) and other challenging test images demonstrate the effectiveness of our proposal.

Unsupervised Adaptation for Deep Stereo / Tonioni, Alessio; Poggi, Matteo; Mattoccia, Stefano; Luigi Di, Stefano. - ELETTRONICO. - (2017), pp. 1614-1622. (Intervento presentato al convegno IEEE International Conference on Computer Vision tenutosi a Venice, Italy nel October 22-29, 2017) [10.1109/ICCV.2017.178].

Unsupervised Adaptation for Deep Stereo

Tonioni, Alessio
Membro del Collaboration Group
;
Poggi, Matteo
Membro del Collaboration Group
;
Mattoccia, Stefano
Membro del Collaboration Group
;
Stefano, Luigi Di
Membro del Collaboration Group
2017

Abstract

Recent ground-breaking works have shown that deep neural networks can be trained end-to-end to regress dense disparity maps directly from image pairs. Computer generated imagery is deployed to gather the large data corpus required to train such networks, an additional fine-tuning allowing to adapt the model to work well also on real and possibly diverse environments. Yet, besides a few public datasets such as Kitti, the ground-truth needed to adapt the network to a new scenario is hardly available in practice. In this paper we propose a novel unsupervised adaptation approach that enables to fine-tune a deep learning stereo model without any ground-truth information. We rely on off-the-shelf stereo algorithms together with state-of-the-art confidence measures, the latter able to ascertain upon correctness of the measurements yielded by former. Thus, we train the network based on a novel loss-function that penalizes predictions disagreeing with the highly confident disparities provided by the algorithm and enforces a smoothness constraint. Experiments on popular datasets (KITTI 2012, KITTI 2015 and Middlebury 2014) and other challenging test images demonstrate the effectiveness of our proposal.
2017
Proceedings 16th edition of the IEEE International Conference on Computer Vision
1614
1622
Unsupervised Adaptation for Deep Stereo / Tonioni, Alessio; Poggi, Matteo; Mattoccia, Stefano; Luigi Di, Stefano. - ELETTRONICO. - (2017), pp. 1614-1622. (Intervento presentato al convegno IEEE International Conference on Computer Vision tenutosi a Venice, Italy nel October 22-29, 2017) [10.1109/ICCV.2017.178].
Tonioni, Alessio; Poggi, Matteo; Mattoccia, Stefano; Luigi Di, Stefano
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/619379
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 86
  • ???jsp.display-item.citation.isi??? 68
social impact