Unsupervised Adaptation for Deep Stereo

Tonioni, Alessio; Poggi, Matteo; Mattoccia, Stefano; Stefano, Luigi Di

doi:10.1109/ICCV.2017.178

Recent ground-breaking works have shown that deep neural networks can be trained end-to-end to regress dense disparity maps directly from image pairs. Computer generated imagery is deployed to gather the large data corpus required to train such networks, an additional fine-tuning allowing to adapt the model to work well also on real and possibly diverse environments. Yet, besides a few public datasets such as Kitti, the ground-truth needed to adapt the network to a new scenario is hardly available in practice. In this paper we propose a novel unsupervised adaptation approach that enables to fine-tune a deep learning stereo model without any ground-truth information. We rely on off-the-shelf stereo algorithms together with state-of-the-art confidence measures, the latter able to ascertain upon correctness of the measurements yielded by former. Thus, we train the network based on a novel loss-function that penalizes predictions disagreeing with the highly confident disparities provided by the algorithm and enforces a smoothness constraint. Experiments on popular datasets (KITTI 2012, KITTI 2015 and Middlebury 2014) and other challenging test images demonstrate the effectiveness of our proposal.

Unsupervised Adaptation for Deep Stereo / Tonioni, Alessio; Poggi, Matteo; Mattoccia, Stefano; Luigi Di, Stefano. - ELETTRONICO. - (2017), pp. 1614-1622. (Intervento presentato al convegno IEEE International Conference on Computer Vision tenutosi a Venice, Italy nel October 22-29, 2017) [10.1109/ICCV.2017.178].