We introduce a novel framework for training deep stereo networks effortlessly and without any ground-truth. By leveraging state-of-the-art neural rendering solutions, we generate stereo training data from image sequences collected with a single handheld camera. On top of them, a NeRF-supervised training procedure is carried out, from which we exploit rendered stereo triplets to compensate for occlusions and depth maps as proxy labels. This results in stereo networks capable of predicting sharp and detailed disparity maps. Experimental results show that models trained under this regime yield a 30-40% improvement over existing self-supervised methods on the challenging Middle-bury dataset, filling the gap to supervised models and, most times, outperforming them at zero-shot generalization.

Tosi, F., Tonioni, A., De Gregorio, D., Poggi, M. (2023). NeRF-Supervised Deep Stereo. 10662 LOS VAQUEROS CIRCLE, PO BOX 3014, LOS ALAMITOS, CA 90720-1264 USA : IEEE COMPUTER SOC [10.1109/CVPR52729.2023.00089].

NeRF-Supervised Deep Stereo

Tosi, Fabio;De Gregorio, Daniele;Poggi, Matteo
2023

Abstract

We introduce a novel framework for training deep stereo networks effortlessly and without any ground-truth. By leveraging state-of-the-art neural rendering solutions, we generate stereo training data from image sequences collected with a single handheld camera. On top of them, a NeRF-supervised training procedure is carried out, from which we exploit rendered stereo triplets to compensate for occlusions and depth maps as proxy labels. This results in stereo networks capable of predicting sharp and detailed disparity maps. Experimental results show that models trained under this regime yield a 30-40% improvement over existing self-supervised methods on the challenging Middle-bury dataset, filling the gap to supervised models and, most times, outperforming them at zero-shot generalization.
2023
Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023)
855
866
Tosi, F., Tonioni, A., De Gregorio, D., Poggi, M. (2023). NeRF-Supervised Deep Stereo. 10662 LOS VAQUEROS CIRCLE, PO BOX 3014, LOS ALAMITOS, CA 90720-1264 USA : IEEE COMPUTER SOC [10.1109/CVPR52729.2023.00089].
Tosi, Fabio; Tonioni, Alessio; De Gregorio, Daniele; Poggi, Matteo
File in questo prodotto:
File Dimensione Formato  
Tosi_NeRF-Supervised_Deep_Stereo_CVPR_2023_paper.pdf

accesso aperto

Tipo: Versione (PDF) editoriale
Licenza: Licenza per accesso libero gratuito
Dimensione 2.53 MB
Formato Adobe PDF
2.53 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/957744
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 13
  • ???jsp.display-item.citation.isi??? 6
social impact