NeRF-Supervised Deep Stereo

Tosi, Fabio; Tonioni, Alessio; De Gregorio, Daniele; Poggi, Matteo

2023

Abstract

We introduce a novel framework for training deep stereo networks effortlessly and without any ground truth. By leveraging state-of-the-art neural rendering solutions, we generate stereo training data from image sequences collected with a single handheld camera. A NeRF-supervised training procedure is then carried out on this data, exploiting rendered stereo triplets to compensate for occlusions and depth maps as proxy labels. This results in stereo networks capable of predicting sharp and detailed disparity maps. Experimental results show that models trained under this regime yield a 30-40% improvement over existing self-supervised methods on the challenging Middlebury dataset, closing the gap to supervised models and, in most cases, outperforming them at zero-shot generalization.

Year	2023
Volume title	Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023)
First page	855
Last page	866
Series	Proceedings IEEE Computer Society Conference on Computer Vision and Pattern Recognition
DOI	https://dx.doi.org/10.1109/CVPR52729.2023.00089
Citation	Tosi, F., Tonioni, A., De Gregorio, D., Poggi, M. (2023). NeRF-Supervised Deep Stereo. Los Alamitos, CA, USA: IEEE Computer Society [10.1109/CVPR52729.2023.00089].
All authors	Tosi, Fabio; Tonioni, Alessio; De Gregorio, Daniele; Poggi, Matteo
Appears in types	4.01 Contribution in conference proceedings

Files in this item:

File	Size	Format
Tosi_NeRF-Supervised_Deep_Stereo_CVPR_2023_paper.pdf (open access; type: publisher's version (PDF); license: free open access license)	2.53 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/957744
