
We present TemporalStereo, a highly efficient coarse-to-fine stereo matching network that exploits past geometry and context information to boost matching accuracy. The network leverages a sparse cost volume and is effective even when only a single stereo pair is given. Its distinctive ability to use spatio-temporal information across stereo sequences, however, allows TemporalStereo to alleviate problems such as occlusions and reflective regions while remaining highly efficient in this setting as well. Notably, our model, trained once on stereo videos, runs seamlessly in both single-pair and temporal modes. Experiments show that our network, which relies on camera motion, is robust even to dynamic objects when running on videos. We validate TemporalStereo through extensive experiments on synthetic (SceneFlow, TartanAir) and real (KITTI 2012, KITTI 2015) datasets. Our model achieves state-of-the-art performance on all of these datasets.

Zhang, Y., Poggi, M., Mattoccia, S. (2023). TemporalStereo: Efficient Spatial-Temporal Stereo Matching Network [10.1109/iros55552.2023.10341598].

TemporalStereo: Efficient Spatial-Temporal Stereo Matching Network

Zhang, Youmin; Poggi, Matteo; Mattoccia, Stefano

2023



Year: 2023
Volume title: 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
First page: 9528
Last page: 9535
Series: PROCEEDINGS OF THE ... IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS
DOI: https://dx.doi.org/10.1109/iros55552.2023.10341598
			
All authors: Zhang, Youmin; Poggi, Matteo; Mattoccia, Stefano
Publication type: 4.01 Conference proceedings contribution

Files in this record:

File	Access	Type	License	Size	Format
2211.13755v2.pdf	Open access	Postprint / Author's Accepted Manuscript (AAM), version accepted for publication after peer review	Free open-access license	18.11 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/961717
