CRIS Current Research Information System

We address the problem of registering synchronized color (RGB) and multi-spectral (MS) images featuring very different resolution by solving stereo matching correspondences. Purposely, we introduce a novel RGB-MS dataset framing 13 different scenes in indoor environments and providing a total of 34 image pairs annotated with semi-dense, high-resolution ground-truth labels in the form of disparity maps. To tackle the task, we propose a deep learning architecture trained in a self-supervised manner by exploiting a further RGB camera, required only during training data acquisition. In this setup, we can conveniently learn cross-modal matching in the absence of ground-truth labels by distilling knowledge from an easier RGB-RGB matching task based on a collection of about 11K unlabeled image triplets. Experiments show that the proposed pipeline sets a good performance bar (1.16 pixels average registration error) for future research on this novel, challenging task.

Tosi, F., Ramirez, P.Z., Poggi, M., Salti, S., Mattoccia, S., Di Stefano, L. (2022). RGB-Multispectral Matching: Dataset, Learning Methodology, Evaluation [10.1109/CVPR52688.2022.01549].

RGB-Multispectral Matching: Dataset, Learning Methodology, Evaluation

Tosi, Fabio;Ramirez, Pierluigi Zama;Poggi, Matteo;Salti, Samuele;Mattoccia, Stefano;Di Stefano, Luigi

2022

Abstract

We address the problem of registering synchronized color (RGB) and multi-spectral (MS) images featuring very different resolution by solving stereo matching correspondences. Purposely, we introduce a novel RGB-MS dataset framing 13 different scenes in indoor environments and providing a total of 34 image pairs annotated with semi-dense, high-resolution ground-truth labels in the form of disparity maps. To tackle the task, we propose a deep learning architecture trained in a self-supervised manner by exploiting a further RGB camera, required only during training data acquisition. In this setup, we can conveniently learn cross-modal matching in the absence of ground-truth labels by distilling knowledge from an easier RGB-RGB matching task based on a collection of about 11K unlabeled image triplets. Experiments show that the proposed pipeline sets a good performance bar (1.16 pixels average registration error) for future research on this novel, challenging task.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2022
			
	Titolo del volume
	
				2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
			
	Pagina iniziale
	
				15937
			
	Pagina finale
	
				15947
			
	Collana/Serie
	
				PROCEEDINGS IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION
			
	Codice DOI
	
				https://dx.doi.org/10.1109/CVPR52688.2022.01549
			
	Citazione
	
				Tosi, F., Ramirez, P.Z., Poggi, M., Salti, S., Mattoccia, S., Di Stefano, L. (2022). RGB-Multispectral Matching: Dataset, Learning Methodology, Evaluation [10.1109/CVPR52688.2022.01549].
			
	Tutti gli autori
	
						Tosi, Fabio; Ramirez, Pierluigi Zama; Poggi, Matteo; Salti, Samuele; Mattoccia, Stefano; Di Stefano, Luigi
					
	Appare nelle tipologie:
	
				4.01 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
22_CVPR_Tosi_RGB-Multispectral_Matching_Dataset_Learning_Methodology_Evaluation.pdf accesso aperto Tipo: Postprint Licenza: Licenza per accesso libero gratuito Dimensione 703.65 kB Formato Adobe PDF Visualizza/Apri	703.65 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/895292

Citazioni

ND

5

1

social impact