CRIS Current Research Information System

Bird's Eye View (BEV) semantic maps have recently garnered a lot of attention as a useful representation of the environment to tackle assisted and autonomous driving tasks. However most of the existing work focuses on the fully supervised setting training networks on large annotated datasets. In this work we present RendBEV a new method for the self-supervised training of BEV semantic segmentation networks leveraging differentiable volumetric rendering to receive supervision from semantic perspective views computed by a 2D semantic segmentation model. Our method enables zero-shot BEV semantic segmentation and already delivers competitive results in this challenging setting. When used as pretraining to then fine-tune on labeled BEV ground truth our method significantly boosts performance in low-annotation regimes and sets a new state of the art when fine-tuning on all available labels.

Pineiro, H., Taccari, L., Pjetri, A., Sambo, F., Salti, S. (2025). RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird's Eye View Segmentation. IEEE/CVF [10.1109/WACV61041.2025.00062].

RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird's Eye View Segmentation

Henrique Pineiro^Primo;Leonardo Taccari;Aurel Pjetri;Francesco Sambo;Samuele Salti^Ultimo

2025

Abstract

Bird's Eye View (BEV) semantic maps have recently garnered a lot of attention as a useful representation of the environment to tackle assisted and autonomous driving tasks. However most of the existing work focuses on the fully supervised setting training networks on large annotated datasets. In this work we present RendBEV a new method for the self-supervised training of BEV semantic segmentation networks leveraging differentiable volumetric rendering to receive supervision from semantic perspective views computed by a 2D semantic segmentation model. Our method enables zero-shot BEV semantic segmentation and already delivers competitive results in this challenging setting. When used as pretraining to then fine-tune on labeled BEV ground truth our method significantly boosts performance in low-annotation regimes and sets a new state of the art when fine-tuning on all available labels.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2025
			
	Titolo del volume
	
				IEEE/CVF Winter Conference on Applications of Computer Vision
			
	Pagina iniziale
	
				535
			
	Pagina finale
	
				544
			
	Collana/Serie
	
				IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION
			
	Codice DOI
	
				https://dx.doi.org/10.1109/WACV61041.2025.00062
			
	Citazione
	
				Pineiro, H., Taccari, L., Pjetri, A., Sambo, F., Salti, S. (2025). RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird's Eye View Segmentation. IEEE/CVF [10.1109/WACV61041.2025.00062].
			
	Tutti gli autori
	
						Pineiro, Henrique; Taccari, Leonardo; Pjetri, Aurel; Sambo, Francesco; Salti, Samuele
					
	Appare nelle tipologie:
	
				4.01 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
Monteagudo_RendBEV_Semantic_Novel_View_Synthesis_for_Self-Supervised_B.pdf Open Access dal 08/10/2025 Tipo: Postprint / Author's Accepted Manuscript (AAM) - versione accettata per la pubblicazione dopo la peer-review Licenza: Licenza per accesso libero gratuito Dimensione 3.73 MB Formato Adobe PDF Visualizza/Apri	3.73 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1009703

Citazioni

ND

0

0

social impact