Bird's Eye View (BEV) semantic maps have recently garnered a lot of attention as a useful representation of the environment to tackle assisted and autonomous driving tasks. However most of the existing work focuses on the fully supervised setting training networks on large annotated datasets. In this work we present RendBEV a new method for the self-supervised training of BEV semantic segmentation networks leveraging differentiable volumetric rendering to receive supervision from semantic perspective views computed by a 2D semantic segmentation model. Our method enables zero-shot BEV semantic segmentation and already delivers competitive results in this challenging setting. When used as pretraining to then fine-tune on labeled BEV ground truth our method significantly boosts performance in low-annotation regimes and sets a new state of the art when fine-tuning on all available labels.

Pineiro, H., Taccari, L., Pjetri, A., Sambo, F., Salti, S. (2025). RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird's Eye View Segmentation. IEEE/CVF [10.1109/WACV61041.2025.00062].

RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird's Eye View Segmentation

Henrique Pineiro
Primo
;
Samuele Salti
Ultimo
2025

Abstract

Bird's Eye View (BEV) semantic maps have recently garnered a lot of attention as a useful representation of the environment to tackle assisted and autonomous driving tasks. However most of the existing work focuses on the fully supervised setting training networks on large annotated datasets. In this work we present RendBEV a new method for the self-supervised training of BEV semantic segmentation networks leveraging differentiable volumetric rendering to receive supervision from semantic perspective views computed by a 2D semantic segmentation model. Our method enables zero-shot BEV semantic segmentation and already delivers competitive results in this challenging setting. When used as pretraining to then fine-tune on labeled BEV ground truth our method significantly boosts performance in low-annotation regimes and sets a new state of the art when fine-tuning on all available labels.
2025
IEEE/CVF Winter Conference on Applications of Computer Vision
535
544
Pineiro, H., Taccari, L., Pjetri, A., Sambo, F., Salti, S. (2025). RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird's Eye View Segmentation. IEEE/CVF [10.1109/WACV61041.2025.00062].
Pineiro, Henrique; Taccari, Leonardo; Pjetri, Aurel; Sambo, Francesco; Salti, Samuele
File in questo prodotto:
File Dimensione Formato  
Monteagudo_RendBEV_Semantic_Novel_View_Synthesis_for_Self-Supervised_B.pdf

Open Access dal 08/10/2025

Tipo: Postprint / Author's Accepted Manuscript (AAM) - versione accettata per la pubblicazione dopo la peer-review
Licenza: Licenza per accesso libero gratuito
Dimensione 3.73 MB
Formato Adobe PDF
3.73 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1009703
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact