Although recent semantic segmentation methods have made remarkable progress, they still rely on large amounts of annotated training data, which are often infeasible to collect in the autonomous driving scenario. Previous works usually tackle this issue with Unsupervised Domain Adaptation (UDA), which entails training a network on synthetic images and applying the model to real ones while minimizing the discrepancy between the two domains. Yet, these techniques do not consider additional information that may be obtained from other tasks. Differently, we propose to exploit self-supervised monocular depth estimation to improve UDA for semantic segmentation. On one hand, we deploy depth to realize a plug-in component which can inject complementary geometric cues into any existing UDA method. We further rely on depth to generate a large and varied set of samples to Self-Train the final model. Our whole proposal allows for achieving state-of-the-art performance (58.8 mIoU) in the GTA5->CS benchmark benchmark. Code is available at https://github.com/CVLAB-Unibo/d4-dbst.

Cardace, A., DE LUIGI, L., ZAMA RAMIREZ, P., Salti, S., DI STEFANO, L. (2022). Plugging Self-Supervised Monocular Depth Into Unsupervised Domain Adaptation for Semantic Segmentation. IEEE [10.1109/WACV51458.2022.00206].

Plugging Self-Supervised Monocular Depth Into Unsupervised Domain Adaptation for Semantic Segmentation

Adriano Cardace
;
Luca De Luigi;Pierluigi Zama Ramirez;Samuele Salti;Luigi Di Stefano
2022

Abstract

Although recent semantic segmentation methods have made remarkable progress, they still rely on large amounts of annotated training data, which are often infeasible to collect in the autonomous driving scenario. Previous works usually tackle this issue with Unsupervised Domain Adaptation (UDA), which entails training a network on synthetic images and applying the model to real ones while minimizing the discrepancy between the two domains. Yet, these techniques do not consider additional information that may be obtained from other tasks. Differently, we propose to exploit self-supervised monocular depth estimation to improve UDA for semantic segmentation. On one hand, we deploy depth to realize a plug-in component which can inject complementary geometric cues into any existing UDA method. We further rely on depth to generate a large and varied set of samples to Self-Train the final model. Our whole proposal allows for achieving state-of-the-art performance (58.8 mIoU) in the GTA5->CS benchmark benchmark. Code is available at https://github.com/CVLAB-Unibo/d4-dbst.
2022
2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)
1999
2009
Cardace, A., DE LUIGI, L., ZAMA RAMIREZ, P., Salti, S., DI STEFANO, L. (2022). Plugging Self-Supervised Monocular Depth Into Unsupervised Domain Adaptation for Semantic Segmentation. IEEE [10.1109/WACV51458.2022.00206].
Cardace, Adriano; DE LUIGI, Luca; ZAMA RAMIREZ, Pierluigi; Salti, Samuele; DI STEFANO, Luigi
File in questo prodotto:
File Dimensione Formato  
main.pdf

accesso aperto

Tipo: Postprint
Licenza: Licenza per accesso libero gratuito
Dimensione 5.67 MB
Formato Adobe PDF
5.67 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/864973
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 14
  • ???jsp.display-item.citation.isi??? 3
social impact