Self-supervised single-view depth estimation, trained on video sequences, faces significant challenges when dynamic objects are present in the training data, as they violate the basic multi-view geometry assumptions used to compute photometric losses. We propose a novel approach that leverages the relationship between the depth of moving objects and their ground contact points. By iteratively propagating ground features to moving targets in perceptual layers, we recalibrate the depth of dynamic entities while preserving details. Our method maintains the end-to-end training paradigm without additional networks or complex training procedures. Our experiments demonstrate that our method achieves state-of-the-art performance when estimating depth for dynamic objects and attains superior generalization compared to existing approaches. The relevant experimental code can be accessed at: https://github.com/LiHuanLi/GroundMono

Li, H., Poggi, M., Tosi, F., Mattoccia, S. (2025). Self-supervised Monocular Depth Estimation for Dynamic Objects with Ground Propagation. Institute of Electrical and Electronics Engineers Inc. [10.1109/IROS60139.2025.11246123].

Self-supervised Monocular Depth Estimation for Dynamic Objects with Ground Propagation

Poggi M.;Tosi F.;Mattoccia S.
2025

Abstract

Self-supervised single-view depth estimation, trained on video sequences, faces significant challenges when dynamic objects are present in the training data, as they violate the basic multi-view geometry assumptions used to compute photometric losses. We propose a novel approach that leverages the relationship between the depth of moving objects and their ground contact points. By iteratively propagating ground features to moving targets in perceptual layers, we recalibrate the depth of dynamic entities while preserving details. Our method maintains the end-to-end training paradigm without additional networks or complex training procedures. Our experiments demonstrate that our method achieves state-of-the-art performance when estimating depth for dynamic objects and attains superior generalization compared to existing approaches. The relevant experimental code can be accessed at: https://github.com/LiHuanLi/GroundMono
2025
IEEE International Conference on Intelligent Robots and Systems
2384
2391
Li, H., Poggi, M., Tosi, F., Mattoccia, S. (2025). Self-supervised Monocular Depth Estimation for Dynamic Objects with Ground Propagation. Institute of Electrical and Electronics Engineers Inc. [10.1109/IROS60139.2025.11246123].
Li, H.; Poggi, M.; Tosi, F.; Mattoccia, S.
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1049057
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact