Self-supervised Monocular Depth Estimation for Dynamic Objects with Ground Propagation

Li H.; Poggi M.; Tosi F.; Mattoccia S.

2025

Abstract

Self-supervised single-view depth estimation, trained on video sequences, faces significant challenges when dynamic objects are present in the training data, as they violate the basic multi-view geometry assumptions used to compute photometric losses. We propose a novel approach that leverages the relationship between the depth of moving objects and their ground contact points. By iteratively propagating ground features to moving targets in perceptual layers, we recalibrate the depth of dynamic entities while preserving details. Our method maintains the end-to-end training paradigm without additional networks or complex training procedures. Our experiments demonstrate that our method achieves state-of-the-art performance when estimating depth for dynamic objects and attains superior generalization compared to existing approaches. The relevant experimental code can be accessed at: https://github.com/LiHuanLi/GroundMono

Year	2025
Volume title	IEEE International Conference on Intelligent Robots and Systems
First page	2384
Last page	2391
Series	PROCEEDINGS OF THE ... IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS
DOI	https://dx.doi.org/10.1109/IROS60139.2025.11246123
Citation	Li, H., Poggi, M., Tosi, F., Mattoccia, S. (2025). Self-supervised Monocular Depth Estimation for Dynamic Objects with Ground Propagation. Institute of Electrical and Electronics Engineers Inc. [10.1109/IROS60139.2025.11246123].
All authors	Li, H.; Poggi, M.; Tosi, F.; Mattoccia, S.
Publication type	4.01 Contribution in conference proceedings

Files in this record:

File	Size	Format
IROS_25.pdf (embargoed until 26 November 2027; type: Postprint / Author's Accepted Manuscript (AAM), the version accepted for publication after peer review; license: free open-access license)	1.47 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/1049057
