NeRF-based SLAM has recently achieved promising results in tracking and reconstruction. However, existing methods face challenges in providing sufficient scene representation, capturing structural information, and maintaining global consistency in scenes emerging significant movement or being forgotten. To this end, we present HS-SLAM to tackle these problems. To enhance scene representation capacity, we propose a hybrid encoding network that combines the complementary strengths of hash-grid, tri-planes, and one-blob, improving the completeness and smoothness of reconstruction. Additionally, we introduce structural supervision by sampling patches of non-local pixels rather than individual rays to better capture the scene structure. To ensure global consistency, we implement an active global bundle adjustment (BA) to eliminate camera drifts and mitigate accumulative errors. Experimental results demonstrate that HS-SLAM outperforms the baselines in tracking and reconstruction accuracy while maintaining the efficiency required for robotics.

Gong, Z., Tosi, F., Zhang, Y., Mattoccia, S., Poggi, M. (2025). HS-SLAM: Hybrid Representation with Structural Supervision for Improved Dense SLAM. Institute of Electrical and Electronics Engineers Inc. [10.1109/ICRA55743.2025.11127551].

HS-SLAM: Hybrid Representation with Structural Supervision for Improved Dense SLAM

Gong Z.;Tosi F.;Zhang Y.;Mattoccia S.;Poggi M.
2025

Abstract

NeRF-based SLAM has recently achieved promising results in tracking and reconstruction. However, existing methods face challenges in providing sufficient scene representation, capturing structural information, and maintaining global consistency in scenes emerging significant movement or being forgotten. To this end, we present HS-SLAM to tackle these problems. To enhance scene representation capacity, we propose a hybrid encoding network that combines the complementary strengths of hash-grid, tri-planes, and one-blob, improving the completeness and smoothness of reconstruction. Additionally, we introduce structural supervision by sampling patches of non-local pixels rather than individual rays to better capture the scene structure. To ensure global consistency, we implement an active global bundle adjustment (BA) to eliminate camera drifts and mitigate accumulative errors. Experimental results demonstrate that HS-SLAM outperforms the baselines in tracking and reconstruction accuracy while maintaining the efficiency required for robotics.
2025
Proceedings - IEEE International Conference on Robotics and Automation
8464
8470
Gong, Z., Tosi, F., Zhang, Y., Mattoccia, S., Poggi, M. (2025). HS-SLAM: Hybrid Representation with Structural Supervision for Improved Dense SLAM. Institute of Electrical and Electronics Engineers Inc. [10.1109/ICRA55743.2025.11127551].
Gong, Z.; Tosi, F.; Zhang, Y.; Mattoccia, S.; Poggi, M.
File in questo prodotto:
File Dimensione Formato  
HS_SLAM_Arxiv.pdf

embargo fino al 02/09/2027

Tipo: Postprint / Author's Accepted Manuscript (AAM) - versione accettata per la pubblicazione dopo la peer-review
Licenza: Licenza per accesso libero gratuito
Dimensione 3.23 MB
Formato Adobe PDF
3.23 MB Adobe PDF   Visualizza/Apri   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1049022
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 2
  • OpenAlex ND
social impact