Spectral clustering is a powerful technique for data partitioning, but determining the optimal number of clusters remains challenging. This article introduces ALLE (ALgebraic Laplacian Estimator), an automatic method for estimating the number of clusters within the spectral clustering framework. By formulating the cluster recovery problem as a penalized minimization task, ALLE is able to systematically recover the number of clusters and the embedding space by assuming for the Laplacian matrix a low-rank plus sparse decomposition. Specifically, ALLE recovers the low-rank representation of the Laplacian matrix using nuclear norm plus ℓ1-norm penalization. ALLE is computed via a proximal gradient algorithm alternating Singular Value Thresholding and Soft Thresholding, and it’s very good performance is shown via a simulation study.

Di Nuzzo, C., Farne, M. (2025). Recovering the Number of Clusters From a Laplacian Matrix by Nuclear Norm Penalization. STATISTICAL ANALYSIS AND DATA MINING, 18(4 (August)), 1-6 [10.1002/sam.70031].

Recovering the Number of Clusters From a Laplacian Matrix by Nuclear Norm Penalization

Farne, Matteo
2025

Abstract

Spectral clustering is a powerful technique for data partitioning, but determining the optimal number of clusters remains challenging. This article introduces ALLE (ALgebraic Laplacian Estimator), an automatic method for estimating the number of clusters within the spectral clustering framework. By formulating the cluster recovery problem as a penalized minimization task, ALLE is able to systematically recover the number of clusters and the embedding space by assuming for the Laplacian matrix a low-rank plus sparse decomposition. Specifically, ALLE recovers the low-rank representation of the Laplacian matrix using nuclear norm plus ℓ1-norm penalization. ALLE is computed via a proximal gradient algorithm alternating Singular Value Thresholding and Soft Thresholding, and it’s very good performance is shown via a simulation study.
2025
Di Nuzzo, C., Farne, M. (2025). Recovering the Number of Clusters From a Laplacian Matrix by Nuclear Norm Penalization. STATISTICAL ANALYSIS AND DATA MINING, 18(4 (August)), 1-6 [10.1002/sam.70031].
Di Nuzzo, Cinzia; Farne, Matteo
File in questo prodotto:
File Dimensione Formato  
DiNuzzo_Farnè_Laplacian_SAM_2025.pdf

accesso aperto

Tipo: Versione (PDF) editoriale / Version Of Record
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione 620.76 kB
Formato Adobe PDF
620.76 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1019035
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 0
social impact