CRIS Current Research Information System

Two robustness criteria are presented that are applicable to general clustering methods. Robustness and stability in cluster analysis are not only data dependent, but even cluster dependent. Robustness is in the present paper defined as a property of not only the clustering method, but also of every individual cluster in a data set. The main principles are: (a) dissimilarity measurement of an original cluster with the most similar cluster in the induced clustering obtained by adding data points, (b) the dissolution point, which is an adaptation of the breakdown point concept to single clusters, (c) isolation robustness: given a clustering method, is it possible to join, by addition of g points, arbitrarily well separated clusters? Results are derived for k-means, k-medoids (k estimated by average silhouette width), trimmed k-means, mixture models (with and without noise component, with and without estimation of the number of clusters by BIC), single and complete linkage. (C) 2007 Elsevier Inc. All rights reserved.

Hennig, C. (2008). Dissolution point and isolation robustness: Robustness criteria for general cluster analysis methods. JOURNAL OF MULTIVARIATE ANALYSIS, 99(6), 1154-1176 [10.1016/j.jmva.2007.07.002].

Dissolution point and isolation robustness: Robustness criteria for general cluster analysis methods

Hennig C

2008

Abstract

Two robustness criteria are presented that are applicable to general clustering methods. Robustness and stability in cluster analysis are not only data dependent, but even cluster dependent. Robustness is in the present paper defined as a property of not only the clustering method, but also of every individual cluster in a data set. The main principles are: (a) dissimilarity measurement of an original cluster with the most similar cluster in the induced clustering obtained by adding data points, (b) the dissolution point, which is an adaptation of the breakdown point concept to single clusters, (c) isolation robustness: given a clustering method, is it possible to join, by addition of g points, arbitrarily well separated clusters? Results are derived for k-means, k-medoids (k estimated by average silhouette width), trimmed k-means, mixture models (with and without noise component, with and without estimation of the number of clusters by BIC), single and complete linkage. (C) 2007 Elsevier Inc. All rights reserved.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2008
			
	Rivista
	
				JOURNAL OF MULTIVARIATE ANALYSIS
			
	Codice DOI
	
				https://dx.doi.org/10.1016/j.jmva.2007.07.002
			
	Citazione
	
				Hennig, C. (2008). Dissolution point and isolation robustness: Robustness criteria for general cluster analysis methods. JOURNAL OF MULTIVARIATE ANALYSIS, 99(6), 1154-1176 [10.1016/j.jmva.2007.07.002].
			
	Tutti gli autori
	
						Hennig, C

File in questo prodotto:

Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1031444

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

ND

121

social impact