This paper explores trimmed factorial k-means (TFKM) in a clustering application to a cookie dataset. TFKM is a robust version of factorial k-means, where a robust covariance matrix input is used, and outliers in the identified reduced space are iteratively removed via a trimming procedure. The selected latent rank, number of clusters, and outlier proportion are those which maximize Hartigan’s statistic. The TFKM partition is thoroughly compared to two alternatives, like a robust tandem procedure and trimmed k-means, via a simulation study. An Internet cookie example shows that TFKM provides on analyzed data the most parsimonious and informative partition by cluster homogeneity.
Farne', M., Camillo, F. (2025). Trimmed Factorial K-Means: A Clustering Application to a Cookie Dataset. Cham : Springer [10.1007/978-3-031-84702-8_12].
Trimmed Factorial K-Means: A Clustering Application to a Cookie Dataset
Farne Matteo
;Camillo Furio
2025
Abstract
This paper explores trimmed factorial k-means (TFKM) in a clustering application to a cookie dataset. TFKM is a robust version of factorial k-means, where a robust covariance matrix input is used, and outliers in the identified reduced space are iteratively removed via a trimming procedure. The selected latent rank, number of clusters, and outlier proportion are those which maximize Hartigan’s statistic. The TFKM partition is thoroughly compared to two alternatives, like a robust tandem procedure and trimmed k-means, via a simulation study. An Internet cookie example shows that TFKM provides on analyzed data the most parsimonious and informative partition by cluster homogeneity.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


