In cluster analysis it is generally assumed that one single cluster structure is contained in a data matrix, and that this structure may be confined to a subset of the observed variables. This paper investigates a new solution that simultaneously selects the relevant variables and discovers multiple cluster structures from possibly dependent subsets of variables. The basic idea is to recast the problem as a model comparison problem in which conditional independence assumptions are introduced using multivariate regression models with correlated and non-normal error terms. A stepwise procedure for selecting a locally optimal model is also proposed. Results obtained from a Monte Carlo study are briefly described.

Detecting multiple cluster structures through model-based clustering methods / G. Soffritti; G. Galimberti. - STAMPA. - (2009), pp. 263-266. (Intervento presentato al convegno 7° Meeting of the Classification and Data Analysis Group of the Italian Statistical Society tenutosi a Catania, Italy nel September 9-11 2009).

Detecting multiple cluster structures through model-based clustering methods

SOFFRITTI, GABRIELE;GALIMBERTI, GIULIANO
2009

Abstract

In cluster analysis it is generally assumed that one single cluster structure is contained in a data matrix, and that this structure may be confined to a subset of the observed variables. This paper investigates a new solution that simultaneously selects the relevant variables and discovers multiple cluster structures from possibly dependent subsets of variables. The basic idea is to recast the problem as a model comparison problem in which conditional independence assumptions are introduced using multivariate regression models with correlated and non-normal error terms. A stepwise procedure for selecting a locally optimal model is also proposed. Results obtained from a Monte Carlo study are briefly described.
2009
Classification and Data Analysis 2009
263
266
Detecting multiple cluster structures through model-based clustering methods / G. Soffritti; G. Galimberti. - STAMPA. - (2009), pp. 263-266. (Intervento presentato al convegno 7° Meeting of the Classification and Data Analysis Group of the Italian Statistical Society tenutosi a Catania, Italy nel September 9-11 2009).
G. Soffritti; G. Galimberti
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/84942
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact