The monitoring of the expression profiles of thousands of genes have proved to be particularly promising for biological classification, particularly for cancer diagnosis. However, microarray data present major challenges due to the complex, multiclass nature and the overwhelming number of variables characterizing gene expression profiles. We introduce a methodology that combine dimension reduction method and classification based on finite mixture of Gaussian densities. Information on the dimension reduction subspace is based on the variation of components means for each class, which in turn are obtained by modeling the within class distribution of the predictors through finite mixtures of Gaussian densities. The proposed approach is applied to the leukemia data, a well known dataset in the microarray literature. We show that the combination of dimension reduction and model-based clustering is a powerful technique to find groups among gene expression data.

Scrucca, L., Bar-Hen, A. (2013). A model-based dimension reduction approach to classification of gene expression data. Boca Raton, FL : Springer [10.1007/978-3-642-35588-2_21].

A model-based dimension reduction approach to classification of gene expression data

Scrucca L.
Primo
;
2013

Abstract

The monitoring of the expression profiles of thousands of genes have proved to be particularly promising for biological classification, particularly for cancer diagnosis. However, microarray data present major challenges due to the complex, multiclass nature and the overwhelming number of variables characterizing gene expression profiles. We introduce a methodology that combine dimension reduction method and classification based on finite mixture of Gaussian densities. Information on the dimension reduction subspace is based on the variation of components means for each class, which in turn are obtained by modeling the within class distribution of the predictors through finite mixtures of Gaussian densities. The proposed approach is applied to the leukemia data, a well known dataset in the microarray literature. We show that the combination of dimension reduction and model-based clustering is a powerful technique to find groups among gene expression data.
2013
Advances in Theoretical and Applied Statistics
221
230
Scrucca, L., Bar-Hen, A. (2013). A model-based dimension reduction approach to classification of gene expression data. Boca Raton, FL : Springer [10.1007/978-3-642-35588-2_21].
Scrucca, L.; Bar-Hen, A.
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1011881
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact