Background: High-throughput measurement of transcript intensities using Affymetrix type oligonucleotide microarrays has produced a massive quantity of data during the last decade. Different preprocessing techniques exist to convert the raw signal intensities measured by these chips into gene expression estimates. Although these techniques have been widely benchmarked in the context of differential gene expression analysis, there are only few examples where their performance has been assessed in respect to coexpression-based studies such as sample classification.Results: In the present paper we benchmark the three most used normalization procedures (MAS5, RMA and GCRMA) in the context of inter-array correlation analysis, confirming and extending the finding that RMA and GCRMA consistently overestimate sample similarity upon normalization. We determine that median polish summarization is responsible for generating a large proportion of these over-similarity artifacts. Furthermore, we show that most affected probesets show also internal signal disagreement, and tend to be composed by individual probes hitting different gene transcripts. We finally provide a correction to the RMA/GCRMA summarization procedure that massively reduces inter-array correlation artifacts, without affecting the detection of differentially expressed genes.Conclusions: We propose tRMA as a modification of RMA to normalize microarray experiments for correlation-based analysis. © 2010 Giorgi et al; licensee BioMed Central Ltd.

Giorgi, F.M., Bolger, A.M., Lohse, M., Usadel, B. (2010). Algorithm-driven Artifacts in median polish summarization of Microarray data. BMC BIOINFORMATICS, 11, 1-12 [10.1186/1471-2105-11-553].

Algorithm-driven Artifacts in median polish summarization of Microarray data

Giorgi, Federico M.;
2010

Abstract

Background: High-throughput measurement of transcript intensities using Affymetrix type oligonucleotide microarrays has produced a massive quantity of data during the last decade. Different preprocessing techniques exist to convert the raw signal intensities measured by these chips into gene expression estimates. Although these techniques have been widely benchmarked in the context of differential gene expression analysis, there are only few examples where their performance has been assessed in respect to coexpression-based studies such as sample classification.Results: In the present paper we benchmark the three most used normalization procedures (MAS5, RMA and GCRMA) in the context of inter-array correlation analysis, confirming and extending the finding that RMA and GCRMA consistently overestimate sample similarity upon normalization. We determine that median polish summarization is responsible for generating a large proportion of these over-similarity artifacts. Furthermore, we show that most affected probesets show also internal signal disagreement, and tend to be composed by individual probes hitting different gene transcripts. We finally provide a correction to the RMA/GCRMA summarization procedure that massively reduces inter-array correlation artifacts, without affecting the detection of differentially expressed genes.Conclusions: We propose tRMA as a modification of RMA to normalize microarray experiments for correlation-based analysis. © 2010 Giorgi et al; licensee BioMed Central Ltd.
2010
Giorgi, F.M., Bolger, A.M., Lohse, M., Usadel, B. (2010). Algorithm-driven Artifacts in median polish summarization of Microarray data. BMC BIOINFORMATICS, 11, 1-12 [10.1186/1471-2105-11-553].
Giorgi, Federico M.; Bolger, Anthony M.; Lohse, Marc; Usadel, Bjoern*
File in questo prodotto:
File Dimensione Formato  
1471-2105-11-553.pdf

accesso aperto

Tipo: Versione (PDF) editoriale
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione 2.59 MB
Formato Adobe PDF
2.59 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/657618
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 45
  • ???jsp.display-item.citation.isi??? 42
social impact