Cargando…

From hybridization theory to microarray data analysis: performance evaluation

BACKGROUND: Several preprocessing methods are available for the analysis of Affymetrix Genechips arrays. The most popular algorithms analyze the measured fluorescence intensities with statistical methods. Here we focus on a novel algorithm, AffyILM, available from Bioconductor, which relies on input...

Descripción completa

Detalles Bibliográficos
Autores principales: Berger, Fabrice, Carlon, Enrico
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3267830/
https://www.ncbi.nlm.nih.gov/pubmed/22136743
http://dx.doi.org/10.1186/1471-2105-12-464
_version_ 1782222330552909824
author Berger, Fabrice
Carlon, Enrico
author_facet Berger, Fabrice
Carlon, Enrico
author_sort Berger, Fabrice
collection PubMed
description BACKGROUND: Several preprocessing methods are available for the analysis of Affymetrix Genechips arrays. The most popular algorithms analyze the measured fluorescence intensities with statistical methods. Here we focus on a novel algorithm, AffyILM, available from Bioconductor, which relies on inputs from hybridization thermodynamics and uses an extended Langmuir isotherm model to compute transcript concentrations. These concentrations are then employed in the statistical analysis. We compared the performance of AffyILM and other traditional methods both in the old and in the newest generation of GeneChips. RESULTS: Tissue mixture and Latin Square datasets (provided by Affymetrix) were used to assess the performances of the differential expression analysis depending on the preprocessing strategy. A correlation analysis conducted on the tissue mixture data reveals that the median-polish algorithm allows to best summarize AffyILM concentrations computed at the probe-level. Those correlation results are equivalent to the best correlations observed using popular preprocessing methods relying on intensity values. The performances of each tested preprocessing algorithm were quantified using the Latin Square HG-U133A dataset, thanks to the comparison of differential analysis results with the list of spiked genes. The figures of merit generated illustrates that the performances associated to AffyILM(medianpolish), inferred from the present statistical analysis, are comparable to the best performing strategies previously reported. CONCLUSIONS: Converting probe intensities to estimates of target concentrations prior to the statistical analysis, AffyILM(medianpolish) is one of the best performing strategy currently available. Using hybridization theory, probe-level estimates of target concentrations should be identically distributed. In the future, a probe-level multivariate analysis of the concentrations should be compared to the univariate analysis of probe-set summarized expression data.
format Online
Article
Text
id pubmed-3267830
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32678302012-01-28 From hybridization theory to microarray data analysis: performance evaluation Berger, Fabrice Carlon, Enrico BMC Bioinformatics Research Article BACKGROUND: Several preprocessing methods are available for the analysis of Affymetrix Genechips arrays. The most popular algorithms analyze the measured fluorescence intensities with statistical methods. Here we focus on a novel algorithm, AffyILM, available from Bioconductor, which relies on inputs from hybridization thermodynamics and uses an extended Langmuir isotherm model to compute transcript concentrations. These concentrations are then employed in the statistical analysis. We compared the performance of AffyILM and other traditional methods both in the old and in the newest generation of GeneChips. RESULTS: Tissue mixture and Latin Square datasets (provided by Affymetrix) were used to assess the performances of the differential expression analysis depending on the preprocessing strategy. A correlation analysis conducted on the tissue mixture data reveals that the median-polish algorithm allows to best summarize AffyILM concentrations computed at the probe-level. Those correlation results are equivalent to the best correlations observed using popular preprocessing methods relying on intensity values. The performances of each tested preprocessing algorithm were quantified using the Latin Square HG-U133A dataset, thanks to the comparison of differential analysis results with the list of spiked genes. The figures of merit generated illustrates that the performances associated to AffyILM(medianpolish), inferred from the present statistical analysis, are comparable to the best performing strategies previously reported. CONCLUSIONS: Converting probe intensities to estimates of target concentrations prior to the statistical analysis, AffyILM(medianpolish) is one of the best performing strategy currently available. Using hybridization theory, probe-level estimates of target concentrations should be identically distributed. In the future, a probe-level multivariate analysis of the concentrations should be compared to the univariate analysis of probe-set summarized expression data. BioMed Central 2011-12-02 /pmc/articles/PMC3267830/ /pubmed/22136743 http://dx.doi.org/10.1186/1471-2105-12-464 Text en Copyright ©2011 Berger and Carlon; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Berger, Fabrice
Carlon, Enrico
From hybridization theory to microarray data analysis: performance evaluation
title From hybridization theory to microarray data analysis: performance evaluation
title_full From hybridization theory to microarray data analysis: performance evaluation
title_fullStr From hybridization theory to microarray data analysis: performance evaluation
title_full_unstemmed From hybridization theory to microarray data analysis: performance evaluation
title_short From hybridization theory to microarray data analysis: performance evaluation
title_sort from hybridization theory to microarray data analysis: performance evaluation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3267830/
https://www.ncbi.nlm.nih.gov/pubmed/22136743
http://dx.doi.org/10.1186/1471-2105-12-464
work_keys_str_mv AT bergerfabrice fromhybridizationtheorytomicroarraydataanalysisperformanceevaluation
AT carlonenrico fromhybridizationtheorytomicroarraydataanalysisperformanceevaluation