Cargando…
From hybridization theory to microarray data analysis: performance evaluation
BACKGROUND: Several preprocessing methods are available for the analysis of Affymetrix Genechips arrays. The most popular algorithms analyze the measured fluorescence intensities with statistical methods. Here we focus on a novel algorithm, AffyILM, available from Bioconductor, which relies on input...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3267830/ https://www.ncbi.nlm.nih.gov/pubmed/22136743 http://dx.doi.org/10.1186/1471-2105-12-464 |
_version_ | 1782222330552909824 |
---|---|
author | Berger, Fabrice Carlon, Enrico |
author_facet | Berger, Fabrice Carlon, Enrico |
author_sort | Berger, Fabrice |
collection | PubMed |
description | BACKGROUND: Several preprocessing methods are available for the analysis of Affymetrix Genechips arrays. The most popular algorithms analyze the measured fluorescence intensities with statistical methods. Here we focus on a novel algorithm, AffyILM, available from Bioconductor, which relies on inputs from hybridization thermodynamics and uses an extended Langmuir isotherm model to compute transcript concentrations. These concentrations are then employed in the statistical analysis. We compared the performance of AffyILM and other traditional methods both in the old and in the newest generation of GeneChips. RESULTS: Tissue mixture and Latin Square datasets (provided by Affymetrix) were used to assess the performances of the differential expression analysis depending on the preprocessing strategy. A correlation analysis conducted on the tissue mixture data reveals that the median-polish algorithm allows to best summarize AffyILM concentrations computed at the probe-level. Those correlation results are equivalent to the best correlations observed using popular preprocessing methods relying on intensity values. The performances of each tested preprocessing algorithm were quantified using the Latin Square HG-U133A dataset, thanks to the comparison of differential analysis results with the list of spiked genes. The figures of merit generated illustrates that the performances associated to AffyILM(medianpolish), inferred from the present statistical analysis, are comparable to the best performing strategies previously reported. CONCLUSIONS: Converting probe intensities to estimates of target concentrations prior to the statistical analysis, AffyILM(medianpolish) is one of the best performing strategy currently available. Using hybridization theory, probe-level estimates of target concentrations should be identically distributed. In the future, a probe-level multivariate analysis of the concentrations should be compared to the univariate analysis of probe-set summarized expression data. |
format | Online Article Text |
id | pubmed-3267830 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32678302012-01-28 From hybridization theory to microarray data analysis: performance evaluation Berger, Fabrice Carlon, Enrico BMC Bioinformatics Research Article BACKGROUND: Several preprocessing methods are available for the analysis of Affymetrix Genechips arrays. The most popular algorithms analyze the measured fluorescence intensities with statistical methods. Here we focus on a novel algorithm, AffyILM, available from Bioconductor, which relies on inputs from hybridization thermodynamics and uses an extended Langmuir isotherm model to compute transcript concentrations. These concentrations are then employed in the statistical analysis. We compared the performance of AffyILM and other traditional methods both in the old and in the newest generation of GeneChips. RESULTS: Tissue mixture and Latin Square datasets (provided by Affymetrix) were used to assess the performances of the differential expression analysis depending on the preprocessing strategy. A correlation analysis conducted on the tissue mixture data reveals that the median-polish algorithm allows to best summarize AffyILM concentrations computed at the probe-level. Those correlation results are equivalent to the best correlations observed using popular preprocessing methods relying on intensity values. The performances of each tested preprocessing algorithm were quantified using the Latin Square HG-U133A dataset, thanks to the comparison of differential analysis results with the list of spiked genes. The figures of merit generated illustrates that the performances associated to AffyILM(medianpolish), inferred from the present statistical analysis, are comparable to the best performing strategies previously reported. CONCLUSIONS: Converting probe intensities to estimates of target concentrations prior to the statistical analysis, AffyILM(medianpolish) is one of the best performing strategy currently available. Using hybridization theory, probe-level estimates of target concentrations should be identically distributed. In the future, a probe-level multivariate analysis of the concentrations should be compared to the univariate analysis of probe-set summarized expression data. BioMed Central 2011-12-02 /pmc/articles/PMC3267830/ /pubmed/22136743 http://dx.doi.org/10.1186/1471-2105-12-464 Text en Copyright ©2011 Berger and Carlon; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Berger, Fabrice Carlon, Enrico From hybridization theory to microarray data analysis: performance evaluation |
title | From hybridization theory to microarray data analysis: performance evaluation |
title_full | From hybridization theory to microarray data analysis: performance evaluation |
title_fullStr | From hybridization theory to microarray data analysis: performance evaluation |
title_full_unstemmed | From hybridization theory to microarray data analysis: performance evaluation |
title_short | From hybridization theory to microarray data analysis: performance evaluation |
title_sort | from hybridization theory to microarray data analysis: performance evaluation |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3267830/ https://www.ncbi.nlm.nih.gov/pubmed/22136743 http://dx.doi.org/10.1186/1471-2105-12-464 |
work_keys_str_mv | AT bergerfabrice fromhybridizationtheorytomicroarraydataanalysisperformanceevaluation AT carlonenrico fromhybridizationtheorytomicroarraydataanalysisperformanceevaluation |