Cargando…

On the necessity of different statistical treatment for Illumina BeadChip and Affymetrix GeneChip data and its significance for biological interpretation

BACKGROUND: The original spotted array technology with competitive hybridization of two experimental samples and measuring relative expression levels is increasingly displaced by more accurate platforms that allow determining absolute expression values for a single sample (for example, Affymetrix Ge...

Descripción completa

Detalles Bibliográficos
Autores principales: Wong, Wing-Cheong, Loh, Marie, Eisenhaber, Frank
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2453111/
https://www.ncbi.nlm.nih.gov/pubmed/18522715
http://dx.doi.org/10.1186/1745-6150-3-23
_version_ 1782157345481031680
author Wong, Wing-Cheong
Loh, Marie
Eisenhaber, Frank
author_facet Wong, Wing-Cheong
Loh, Marie
Eisenhaber, Frank
author_sort Wong, Wing-Cheong
collection PubMed
description BACKGROUND: The original spotted array technology with competitive hybridization of two experimental samples and measuring relative expression levels is increasingly displaced by more accurate platforms that allow determining absolute expression values for a single sample (for example, Affymetrix GeneChip and Illumina BeadChip). Unfortunately, cross-platform comparisons show a disappointingly low concordance between lists of regulated genes between the latter two platforms. RESULTS: Whereas expression values determined with a single Affymetrix GeneChip represent single measurements, the expression results obtained with Illumina BeadChip are essentially statistical means from several dozens of identical probes. In the case of multiple technical replicates, the data require, therefore, different stistical treatment depending on the platform. The key is the computation of the squared standard deviation within replicates in the case of the Illumina data as weighted mean of the square of the standard deviations of the individual experiments. With an Illumina spike experiment, we demonstrate dramatically improved significance of spiked genes over all relevant concentration ranges. The re-evaluation of two published Illumina datasets (membrane type-1 matrix metalloproteinase expression in mammary epithelial cells by Golubkov et al. Cancer Research (2006) 66, 10460; spermatogenesis in normal and teratozoospermic men, Platts et al. Human Molecular Genetics (2007) 16, 763) significantly identified more biologically relevant genes as transcriptionally regulated targets and, thus, additional biological pathways involved. CONCLUSION: The results in this work show that it is important to process Illumina BeadChip data in a modified statistical procedure and to compute the standard deviation in experiments with technical replicates from the standard errors of individual BeadChips. This change leads also to an improved concordance with Affymetrix GeneChip results as the spermatogenesis dataset re-evaluation demonstrates. REVIEWERS: This article was reviewed by I. King Jordan, Mark J. Dunning and Shamil Sunyaev.
format Text
id pubmed-2453111
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-24531112008-07-11 On the necessity of different statistical treatment for Illumina BeadChip and Affymetrix GeneChip data and its significance for biological interpretation Wong, Wing-Cheong Loh, Marie Eisenhaber, Frank Biol Direct Research BACKGROUND: The original spotted array technology with competitive hybridization of two experimental samples and measuring relative expression levels is increasingly displaced by more accurate platforms that allow determining absolute expression values for a single sample (for example, Affymetrix GeneChip and Illumina BeadChip). Unfortunately, cross-platform comparisons show a disappointingly low concordance between lists of regulated genes between the latter two platforms. RESULTS: Whereas expression values determined with a single Affymetrix GeneChip represent single measurements, the expression results obtained with Illumina BeadChip are essentially statistical means from several dozens of identical probes. In the case of multiple technical replicates, the data require, therefore, different stistical treatment depending on the platform. The key is the computation of the squared standard deviation within replicates in the case of the Illumina data as weighted mean of the square of the standard deviations of the individual experiments. With an Illumina spike experiment, we demonstrate dramatically improved significance of spiked genes over all relevant concentration ranges. The re-evaluation of two published Illumina datasets (membrane type-1 matrix metalloproteinase expression in mammary epithelial cells by Golubkov et al. Cancer Research (2006) 66, 10460; spermatogenesis in normal and teratozoospermic men, Platts et al. Human Molecular Genetics (2007) 16, 763) significantly identified more biologically relevant genes as transcriptionally regulated targets and, thus, additional biological pathways involved. CONCLUSION: The results in this work show that it is important to process Illumina BeadChip data in a modified statistical procedure and to compute the standard deviation in experiments with technical replicates from the standard errors of individual BeadChips. This change leads also to an improved concordance with Affymetrix GeneChip results as the spermatogenesis dataset re-evaluation demonstrates. REVIEWERS: This article was reviewed by I. King Jordan, Mark J. Dunning and Shamil Sunyaev. BioMed Central 2008-06-03 /pmc/articles/PMC2453111/ /pubmed/18522715 http://dx.doi.org/10.1186/1745-6150-3-23 Text en Copyright © 2008 Wong et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Wong, Wing-Cheong
Loh, Marie
Eisenhaber, Frank
On the necessity of different statistical treatment for Illumina BeadChip and Affymetrix GeneChip data and its significance for biological interpretation
title On the necessity of different statistical treatment for Illumina BeadChip and Affymetrix GeneChip data and its significance for biological interpretation
title_full On the necessity of different statistical treatment for Illumina BeadChip and Affymetrix GeneChip data and its significance for biological interpretation
title_fullStr On the necessity of different statistical treatment for Illumina BeadChip and Affymetrix GeneChip data and its significance for biological interpretation
title_full_unstemmed On the necessity of different statistical treatment for Illumina BeadChip and Affymetrix GeneChip data and its significance for biological interpretation
title_short On the necessity of different statistical treatment for Illumina BeadChip and Affymetrix GeneChip data and its significance for biological interpretation
title_sort on the necessity of different statistical treatment for illumina beadchip and affymetrix genechip data and its significance for biological interpretation
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2453111/
https://www.ncbi.nlm.nih.gov/pubmed/18522715
http://dx.doi.org/10.1186/1745-6150-3-23
work_keys_str_mv AT wongwingcheong onthenecessityofdifferentstatisticaltreatmentforilluminabeadchipandaffymetrixgenechipdataanditssignificanceforbiologicalinterpretation
AT lohmarie onthenecessityofdifferentstatisticaltreatmentforilluminabeadchipandaffymetrixgenechipdataanditssignificanceforbiologicalinterpretation
AT eisenhaberfrank onthenecessityofdifferentstatisticaltreatmentforilluminabeadchipandaffymetrixgenechipdataanditssignificanceforbiologicalinterpretation