Cargando…
On the necessity of different statistical treatment for Illumina BeadChip and Affymetrix GeneChip data and its significance for biological interpretation
BACKGROUND: The original spotted array technology with competitive hybridization of two experimental samples and measuring relative expression levels is increasingly displaced by more accurate platforms that allow determining absolute expression values for a single sample (for example, Affymetrix Ge...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2453111/ https://www.ncbi.nlm.nih.gov/pubmed/18522715 http://dx.doi.org/10.1186/1745-6150-3-23 |
_version_ | 1782157345481031680 |
---|---|
author | Wong, Wing-Cheong Loh, Marie Eisenhaber, Frank |
author_facet | Wong, Wing-Cheong Loh, Marie Eisenhaber, Frank |
author_sort | Wong, Wing-Cheong |
collection | PubMed |
description | BACKGROUND: The original spotted array technology with competitive hybridization of two experimental samples and measuring relative expression levels is increasingly displaced by more accurate platforms that allow determining absolute expression values for a single sample (for example, Affymetrix GeneChip and Illumina BeadChip). Unfortunately, cross-platform comparisons show a disappointingly low concordance between lists of regulated genes between the latter two platforms. RESULTS: Whereas expression values determined with a single Affymetrix GeneChip represent single measurements, the expression results obtained with Illumina BeadChip are essentially statistical means from several dozens of identical probes. In the case of multiple technical replicates, the data require, therefore, different stistical treatment depending on the platform. The key is the computation of the squared standard deviation within replicates in the case of the Illumina data as weighted mean of the square of the standard deviations of the individual experiments. With an Illumina spike experiment, we demonstrate dramatically improved significance of spiked genes over all relevant concentration ranges. The re-evaluation of two published Illumina datasets (membrane type-1 matrix metalloproteinase expression in mammary epithelial cells by Golubkov et al. Cancer Research (2006) 66, 10460; spermatogenesis in normal and teratozoospermic men, Platts et al. Human Molecular Genetics (2007) 16, 763) significantly identified more biologically relevant genes as transcriptionally regulated targets and, thus, additional biological pathways involved. CONCLUSION: The results in this work show that it is important to process Illumina BeadChip data in a modified statistical procedure and to compute the standard deviation in experiments with technical replicates from the standard errors of individual BeadChips. This change leads also to an improved concordance with Affymetrix GeneChip results as the spermatogenesis dataset re-evaluation demonstrates. REVIEWERS: This article was reviewed by I. King Jordan, Mark J. Dunning and Shamil Sunyaev. |
format | Text |
id | pubmed-2453111 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-24531112008-07-11 On the necessity of different statistical treatment for Illumina BeadChip and Affymetrix GeneChip data and its significance for biological interpretation Wong, Wing-Cheong Loh, Marie Eisenhaber, Frank Biol Direct Research BACKGROUND: The original spotted array technology with competitive hybridization of two experimental samples and measuring relative expression levels is increasingly displaced by more accurate platforms that allow determining absolute expression values for a single sample (for example, Affymetrix GeneChip and Illumina BeadChip). Unfortunately, cross-platform comparisons show a disappointingly low concordance between lists of regulated genes between the latter two platforms. RESULTS: Whereas expression values determined with a single Affymetrix GeneChip represent single measurements, the expression results obtained with Illumina BeadChip are essentially statistical means from several dozens of identical probes. In the case of multiple technical replicates, the data require, therefore, different stistical treatment depending on the platform. The key is the computation of the squared standard deviation within replicates in the case of the Illumina data as weighted mean of the square of the standard deviations of the individual experiments. With an Illumina spike experiment, we demonstrate dramatically improved significance of spiked genes over all relevant concentration ranges. The re-evaluation of two published Illumina datasets (membrane type-1 matrix metalloproteinase expression in mammary epithelial cells by Golubkov et al. Cancer Research (2006) 66, 10460; spermatogenesis in normal and teratozoospermic men, Platts et al. Human Molecular Genetics (2007) 16, 763) significantly identified more biologically relevant genes as transcriptionally regulated targets and, thus, additional biological pathways involved. CONCLUSION: The results in this work show that it is important to process Illumina BeadChip data in a modified statistical procedure and to compute the standard deviation in experiments with technical replicates from the standard errors of individual BeadChips. This change leads also to an improved concordance with Affymetrix GeneChip results as the spermatogenesis dataset re-evaluation demonstrates. REVIEWERS: This article was reviewed by I. King Jordan, Mark J. Dunning and Shamil Sunyaev. BioMed Central 2008-06-03 /pmc/articles/PMC2453111/ /pubmed/18522715 http://dx.doi.org/10.1186/1745-6150-3-23 Text en Copyright © 2008 Wong et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Wong, Wing-Cheong Loh, Marie Eisenhaber, Frank On the necessity of different statistical treatment for Illumina BeadChip and Affymetrix GeneChip data and its significance for biological interpretation |
title | On the necessity of different statistical treatment for Illumina BeadChip and Affymetrix GeneChip data and its significance for biological interpretation |
title_full | On the necessity of different statistical treatment for Illumina BeadChip and Affymetrix GeneChip data and its significance for biological interpretation |
title_fullStr | On the necessity of different statistical treatment for Illumina BeadChip and Affymetrix GeneChip data and its significance for biological interpretation |
title_full_unstemmed | On the necessity of different statistical treatment for Illumina BeadChip and Affymetrix GeneChip data and its significance for biological interpretation |
title_short | On the necessity of different statistical treatment for Illumina BeadChip and Affymetrix GeneChip data and its significance for biological interpretation |
title_sort | on the necessity of different statistical treatment for illumina beadchip and affymetrix genechip data and its significance for biological interpretation |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2453111/ https://www.ncbi.nlm.nih.gov/pubmed/18522715 http://dx.doi.org/10.1186/1745-6150-3-23 |
work_keys_str_mv | AT wongwingcheong onthenecessityofdifferentstatisticaltreatmentforilluminabeadchipandaffymetrixgenechipdataanditssignificanceforbiologicalinterpretation AT lohmarie onthenecessityofdifferentstatisticaltreatmentforilluminabeadchipandaffymetrixgenechipdataanditssignificanceforbiologicalinterpretation AT eisenhaberfrank onthenecessityofdifferentstatisticaltreatmentforilluminabeadchipandaffymetrixgenechipdataanditssignificanceforbiologicalinterpretation |