Cargando…

Model-based variance-stabilizing transformation for Illumina microarray data

Variance stabilization is a step in the preprocessing of microarray data that can greatly benefit the performance of subsequent statistical modeling and inference. Due to the often limited number of technical replicates for Affymetrix and cDNA arrays, achieving variance stabilization can be difficul...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lin, Simon M., Du, Pan, Huber, Wolfgang, Kibbe, Warren A.
Formato:	Texto
Lenguaje:	English
Publicado:	Oxford University Press 2008
Materias:	Methods Online
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2241869/ https://www.ncbi.nlm.nih.gov/pubmed/18178591 http://dx.doi.org/10.1093/nar/gkm1075

_version_	1782150545493983232
author	Lin, Simon M. Du, Pan Huber, Wolfgang Kibbe, Warren A.
author_facet	Lin, Simon M. Du, Pan Huber, Wolfgang Kibbe, Warren A.
author_sort	Lin, Simon M.
collection	PubMed
description	Variance stabilization is a step in the preprocessing of microarray data that can greatly benefit the performance of subsequent statistical modeling and inference. Due to the often limited number of technical replicates for Affymetrix and cDNA arrays, achieving variance stabilization can be difficult. Although the Illumina microarray platform provides a larger number of technical replicates on each array (usually over 30 randomly distributed beads per probe), these replicates have not been leveraged in the current log2 data transformation process. We devised a variance-stabilizing transformation (VST) method that takes advantage of the technical replicates available on an Illumina microarray. We have compared VST with log2 and Variance-stabilizing normalization (VSN) by using the Kruglyak bead-level data (2006) and Barnes titration data (2005). The results of the Kruglyak data suggest that VST stabilizes variances of bead-replicates within an array. The results of the Barnes data show that VST can improve the detection of differentially expressed genes and reduce false-positive identifications. We conclude that although both VST and VSN are built upon the same model of measurement noise, VST stabilizes the variance better and more efficiently for the Illumina platform by leveraging the availability of a larger number of within-array replicates. The algorithms and Supplementary Data are included in the lumi package of Bioconductor, available at: www.bioconductor.org.
format	Text
id	pubmed-2241869
institution	National Center for Biotechnology Information
language	English
publishDate	2008
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-22418692008-02-21 Model-based variance-stabilizing transformation for Illumina microarray data Lin, Simon M. Du, Pan Huber, Wolfgang Kibbe, Warren A. Nucleic Acids Res Methods Online Variance stabilization is a step in the preprocessing of microarray data that can greatly benefit the performance of subsequent statistical modeling and inference. Due to the often limited number of technical replicates for Affymetrix and cDNA arrays, achieving variance stabilization can be difficult. Although the Illumina microarray platform provides a larger number of technical replicates on each array (usually over 30 randomly distributed beads per probe), these replicates have not been leveraged in the current log2 data transformation process. We devised a variance-stabilizing transformation (VST) method that takes advantage of the technical replicates available on an Illumina microarray. We have compared VST with log2 and Variance-stabilizing normalization (VSN) by using the Kruglyak bead-level data (2006) and Barnes titration data (2005). The results of the Kruglyak data suggest that VST stabilizes variances of bead-replicates within an array. The results of the Barnes data show that VST can improve the detection of differentially expressed genes and reduce false-positive identifications. We conclude that although both VST and VSN are built upon the same model of measurement noise, VST stabilizes the variance better and more efficiently for the Illumina platform by leveraging the availability of a larger number of within-array replicates. The algorithms and Supplementary Data are included in the lumi package of Bioconductor, available at: www.bioconductor.org. Oxford University Press 2008-02 2008-01-04 /pmc/articles/PMC2241869/ /pubmed/18178591 http://dx.doi.org/10.1093/nar/gkm1075 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Methods Online Lin, Simon M. Du, Pan Huber, Wolfgang Kibbe, Warren A. Model-based variance-stabilizing transformation for Illumina microarray data
title	Model-based variance-stabilizing transformation for Illumina microarray data
title_full	Model-based variance-stabilizing transformation for Illumina microarray data
title_fullStr	Model-based variance-stabilizing transformation for Illumina microarray data
title_full_unstemmed	Model-based variance-stabilizing transformation for Illumina microarray data
title_short	Model-based variance-stabilizing transformation for Illumina microarray data
title_sort	model-based variance-stabilizing transformation for illumina microarray data
topic	Methods Online
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2241869/ https://www.ncbi.nlm.nih.gov/pubmed/18178591 http://dx.doi.org/10.1093/nar/gkm1075
work_keys_str_mv	AT linsimonm modelbasedvariancestabilizingtransformationforilluminamicroarraydata AT dupan modelbasedvariancestabilizingtransformationforilluminamicroarraydata AT huberwolfgang modelbasedvariancestabilizingtransformationforilluminamicroarraydata AT kibbewarrena modelbasedvariancestabilizingtransformationforilluminamicroarraydata

Model-based variance-stabilizing transformation for Illumina microarray data

Ejemplares similares