Cargando…

Improving the statistical detection of regulated genes from microarray data using intensity-based variance estimation

BACKGROUND: Gene microarray technology provides the ability to study the regulation of thousands of genes simultaneously, but its potential is limited without an estimate of the statistical significance of the observed changes in gene expression. Due to the large number of genes being tested and the...

Descripción completa

Detalles Bibliográficos
Autores principales: Comander, Jason, Natarajan, Sripriya, Gimbrone, Michael A, García-Cardeña, Guillermo
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC400250/
https://www.ncbi.nlm.nih.gov/pubmed/15113402
http://dx.doi.org/10.1186/1471-2164-5-17
_version_ 1782121345047330816
author Comander, Jason
Natarajan, Sripriya
Gimbrone, Michael A
García-Cardeña, Guillermo
author_facet Comander, Jason
Natarajan, Sripriya
Gimbrone, Michael A
García-Cardeña, Guillermo
author_sort Comander, Jason
collection PubMed
description BACKGROUND: Gene microarray technology provides the ability to study the regulation of thousands of genes simultaneously, but its potential is limited without an estimate of the statistical significance of the observed changes in gene expression. Due to the large number of genes being tested and the comparatively small number of array replicates (e.g., N = 3), standard statistical methods such as the Student's t-test fail to produce reliable results. Two other statistical approaches commonly used to improve significance estimates are a penalized t-test and a Z-test using intensity-dependent variance estimates. RESULTS: The performance of these approaches is compared using a dataset of 23 replicates, and a new implementation of the Z-test is introduced that pools together variance estimates of genes with similar minimum intensity. Significance estimates based on 3 replicate arrays are calculated using each statistical technique, and their accuracy is evaluated by comparing them to a reliable estimate based on the remaining 20 replicates. The reproducibility of each test statistic is evaluated by applying it to multiple, independent sets of 3 replicate arrays. Two implementations of a Z-test using intensity-dependent variance produce more reproducible results than two implementations of a penalized t-test. Furthermore, the minimum intensity-based Z-statistic demonstrates higher accuracy and higher or equal precision than all other statistical techniques tested. CONCLUSION: An intensity-based variance estimation technique provides one simple, effective approach that can improve p-value estimates for differentially regulated genes derived from replicated microarray datasets. Implementations of the Z-test algorithms are available at .
format Text
id pubmed-400250
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4002502004-04-30 Improving the statistical detection of regulated genes from microarray data using intensity-based variance estimation Comander, Jason Natarajan, Sripriya Gimbrone, Michael A García-Cardeña, Guillermo BMC Genomics Methodology Article BACKGROUND: Gene microarray technology provides the ability to study the regulation of thousands of genes simultaneously, but its potential is limited without an estimate of the statistical significance of the observed changes in gene expression. Due to the large number of genes being tested and the comparatively small number of array replicates (e.g., N = 3), standard statistical methods such as the Student's t-test fail to produce reliable results. Two other statistical approaches commonly used to improve significance estimates are a penalized t-test and a Z-test using intensity-dependent variance estimates. RESULTS: The performance of these approaches is compared using a dataset of 23 replicates, and a new implementation of the Z-test is introduced that pools together variance estimates of genes with similar minimum intensity. Significance estimates based on 3 replicate arrays are calculated using each statistical technique, and their accuracy is evaluated by comparing them to a reliable estimate based on the remaining 20 replicates. The reproducibility of each test statistic is evaluated by applying it to multiple, independent sets of 3 replicate arrays. Two implementations of a Z-test using intensity-dependent variance produce more reproducible results than two implementations of a penalized t-test. Furthermore, the minimum intensity-based Z-statistic demonstrates higher accuracy and higher or equal precision than all other statistical techniques tested. CONCLUSION: An intensity-based variance estimation technique provides one simple, effective approach that can improve p-value estimates for differentially regulated genes derived from replicated microarray datasets. Implementations of the Z-test algorithms are available at . BioMed Central 2004-02-27 /pmc/articles/PMC400250/ /pubmed/15113402 http://dx.doi.org/10.1186/1471-2164-5-17 Text en Copyright © 2004 Comander et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
spellingShingle Methodology Article
Comander, Jason
Natarajan, Sripriya
Gimbrone, Michael A
García-Cardeña, Guillermo
Improving the statistical detection of regulated genes from microarray data using intensity-based variance estimation
title Improving the statistical detection of regulated genes from microarray data using intensity-based variance estimation
title_full Improving the statistical detection of regulated genes from microarray data using intensity-based variance estimation
title_fullStr Improving the statistical detection of regulated genes from microarray data using intensity-based variance estimation
title_full_unstemmed Improving the statistical detection of regulated genes from microarray data using intensity-based variance estimation
title_short Improving the statistical detection of regulated genes from microarray data using intensity-based variance estimation
title_sort improving the statistical detection of regulated genes from microarray data using intensity-based variance estimation
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC400250/
https://www.ncbi.nlm.nih.gov/pubmed/15113402
http://dx.doi.org/10.1186/1471-2164-5-17
work_keys_str_mv AT comanderjason improvingthestatisticaldetectionofregulatedgenesfrommicroarraydatausingintensitybasedvarianceestimation
AT natarajansripriya improvingthestatisticaldetectionofregulatedgenesfrommicroarraydatausingintensitybasedvarianceestimation
AT gimbronemichaela improvingthestatisticaldetectionofregulatedgenesfrommicroarraydatausingintensitybasedvarianceestimation
AT garciacardenaguillermo improvingthestatisticaldetectionofregulatedgenesfrommicroarraydatausingintensitybasedvarianceestimation