Cargando…
Assessing Differential Expression in Two-Color Microarrays: A Resampling-Based Empirical Bayes Approach
Microarrays are widely used for examining differential gene expression, identifying single nucleotide polymorphisms, and detecting methylation loci. Multiple testing methods in microarray data analysis aim at controlling both Type I and Type II error rates; however, real microarray data do not alway...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3842292/ https://www.ncbi.nlm.nih.gov/pubmed/24312198 http://dx.doi.org/10.1371/journal.pone.0080099 |
_version_ | 1782292912707469312 |
---|---|
author | Li, Dongmei Le Pape, Marc A. Parikh, Nisha I. Chen, Will X. Dye, Timothy D. |
author_facet | Li, Dongmei Le Pape, Marc A. Parikh, Nisha I. Chen, Will X. Dye, Timothy D. |
author_sort | Li, Dongmei |
collection | PubMed |
description | Microarrays are widely used for examining differential gene expression, identifying single nucleotide polymorphisms, and detecting methylation loci. Multiple testing methods in microarray data analysis aim at controlling both Type I and Type II error rates; however, real microarray data do not always fit their distribution assumptions. Smyth's ubiquitous parametric method, for example, inadequately accommodates violations of normality assumptions, resulting in inflated Type I error rates. The Significance Analysis of Microarrays, another widely used microarray data analysis method, is based on a permutation test and is robust to non-normally distributed data; however, the Significance Analysis of Microarrays method fold change criteria are problematic, and can critically alter the conclusion of a study, as a result of compositional changes of the control data set in the analysis. We propose a novel approach, combining resampling with empirical Bayes methods: the Resampling-based empirical Bayes Methods. This approach not only reduces false discovery rates for non-normally distributed microarray data, but it is also impervious to fold change threshold since no control data set selection is needed. Through simulation studies, sensitivities, specificities, total rejections, and false discovery rates are compared across the Smyth's parametric method, the Significance Analysis of Microarrays, and the Resampling-based empirical Bayes Methods. Differences in false discovery rates controls between each approach are illustrated through a preterm delivery methylation study. The results show that the Resampling-based empirical Bayes Methods offer significantly higher specificity and lower false discovery rates compared to Smyth's parametric method when data are not normally distributed. The Resampling-based empirical Bayes Methods also offers higher statistical power than the Significance Analysis of Microarrays method when the proportion of significantly differentially expressed genes is large for both normally and non-normally distributed data. Finally, the Resampling-based empirical Bayes Methods are generalizable to next generation sequencing RNA-seq data analysis. |
format | Online Article Text |
id | pubmed-3842292 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-38422922013-12-05 Assessing Differential Expression in Two-Color Microarrays: A Resampling-Based Empirical Bayes Approach Li, Dongmei Le Pape, Marc A. Parikh, Nisha I. Chen, Will X. Dye, Timothy D. PLoS One Research Article Microarrays are widely used for examining differential gene expression, identifying single nucleotide polymorphisms, and detecting methylation loci. Multiple testing methods in microarray data analysis aim at controlling both Type I and Type II error rates; however, real microarray data do not always fit their distribution assumptions. Smyth's ubiquitous parametric method, for example, inadequately accommodates violations of normality assumptions, resulting in inflated Type I error rates. The Significance Analysis of Microarrays, another widely used microarray data analysis method, is based on a permutation test and is robust to non-normally distributed data; however, the Significance Analysis of Microarrays method fold change criteria are problematic, and can critically alter the conclusion of a study, as a result of compositional changes of the control data set in the analysis. We propose a novel approach, combining resampling with empirical Bayes methods: the Resampling-based empirical Bayes Methods. This approach not only reduces false discovery rates for non-normally distributed microarray data, but it is also impervious to fold change threshold since no control data set selection is needed. Through simulation studies, sensitivities, specificities, total rejections, and false discovery rates are compared across the Smyth's parametric method, the Significance Analysis of Microarrays, and the Resampling-based empirical Bayes Methods. Differences in false discovery rates controls between each approach are illustrated through a preterm delivery methylation study. The results show that the Resampling-based empirical Bayes Methods offer significantly higher specificity and lower false discovery rates compared to Smyth's parametric method when data are not normally distributed. The Resampling-based empirical Bayes Methods also offers higher statistical power than the Significance Analysis of Microarrays method when the proportion of significantly differentially expressed genes is large for both normally and non-normally distributed data. Finally, the Resampling-based empirical Bayes Methods are generalizable to next generation sequencing RNA-seq data analysis. Public Library of Science 2013-11-27 /pmc/articles/PMC3842292/ /pubmed/24312198 http://dx.doi.org/10.1371/journal.pone.0080099 Text en © 2013 Li et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Li, Dongmei Le Pape, Marc A. Parikh, Nisha I. Chen, Will X. Dye, Timothy D. Assessing Differential Expression in Two-Color Microarrays: A Resampling-Based Empirical Bayes Approach |
title | Assessing Differential Expression in Two-Color Microarrays: A Resampling-Based Empirical Bayes Approach |
title_full | Assessing Differential Expression in Two-Color Microarrays: A Resampling-Based Empirical Bayes Approach |
title_fullStr | Assessing Differential Expression in Two-Color Microarrays: A Resampling-Based Empirical Bayes Approach |
title_full_unstemmed | Assessing Differential Expression in Two-Color Microarrays: A Resampling-Based Empirical Bayes Approach |
title_short | Assessing Differential Expression in Two-Color Microarrays: A Resampling-Based Empirical Bayes Approach |
title_sort | assessing differential expression in two-color microarrays: a resampling-based empirical bayes approach |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3842292/ https://www.ncbi.nlm.nih.gov/pubmed/24312198 http://dx.doi.org/10.1371/journal.pone.0080099 |
work_keys_str_mv | AT lidongmei assessingdifferentialexpressionintwocolormicroarraysaresamplingbasedempiricalbayesapproach AT lepapemarca assessingdifferentialexpressionintwocolormicroarraysaresamplingbasedempiricalbayesapproach AT parikhnishai assessingdifferentialexpressionintwocolormicroarraysaresamplingbasedempiricalbayesapproach AT chenwillx assessingdifferentialexpressionintwocolormicroarraysaresamplingbasedempiricalbayesapproach AT dyetimothyd assessingdifferentialexpressionintwocolormicroarraysaresamplingbasedempiricalbayesapproach |