Cargando…
Tests for differential gene expression using weights in oligonucleotide microarray experiments
BACKGROUND: Microarray data analysts commonly filter out genes based on a number of ad hoc criteria prior to any high-level statistical analysis. Such ad hoc approaches could lead to conflicting conclusions with no clear guidance as to which method is most likely to be reproducible. Furthermore, the...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1420292/ https://www.ncbi.nlm.nih.gov/pubmed/16504060 http://dx.doi.org/10.1186/1471-2164-7-33 |
_version_ | 1782127139121790976 |
---|---|
author | Hu, Pingzhao Beyene, Joseph Greenwood, Celia MT |
author_facet | Hu, Pingzhao Beyene, Joseph Greenwood, Celia MT |
author_sort | Hu, Pingzhao |
collection | PubMed |
description | BACKGROUND: Microarray data analysts commonly filter out genes based on a number of ad hoc criteria prior to any high-level statistical analysis. Such ad hoc approaches could lead to conflicting conclusions with no clear guidance as to which method is most likely to be reproducible. Furthermore, the number of tests performed with concomitant inflation in type I error also plagues the statistical analysis of microarray data, since the number of tested quantities in a study significantly affects the family-wise error rate. It would, therefore, be very useful to develop and adopt strategies that allow quantification of the quality of each probeset, to filter out or give little credence to low-quality or unexpressed probesets, and to incorporate these strategies into gene selection within a multiple testing framework. RESULTS: We have proposed a unified scheme for filtering and gene selection. For Affymetrix gene expression microarrays, we developed new methods for measuring the reliability of a particular probeset in a single array, and we used these to develop measures for a set of arrays. These measures are then used as weights in standard t-statistic calculations, and are incorporated into the multiple testing procedures. We demonstrated the advantages of our methods using simulated data, publicly available spiked-in data as well as data comparing normal muscle to muscle from patients with Duchenne muscular dystrophy (DMD), in which a set of truly differentially expressed genes is known. CONCLUSION: Our quality measures provide convenient ways to search for individual genes of high quality. The quality weighting strategies we proposed for testing differential gene expression have demonstrable improvement on the traditional filtering methods, the standard t-statistic and a regularized t-statistic in Affymetrix data analysis. |
format | Text |
id | pubmed-1420292 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-14202922006-04-14 Tests for differential gene expression using weights in oligonucleotide microarray experiments Hu, Pingzhao Beyene, Joseph Greenwood, Celia MT BMC Genomics Methodology Article BACKGROUND: Microarray data analysts commonly filter out genes based on a number of ad hoc criteria prior to any high-level statistical analysis. Such ad hoc approaches could lead to conflicting conclusions with no clear guidance as to which method is most likely to be reproducible. Furthermore, the number of tests performed with concomitant inflation in type I error also plagues the statistical analysis of microarray data, since the number of tested quantities in a study significantly affects the family-wise error rate. It would, therefore, be very useful to develop and adopt strategies that allow quantification of the quality of each probeset, to filter out or give little credence to low-quality or unexpressed probesets, and to incorporate these strategies into gene selection within a multiple testing framework. RESULTS: We have proposed a unified scheme for filtering and gene selection. For Affymetrix gene expression microarrays, we developed new methods for measuring the reliability of a particular probeset in a single array, and we used these to develop measures for a set of arrays. These measures are then used as weights in standard t-statistic calculations, and are incorporated into the multiple testing procedures. We demonstrated the advantages of our methods using simulated data, publicly available spiked-in data as well as data comparing normal muscle to muscle from patients with Duchenne muscular dystrophy (DMD), in which a set of truly differentially expressed genes is known. CONCLUSION: Our quality measures provide convenient ways to search for individual genes of high quality. The quality weighting strategies we proposed for testing differential gene expression have demonstrable improvement on the traditional filtering methods, the standard t-statistic and a regularized t-statistic in Affymetrix data analysis. BioMed Central 2006-02-22 /pmc/articles/PMC1420292/ /pubmed/16504060 http://dx.doi.org/10.1186/1471-2164-7-33 Text en Copyright © 2006 Hu et al; licensee BioMed Central Ltd. |
spellingShingle | Methodology Article Hu, Pingzhao Beyene, Joseph Greenwood, Celia MT Tests for differential gene expression using weights in oligonucleotide microarray experiments |
title | Tests for differential gene expression using weights in oligonucleotide microarray experiments |
title_full | Tests for differential gene expression using weights in oligonucleotide microarray experiments |
title_fullStr | Tests for differential gene expression using weights in oligonucleotide microarray experiments |
title_full_unstemmed | Tests for differential gene expression using weights in oligonucleotide microarray experiments |
title_short | Tests for differential gene expression using weights in oligonucleotide microarray experiments |
title_sort | tests for differential gene expression using weights in oligonucleotide microarray experiments |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1420292/ https://www.ncbi.nlm.nih.gov/pubmed/16504060 http://dx.doi.org/10.1186/1471-2164-7-33 |
work_keys_str_mv | AT hupingzhao testsfordifferentialgeneexpressionusingweightsinoligonucleotidemicroarrayexperiments AT beyenejoseph testsfordifferentialgeneexpressionusingweightsinoligonucleotidemicroarrayexperiments AT greenwoodceliamt testsfordifferentialgeneexpressionusingweightsinoligonucleotidemicroarrayexperiments |