Cargando…
A gene selection method for GeneChip array data with small sample sizes
BACKGROUND: In microarray experiments with small sample sizes, it is a challenge to estimate p-values accurately and decide cutoff p-values for gene selection appropriately. Although permutation-based methods have proved to have greater sensitivity and specificity than the regular t-test, their p-va...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287503/ https://www.ncbi.nlm.nih.gov/pubmed/22369149 http://dx.doi.org/10.1186/1471-2164-12-S5-S7 |
_version_ | 1782224678062915584 |
---|---|
author | Chen, Zhongxue Liu, Qingzhong McGee, Monnie Kong, Megan Huang, Xudong Deng, Youping Scheuermann, Richard H |
author_facet | Chen, Zhongxue Liu, Qingzhong McGee, Monnie Kong, Megan Huang, Xudong Deng, Youping Scheuermann, Richard H |
author_sort | Chen, Zhongxue |
collection | PubMed |
description | BACKGROUND: In microarray experiments with small sample sizes, it is a challenge to estimate p-values accurately and decide cutoff p-values for gene selection appropriately. Although permutation-based methods have proved to have greater sensitivity and specificity than the regular t-test, their p-values are highly discrete due to the limited number of permutations available in very small sample sizes. Furthermore, estimated permutation-based p-values for true nulls are highly correlated and not uniformly distributed between zero and one, making it difficult to use current false discovery rate (FDR)-controlling methods. RESULTS: We propose a model-based information sharing method (MBIS) that, after an appropriate data transformation, utilizes information shared among genes. We use a normal distribution to model the mean differences of true nulls across two experimental conditions. The parameters of the model are then estimated using all data in hand. Based on this model, p-values, which are uniformly distributed from true nulls, are calculated. Then, since FDR-controlling methods are generally not well suited to microarray data with very small sample sizes, we select genes for a given cutoff p-value and then estimate the false discovery rate. CONCLUSION: Simulation studies and analysis using real microarray data show that the proposed method, MBIS, is more powerful and reliable than current methods. It has wide application to a variety of situations. |
format | Online Article Text |
id | pubmed-3287503 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32875032012-03-01 A gene selection method for GeneChip array data with small sample sizes Chen, Zhongxue Liu, Qingzhong McGee, Monnie Kong, Megan Huang, Xudong Deng, Youping Scheuermann, Richard H BMC Genomics Research Article BACKGROUND: In microarray experiments with small sample sizes, it is a challenge to estimate p-values accurately and decide cutoff p-values for gene selection appropriately. Although permutation-based methods have proved to have greater sensitivity and specificity than the regular t-test, their p-values are highly discrete due to the limited number of permutations available in very small sample sizes. Furthermore, estimated permutation-based p-values for true nulls are highly correlated and not uniformly distributed between zero and one, making it difficult to use current false discovery rate (FDR)-controlling methods. RESULTS: We propose a model-based information sharing method (MBIS) that, after an appropriate data transformation, utilizes information shared among genes. We use a normal distribution to model the mean differences of true nulls across two experimental conditions. The parameters of the model are then estimated using all data in hand. Based on this model, p-values, which are uniformly distributed from true nulls, are calculated. Then, since FDR-controlling methods are generally not well suited to microarray data with very small sample sizes, we select genes for a given cutoff p-value and then estimate the false discovery rate. CONCLUSION: Simulation studies and analysis using real microarray data show that the proposed method, MBIS, is more powerful and reliable than current methods. It has wide application to a variety of situations. BioMed Central 2011-12-23 /pmc/articles/PMC3287503/ /pubmed/22369149 http://dx.doi.org/10.1186/1471-2164-12-S5-S7 Text en Copyright ©2011 Chen et al. licensee BioMed Central Ltd http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Chen, Zhongxue Liu, Qingzhong McGee, Monnie Kong, Megan Huang, Xudong Deng, Youping Scheuermann, Richard H A gene selection method for GeneChip array data with small sample sizes |
title | A gene selection method for GeneChip array data with small sample sizes |
title_full | A gene selection method for GeneChip array data with small sample sizes |
title_fullStr | A gene selection method for GeneChip array data with small sample sizes |
title_full_unstemmed | A gene selection method for GeneChip array data with small sample sizes |
title_short | A gene selection method for GeneChip array data with small sample sizes |
title_sort | gene selection method for genechip array data with small sample sizes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287503/ https://www.ncbi.nlm.nih.gov/pubmed/22369149 http://dx.doi.org/10.1186/1471-2164-12-S5-S7 |
work_keys_str_mv | AT chenzhongxue ageneselectionmethodforgenechiparraydatawithsmallsamplesizes AT liuqingzhong ageneselectionmethodforgenechiparraydatawithsmallsamplesizes AT mcgeemonnie ageneselectionmethodforgenechiparraydatawithsmallsamplesizes AT kongmegan ageneselectionmethodforgenechiparraydatawithsmallsamplesizes AT huangxudong ageneselectionmethodforgenechiparraydatawithsmallsamplesizes AT dengyouping ageneselectionmethodforgenechiparraydatawithsmallsamplesizes AT scheuermannrichardh ageneselectionmethodforgenechiparraydatawithsmallsamplesizes AT chenzhongxue geneselectionmethodforgenechiparraydatawithsmallsamplesizes AT liuqingzhong geneselectionmethodforgenechiparraydatawithsmallsamplesizes AT mcgeemonnie geneselectionmethodforgenechiparraydatawithsmallsamplesizes AT kongmegan geneselectionmethodforgenechiparraydatawithsmallsamplesizes AT huangxudong geneselectionmethodforgenechiparraydatawithsmallsamplesizes AT dengyouping geneselectionmethodforgenechiparraydatawithsmallsamplesizes AT scheuermannrichardh geneselectionmethodforgenechiparraydatawithsmallsamplesizes |