Cargando…

A gene selection method for GeneChip array data with small sample sizes

BACKGROUND: In microarray experiments with small sample sizes, it is a challenge to estimate p-values accurately and decide cutoff p-values for gene selection appropriately. Although permutation-based methods have proved to have greater sensitivity and specificity than the regular t-test, their p-va...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Zhongxue, Liu, Qingzhong, McGee, Monnie, Kong, Megan, Huang, Xudong, Deng, Youping, Scheuermann, Richard H
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287503/
https://www.ncbi.nlm.nih.gov/pubmed/22369149
http://dx.doi.org/10.1186/1471-2164-12-S5-S7
_version_ 1782224678062915584
author Chen, Zhongxue
Liu, Qingzhong
McGee, Monnie
Kong, Megan
Huang, Xudong
Deng, Youping
Scheuermann, Richard H
author_facet Chen, Zhongxue
Liu, Qingzhong
McGee, Monnie
Kong, Megan
Huang, Xudong
Deng, Youping
Scheuermann, Richard H
author_sort Chen, Zhongxue
collection PubMed
description BACKGROUND: In microarray experiments with small sample sizes, it is a challenge to estimate p-values accurately and decide cutoff p-values for gene selection appropriately. Although permutation-based methods have proved to have greater sensitivity and specificity than the regular t-test, their p-values are highly discrete due to the limited number of permutations available in very small sample sizes. Furthermore, estimated permutation-based p-values for true nulls are highly correlated and not uniformly distributed between zero and one, making it difficult to use current false discovery rate (FDR)-controlling methods. RESULTS: We propose a model-based information sharing method (MBIS) that, after an appropriate data transformation, utilizes information shared among genes. We use a normal distribution to model the mean differences of true nulls across two experimental conditions. The parameters of the model are then estimated using all data in hand. Based on this model, p-values, which are uniformly distributed from true nulls, are calculated. Then, since FDR-controlling methods are generally not well suited to microarray data with very small sample sizes, we select genes for a given cutoff p-value and then estimate the false discovery rate. CONCLUSION: Simulation studies and analysis using real microarray data show that the proposed method, MBIS, is more powerful and reliable than current methods. It has wide application to a variety of situations.
format Online
Article
Text
id pubmed-3287503
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32875032012-03-01 A gene selection method for GeneChip array data with small sample sizes Chen, Zhongxue Liu, Qingzhong McGee, Monnie Kong, Megan Huang, Xudong Deng, Youping Scheuermann, Richard H BMC Genomics Research Article BACKGROUND: In microarray experiments with small sample sizes, it is a challenge to estimate p-values accurately and decide cutoff p-values for gene selection appropriately. Although permutation-based methods have proved to have greater sensitivity and specificity than the regular t-test, their p-values are highly discrete due to the limited number of permutations available in very small sample sizes. Furthermore, estimated permutation-based p-values for true nulls are highly correlated and not uniformly distributed between zero and one, making it difficult to use current false discovery rate (FDR)-controlling methods. RESULTS: We propose a model-based information sharing method (MBIS) that, after an appropriate data transformation, utilizes information shared among genes. We use a normal distribution to model the mean differences of true nulls across two experimental conditions. The parameters of the model are then estimated using all data in hand. Based on this model, p-values, which are uniformly distributed from true nulls, are calculated. Then, since FDR-controlling methods are generally not well suited to microarray data with very small sample sizes, we select genes for a given cutoff p-value and then estimate the false discovery rate. CONCLUSION: Simulation studies and analysis using real microarray data show that the proposed method, MBIS, is more powerful and reliable than current methods. It has wide application to a variety of situations. BioMed Central 2011-12-23 /pmc/articles/PMC3287503/ /pubmed/22369149 http://dx.doi.org/10.1186/1471-2164-12-S5-S7 Text en Copyright ©2011 Chen et al. licensee BioMed Central Ltd http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chen, Zhongxue
Liu, Qingzhong
McGee, Monnie
Kong, Megan
Huang, Xudong
Deng, Youping
Scheuermann, Richard H
A gene selection method for GeneChip array data with small sample sizes
title A gene selection method for GeneChip array data with small sample sizes
title_full A gene selection method for GeneChip array data with small sample sizes
title_fullStr A gene selection method for GeneChip array data with small sample sizes
title_full_unstemmed A gene selection method for GeneChip array data with small sample sizes
title_short A gene selection method for GeneChip array data with small sample sizes
title_sort gene selection method for genechip array data with small sample sizes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287503/
https://www.ncbi.nlm.nih.gov/pubmed/22369149
http://dx.doi.org/10.1186/1471-2164-12-S5-S7
work_keys_str_mv AT chenzhongxue ageneselectionmethodforgenechiparraydatawithsmallsamplesizes
AT liuqingzhong ageneselectionmethodforgenechiparraydatawithsmallsamplesizes
AT mcgeemonnie ageneselectionmethodforgenechiparraydatawithsmallsamplesizes
AT kongmegan ageneselectionmethodforgenechiparraydatawithsmallsamplesizes
AT huangxudong ageneselectionmethodforgenechiparraydatawithsmallsamplesizes
AT dengyouping ageneselectionmethodforgenechiparraydatawithsmallsamplesizes
AT scheuermannrichardh ageneselectionmethodforgenechiparraydatawithsmallsamplesizes
AT chenzhongxue geneselectionmethodforgenechiparraydatawithsmallsamplesizes
AT liuqingzhong geneselectionmethodforgenechiparraydatawithsmallsamplesizes
AT mcgeemonnie geneselectionmethodforgenechiparraydatawithsmallsamplesizes
AT kongmegan geneselectionmethodforgenechiparraydatawithsmallsamplesizes
AT huangxudong geneselectionmethodforgenechiparraydatawithsmallsamplesizes
AT dengyouping geneselectionmethodforgenechiparraydatawithsmallsamplesizes
AT scheuermannrichardh geneselectionmethodforgenechiparraydatawithsmallsamplesizes