Cargando…

Gene selection with multiple ordering criteria

BACKGROUND: A microarray study may select different differentially expressed gene sets because of different selection criteria. For example, the fold-change and p-value are two commonly known criteria to select differentially expressed genes under two experimental conditions. These two selection cri...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, James J, Tsai, Chen-An, Tzeng, ShengLi, Chen, Chun-Houh
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1829166/
https://www.ncbi.nlm.nih.gov/pubmed/17338815
http://dx.doi.org/10.1186/1471-2105-8-74
_version_ 1782132749666091008
author Chen, James J
Tsai, Chen-An
Tzeng, ShengLi
Chen, Chun-Houh
author_facet Chen, James J
Tsai, Chen-An
Tzeng, ShengLi
Chen, Chun-Houh
author_sort Chen, James J
collection PubMed
description BACKGROUND: A microarray study may select different differentially expressed gene sets because of different selection criteria. For example, the fold-change and p-value are two commonly known criteria to select differentially expressed genes under two experimental conditions. These two selection criteria often result in incompatible selected gene sets. Also, in a two-factor, say, treatment by time experiment, the investigator may be interested in one gene list that responds to both treatment and time effects. RESULTS: We propose three layer ranking algorithms, point-admissible, line-admissible (convex), and Pareto, to provide a preference gene list from multiple gene lists generated by different ranking criteria. Using the public colon data as an example, the layer ranking algorithms are applied to the three univariate ranking criteria, fold-change, p-value, and frequency of selections by the SVM-RFE classifier. A simulation experiment shows that for experiments with small or moderate sample sizes (less than 20 per group) and detecting a 4-fold change or less, the two-dimensional (p-value and fold-change) convex layer ranking selects differentially expressed genes with generally lower FDR and higher power than the standard p-value ranking. Three applications are presented. The first application illustrates a use of the layer rankings to potentially improve predictive accuracy. The second application illustrates an application to a two-factor experiment involving two dose levels and two time points. The layer rankings are applied to selecting differentially expressed genes relating to the dose and time effects. In the third application, the layer rankings are applied to a benchmark data set consisting of three dilution concentrations to provide a ranking system from a long list of differentially expressed genes generated from the three dilution concentrations. CONCLUSION: The layer ranking algorithms are useful to help investigators in selecting the most promising genes from multiple gene lists generated by different filter, normalization, or analysis methods for various objectives.
format Text
id pubmed-1829166
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-18291662007-03-26 Gene selection with multiple ordering criteria Chen, James J Tsai, Chen-An Tzeng, ShengLi Chen, Chun-Houh BMC Bioinformatics Methodology Article BACKGROUND: A microarray study may select different differentially expressed gene sets because of different selection criteria. For example, the fold-change and p-value are two commonly known criteria to select differentially expressed genes under two experimental conditions. These two selection criteria often result in incompatible selected gene sets. Also, in a two-factor, say, treatment by time experiment, the investigator may be interested in one gene list that responds to both treatment and time effects. RESULTS: We propose three layer ranking algorithms, point-admissible, line-admissible (convex), and Pareto, to provide a preference gene list from multiple gene lists generated by different ranking criteria. Using the public colon data as an example, the layer ranking algorithms are applied to the three univariate ranking criteria, fold-change, p-value, and frequency of selections by the SVM-RFE classifier. A simulation experiment shows that for experiments with small or moderate sample sizes (less than 20 per group) and detecting a 4-fold change or less, the two-dimensional (p-value and fold-change) convex layer ranking selects differentially expressed genes with generally lower FDR and higher power than the standard p-value ranking. Three applications are presented. The first application illustrates a use of the layer rankings to potentially improve predictive accuracy. The second application illustrates an application to a two-factor experiment involving two dose levels and two time points. The layer rankings are applied to selecting differentially expressed genes relating to the dose and time effects. In the third application, the layer rankings are applied to a benchmark data set consisting of three dilution concentrations to provide a ranking system from a long list of differentially expressed genes generated from the three dilution concentrations. CONCLUSION: The layer ranking algorithms are useful to help investigators in selecting the most promising genes from multiple gene lists generated by different filter, normalization, or analysis methods for various objectives. BioMed Central 2007-03-05 /pmc/articles/PMC1829166/ /pubmed/17338815 http://dx.doi.org/10.1186/1471-2105-8-74 Text en Copyright © 2007 Chen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Chen, James J
Tsai, Chen-An
Tzeng, ShengLi
Chen, Chun-Houh
Gene selection with multiple ordering criteria
title Gene selection with multiple ordering criteria
title_full Gene selection with multiple ordering criteria
title_fullStr Gene selection with multiple ordering criteria
title_full_unstemmed Gene selection with multiple ordering criteria
title_short Gene selection with multiple ordering criteria
title_sort gene selection with multiple ordering criteria
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1829166/
https://www.ncbi.nlm.nih.gov/pubmed/17338815
http://dx.doi.org/10.1186/1471-2105-8-74
work_keys_str_mv AT chenjamesj geneselectionwithmultipleorderingcriteria
AT tsaichenan geneselectionwithmultipleorderingcriteria
AT tzengshengli geneselectionwithmultipleorderingcriteria
AT chenchunhouh geneselectionwithmultipleorderingcriteria