Cargando…

Gene Selection for Multiclass Prediction by Weighted Fisher Criterion

Gene expression profiling has been widely used to study molecular signatures of many diseases and to develop molecular diagnostics for disease prediction. Gene selection, as an important step for improved diagnostics, screens tens of thousands of genes and identifies a small subset that discriminate...

Descripción completa

Detalles Bibliográficos
Autores principales: Xuan, Jianhua, Wang, Yue, Dong, Yibin, Feng, Yuanjian, Wang, Bin, Khan, Javed, Bakay, Maria, Wang, Zuyi, Pachman, Lauren, Winokur, Sara, Chen, Yi-Wen, Clarke, Robert, Hoffman, Eric
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3171347/
https://www.ncbi.nlm.nih.gov/pubmed/17713593
http://dx.doi.org/10.1155/2007/64628
_version_ 1782211741975838720
author Xuan, Jianhua
Wang, Yue
Dong, Yibin
Feng, Yuanjian
Wang, Bin
Khan, Javed
Bakay, Maria
Wang, Zuyi
Pachman, Lauren
Winokur, Sara
Chen, Yi-Wen
Clarke, Robert
Hoffman, Eric
author_facet Xuan, Jianhua
Wang, Yue
Dong, Yibin
Feng, Yuanjian
Wang, Bin
Khan, Javed
Bakay, Maria
Wang, Zuyi
Pachman, Lauren
Winokur, Sara
Chen, Yi-Wen
Clarke, Robert
Hoffman, Eric
author_sort Xuan, Jianhua
collection PubMed
description Gene expression profiling has been widely used to study molecular signatures of many diseases and to develop molecular diagnostics for disease prediction. Gene selection, as an important step for improved diagnostics, screens tens of thousands of genes and identifies a small subset that discriminates between disease types. A two-step gene selection method is proposed to identify informative gene subsets for accurate classification of multiclass phenotypes. In the first step, individually discriminatory genes (IDGs) are identified by using one-dimensional weighted Fisher criterion (wFC). In the second step, jointly discriminatory genes (JDGs) are selected by sequential search methods, based on their joint class separability measured by multidimensional weighted Fisher criterion (wFC). The performance of the selected gene subsets for multiclass prediction is evaluated by artificial neural networks (ANNs) and/or support vector machines (SVMs). By applying the proposed IDG/JDG approach to two microarray studies, that is, small round blue cell tumors (SRBCTs) and muscular dystrophies (MDs), we successfully identified a much smaller yet efficient set of JDGs for diagnosing SRBCTs and MDs with high prediction accuracies (96.9% for SRBCTs and 92.3% for MDs, resp.). These experimental results demonstrated that the two-step gene selection method is able to identify a subset of highly discriminative genes for improved multiclass prediction.
format Online
Article
Text
id pubmed-3171347
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Springer
record_format MEDLINE/PubMed
spelling pubmed-31713472011-09-13 Gene Selection for Multiclass Prediction by Weighted Fisher Criterion Xuan, Jianhua Wang, Yue Dong, Yibin Feng, Yuanjian Wang, Bin Khan, Javed Bakay, Maria Wang, Zuyi Pachman, Lauren Winokur, Sara Chen, Yi-Wen Clarke, Robert Hoffman, Eric EURASIP J Bioinform Syst Biol Research Article Gene expression profiling has been widely used to study molecular signatures of many diseases and to develop molecular diagnostics for disease prediction. Gene selection, as an important step for improved diagnostics, screens tens of thousands of genes and identifies a small subset that discriminates between disease types. A two-step gene selection method is proposed to identify informative gene subsets for accurate classification of multiclass phenotypes. In the first step, individually discriminatory genes (IDGs) are identified by using one-dimensional weighted Fisher criterion (wFC). In the second step, jointly discriminatory genes (JDGs) are selected by sequential search methods, based on their joint class separability measured by multidimensional weighted Fisher criterion (wFC). The performance of the selected gene subsets for multiclass prediction is evaluated by artificial neural networks (ANNs) and/or support vector machines (SVMs). By applying the proposed IDG/JDG approach to two microarray studies, that is, small round blue cell tumors (SRBCTs) and muscular dystrophies (MDs), we successfully identified a much smaller yet efficient set of JDGs for diagnosing SRBCTs and MDs with high prediction accuracies (96.9% for SRBCTs and 92.3% for MDs, resp.). These experimental results demonstrated that the two-step gene selection method is able to identify a subset of highly discriminative genes for improved multiclass prediction. Springer 2007-07-10 /pmc/articles/PMC3171347/ /pubmed/17713593 http://dx.doi.org/10.1155/2007/64628 Text en Copyright © 2007 Jianhua Xuan et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Xuan, Jianhua
Wang, Yue
Dong, Yibin
Feng, Yuanjian
Wang, Bin
Khan, Javed
Bakay, Maria
Wang, Zuyi
Pachman, Lauren
Winokur, Sara
Chen, Yi-Wen
Clarke, Robert
Hoffman, Eric
Gene Selection for Multiclass Prediction by Weighted Fisher Criterion
title Gene Selection for Multiclass Prediction by Weighted Fisher Criterion
title_full Gene Selection for Multiclass Prediction by Weighted Fisher Criterion
title_fullStr Gene Selection for Multiclass Prediction by Weighted Fisher Criterion
title_full_unstemmed Gene Selection for Multiclass Prediction by Weighted Fisher Criterion
title_short Gene Selection for Multiclass Prediction by Weighted Fisher Criterion
title_sort gene selection for multiclass prediction by weighted fisher criterion
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3171347/
https://www.ncbi.nlm.nih.gov/pubmed/17713593
http://dx.doi.org/10.1155/2007/64628
work_keys_str_mv AT xuanjianhua geneselectionformulticlasspredictionbyweightedfishercriterion
AT wangyue geneselectionformulticlasspredictionbyweightedfishercriterion
AT dongyibin geneselectionformulticlasspredictionbyweightedfishercriterion
AT fengyuanjian geneselectionformulticlasspredictionbyweightedfishercriterion
AT wangbin geneselectionformulticlasspredictionbyweightedfishercriterion
AT khanjaved geneselectionformulticlasspredictionbyweightedfishercriterion
AT bakaymaria geneselectionformulticlasspredictionbyweightedfishercriterion
AT wangzuyi geneselectionformulticlasspredictionbyweightedfishercriterion
AT pachmanlauren geneselectionformulticlasspredictionbyweightedfishercriterion
AT winokursara geneselectionformulticlasspredictionbyweightedfishercriterion
AT chenyiwen geneselectionformulticlasspredictionbyweightedfishercriterion
AT clarkerobert geneselectionformulticlasspredictionbyweightedfishercriterion
AT hoffmaneric geneselectionformulticlasspredictionbyweightedfishercriterion