Cargando…

Robust proportional overlapping analysis for feature selection in binary classification within functional genomic experiments

In this paper, a novel feature selection method called Robust Proportional Overlapping Score (RPOS), for microarray gene expression datasets has been proposed, by utilizing the robust measure of dispersion, i.e., Median Absolute Deviation (MAD). This method robustly identifies the most discriminativ...

Descripción completa

Detalles Bibliográficos
Autores principales: Hamraz, Muhammad, Gul, Naz, Raza, Mushtaq, Khan, Dost Muhammad, Khalil, Umair, Zubair, Seema, Khan, Zardad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8176540/
https://www.ncbi.nlm.nih.gov/pubmed/34141889
http://dx.doi.org/10.7717/peerj-cs.562
_version_ 1783703275614240768
author Hamraz, Muhammad
Gul, Naz
Raza, Mushtaq
Khan, Dost Muhammad
Khalil, Umair
Zubair, Seema
Khan, Zardad
author_facet Hamraz, Muhammad
Gul, Naz
Raza, Mushtaq
Khan, Dost Muhammad
Khalil, Umair
Zubair, Seema
Khan, Zardad
author_sort Hamraz, Muhammad
collection PubMed
description In this paper, a novel feature selection method called Robust Proportional Overlapping Score (RPOS), for microarray gene expression datasets has been proposed, by utilizing the robust measure of dispersion, i.e., Median Absolute Deviation (MAD). This method robustly identifies the most discriminative genes by considering the overlapping scores of the gene expression values for binary class problems. Genes with a high degree of overlap between classes are discarded and the ones that discriminate between the classes are selected. The results of the proposed method are compared with five state-of-the-art gene selection methods based on classification error, Brier score, and sensitivity, by considering eleven gene expression datasets. Classification of observations for different sets of selected genes by the proposed method is carried out by three different classifiers, i.e., random forest, k-nearest neighbors (k-NN), and support vector machine (SVM). Box-plots and stability scores of the results are also shown in this paper. The results reveal that in most of the cases the proposed method outperforms the other methods.
format Online
Article
Text
id pubmed-8176540
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-81765402021-06-16 Robust proportional overlapping analysis for feature selection in binary classification within functional genomic experiments Hamraz, Muhammad Gul, Naz Raza, Mushtaq Khan, Dost Muhammad Khalil, Umair Zubair, Seema Khan, Zardad PeerJ Comput Sci Algorithms and Analysis of Algorithms In this paper, a novel feature selection method called Robust Proportional Overlapping Score (RPOS), for microarray gene expression datasets has been proposed, by utilizing the robust measure of dispersion, i.e., Median Absolute Deviation (MAD). This method robustly identifies the most discriminative genes by considering the overlapping scores of the gene expression values for binary class problems. Genes with a high degree of overlap between classes are discarded and the ones that discriminate between the classes are selected. The results of the proposed method are compared with five state-of-the-art gene selection methods based on classification error, Brier score, and sensitivity, by considering eleven gene expression datasets. Classification of observations for different sets of selected genes by the proposed method is carried out by three different classifiers, i.e., random forest, k-nearest neighbors (k-NN), and support vector machine (SVM). Box-plots and stability scores of the results are also shown in this paper. The results reveal that in most of the cases the proposed method outperforms the other methods. PeerJ Inc. 2021-06-01 /pmc/articles/PMC8176540/ /pubmed/34141889 http://dx.doi.org/10.7717/peerj-cs.562 Text en © 2021 Hamraz et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Algorithms and Analysis of Algorithms
Hamraz, Muhammad
Gul, Naz
Raza, Mushtaq
Khan, Dost Muhammad
Khalil, Umair
Zubair, Seema
Khan, Zardad
Robust proportional overlapping analysis for feature selection in binary classification within functional genomic experiments
title Robust proportional overlapping analysis for feature selection in binary classification within functional genomic experiments
title_full Robust proportional overlapping analysis for feature selection in binary classification within functional genomic experiments
title_fullStr Robust proportional overlapping analysis for feature selection in binary classification within functional genomic experiments
title_full_unstemmed Robust proportional overlapping analysis for feature selection in binary classification within functional genomic experiments
title_short Robust proportional overlapping analysis for feature selection in binary classification within functional genomic experiments
title_sort robust proportional overlapping analysis for feature selection in binary classification within functional genomic experiments
topic Algorithms and Analysis of Algorithms
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8176540/
https://www.ncbi.nlm.nih.gov/pubmed/34141889
http://dx.doi.org/10.7717/peerj-cs.562
work_keys_str_mv AT hamrazmuhammad robustproportionaloverlappinganalysisforfeatureselectioninbinaryclassificationwithinfunctionalgenomicexperiments
AT gulnaz robustproportionaloverlappinganalysisforfeatureselectioninbinaryclassificationwithinfunctionalgenomicexperiments
AT razamushtaq robustproportionaloverlappinganalysisforfeatureselectioninbinaryclassificationwithinfunctionalgenomicexperiments
AT khandostmuhammad robustproportionaloverlappinganalysisforfeatureselectioninbinaryclassificationwithinfunctionalgenomicexperiments
AT khalilumair robustproportionaloverlappinganalysisforfeatureselectioninbinaryclassificationwithinfunctionalgenomicexperiments
AT zubairseema robustproportionaloverlappinganalysisforfeatureselectioninbinaryclassificationwithinfunctionalgenomicexperiments
AT khanzardad robustproportionaloverlappinganalysisforfeatureselectioninbinaryclassificationwithinfunctionalgenomicexperiments