Cargando…

FeatureSelect: a software for feature selection based on machine learning approaches

BACKGROUND: Feature selection, as a preprocessing stage, is a challenging problem in various sciences such as biology, engineering, computer science, and other fields. For this purpose, some studies have introduced tools and softwares such as WEKA. Meanwhile, these tools or softwares are based on fi...

Descripción completa

Detalles Bibliográficos
Autores principales: Masoudi-Sobhanzadeh, Yosef, Motieghader, Habib, Masoudi-Nejad, Ali
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6446290/
https://www.ncbi.nlm.nih.gov/pubmed/30943889
http://dx.doi.org/10.1186/s12859-019-2754-0
_version_ 1783408332791349248
author Masoudi-Sobhanzadeh, Yosef
Motieghader, Habib
Masoudi-Nejad, Ali
author_facet Masoudi-Sobhanzadeh, Yosef
Motieghader, Habib
Masoudi-Nejad, Ali
author_sort Masoudi-Sobhanzadeh, Yosef
collection PubMed
description BACKGROUND: Feature selection, as a preprocessing stage, is a challenging problem in various sciences such as biology, engineering, computer science, and other fields. For this purpose, some studies have introduced tools and softwares such as WEKA. Meanwhile, these tools or softwares are based on filter methods which have lower performance relative to wrapper methods. In this paper, we address this limitation and introduce a software application called FeatureSelect. In addition to filter methods, FeatureSelect consists of optimisation algorithms and three types of learners. It provides a user-friendly and straightforward method of feature selection for use in any kind of research, and can easily be applied to any type of balanced and unbalanced data based on several score functions like accuracy, sensitivity, specificity, etc. RESULTS: In addition to our previously introduced optimisation algorithm (WCC), a total of 10 efficient, well-known and recently developed algorithms have been implemented in FeatureSelect. We applied our software to a range of different datasets and evaluated the performance of its algorithms. Acquired results show that the performances of algorithms are varying on different datasets, but WCC, LCA, FOA, and LA are suitable than others in the overall state. The results also show that wrapper methods are better than filter methods. CONCLUSIONS: FeatureSelect is a feature or gene selection software application which is based on wrapper methods. Furthermore, it includes some popular filter methods and generates various comparison diagrams and statistical measurements. It is available from GitHub (https://github.com/LBBSoft/FeatureSelect) and is free open source software under an MIT license. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2754-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6446290
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64462902019-04-12 FeatureSelect: a software for feature selection based on machine learning approaches Masoudi-Sobhanzadeh, Yosef Motieghader, Habib Masoudi-Nejad, Ali BMC Bioinformatics Software BACKGROUND: Feature selection, as a preprocessing stage, is a challenging problem in various sciences such as biology, engineering, computer science, and other fields. For this purpose, some studies have introduced tools and softwares such as WEKA. Meanwhile, these tools or softwares are based on filter methods which have lower performance relative to wrapper methods. In this paper, we address this limitation and introduce a software application called FeatureSelect. In addition to filter methods, FeatureSelect consists of optimisation algorithms and three types of learners. It provides a user-friendly and straightforward method of feature selection for use in any kind of research, and can easily be applied to any type of balanced and unbalanced data based on several score functions like accuracy, sensitivity, specificity, etc. RESULTS: In addition to our previously introduced optimisation algorithm (WCC), a total of 10 efficient, well-known and recently developed algorithms have been implemented in FeatureSelect. We applied our software to a range of different datasets and evaluated the performance of its algorithms. Acquired results show that the performances of algorithms are varying on different datasets, but WCC, LCA, FOA, and LA are suitable than others in the overall state. The results also show that wrapper methods are better than filter methods. CONCLUSIONS: FeatureSelect is a feature or gene selection software application which is based on wrapper methods. Furthermore, it includes some popular filter methods and generates various comparison diagrams and statistical measurements. It is available from GitHub (https://github.com/LBBSoft/FeatureSelect) and is free open source software under an MIT license. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2754-0) contains supplementary material, which is available to authorized users. BioMed Central 2019-04-03 /pmc/articles/PMC6446290/ /pubmed/30943889 http://dx.doi.org/10.1186/s12859-019-2754-0 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Masoudi-Sobhanzadeh, Yosef
Motieghader, Habib
Masoudi-Nejad, Ali
FeatureSelect: a software for feature selection based on machine learning approaches
title FeatureSelect: a software for feature selection based on machine learning approaches
title_full FeatureSelect: a software for feature selection based on machine learning approaches
title_fullStr FeatureSelect: a software for feature selection based on machine learning approaches
title_full_unstemmed FeatureSelect: a software for feature selection based on machine learning approaches
title_short FeatureSelect: a software for feature selection based on machine learning approaches
title_sort featureselect: a software for feature selection based on machine learning approaches
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6446290/
https://www.ncbi.nlm.nih.gov/pubmed/30943889
http://dx.doi.org/10.1186/s12859-019-2754-0
work_keys_str_mv AT masoudisobhanzadehyosef featureselectasoftwareforfeatureselectionbasedonmachinelearningapproaches
AT motieghaderhabib featureselectasoftwareforfeatureselectionbasedonmachinelearningapproaches
AT masoudinejadali featureselectasoftwareforfeatureselectionbasedonmachinelearningapproaches