Cargando…

Exploring feature selection and classification methods for predicting heart disease

Machine learning has been used successfully to improve the accuracy of computer-aided diagnosis systems. This paper experimentally assesses the performance of models derived by machine learning techniques by using relevant features chosen by various feature-selection methods. Four commonly used hear...

Descripción completa

Detalles Bibliográficos
Autores principales: Spencer, Robinson, Thabtah, Fadi, Abdelhamid, Neda, Thompson, Michael
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7133070/
https://www.ncbi.nlm.nih.gov/pubmed/32284873
http://dx.doi.org/10.1177/2055207620914777
_version_ 1783517559013769216
author Spencer, Robinson
Thabtah, Fadi
Abdelhamid, Neda
Thompson, Michael
author_facet Spencer, Robinson
Thabtah, Fadi
Abdelhamid, Neda
Thompson, Michael
author_sort Spencer, Robinson
collection PubMed
description Machine learning has been used successfully to improve the accuracy of computer-aided diagnosis systems. This paper experimentally assesses the performance of models derived by machine learning techniques by using relevant features chosen by various feature-selection methods. Four commonly used heart disease datasets have been evaluated using principal component analysis, Chi squared testing, ReliefF and symmetrical uncertainty to create distinctive feature sets. Then, a variety of classification algorithms have been used to create models that are then compared to seek the optimal features combinations, to improve the correct prediction of heart conditions. We found the benefits of using feature selection vary depending on the machine learning technique used for the heart datasets we consider. However, the best model we created used a combination of Chi-squared feature selection with the BayesNet algorithm and achieved an accuracy of 85.00% on the considered datasets.
format Online
Article
Text
id pubmed-7133070
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-71330702020-04-13 Exploring feature selection and classification methods for predicting heart disease Spencer, Robinson Thabtah, Fadi Abdelhamid, Neda Thompson, Michael Digit Health Original Research Machine learning has been used successfully to improve the accuracy of computer-aided diagnosis systems. This paper experimentally assesses the performance of models derived by machine learning techniques by using relevant features chosen by various feature-selection methods. Four commonly used heart disease datasets have been evaluated using principal component analysis, Chi squared testing, ReliefF and symmetrical uncertainty to create distinctive feature sets. Then, a variety of classification algorithms have been used to create models that are then compared to seek the optimal features combinations, to improve the correct prediction of heart conditions. We found the benefits of using feature selection vary depending on the machine learning technique used for the heart datasets we consider. However, the best model we created used a combination of Chi-squared feature selection with the BayesNet algorithm and achieved an accuracy of 85.00% on the considered datasets. SAGE Publications 2020-03-29 /pmc/articles/PMC7133070/ /pubmed/32284873 http://dx.doi.org/10.1177/2055207620914777 Text en © The Author(s) 2020 https://creativecommons.org/licenses/by-nc/4.0/ Creative Commons Non Commercial CC BY-NC: This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research
Spencer, Robinson
Thabtah, Fadi
Abdelhamid, Neda
Thompson, Michael
Exploring feature selection and classification methods for predicting heart disease
title Exploring feature selection and classification methods for predicting heart disease
title_full Exploring feature selection and classification methods for predicting heart disease
title_fullStr Exploring feature selection and classification methods for predicting heart disease
title_full_unstemmed Exploring feature selection and classification methods for predicting heart disease
title_short Exploring feature selection and classification methods for predicting heart disease
title_sort exploring feature selection and classification methods for predicting heart disease
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7133070/
https://www.ncbi.nlm.nih.gov/pubmed/32284873
http://dx.doi.org/10.1177/2055207620914777
work_keys_str_mv AT spencerrobinson exploringfeatureselectionandclassificationmethodsforpredictingheartdisease
AT thabtahfadi exploringfeatureselectionandclassificationmethodsforpredictingheartdisease
AT abdelhamidneda exploringfeatureselectionandclassificationmethodsforpredictingheartdisease
AT thompsonmichael exploringfeatureselectionandclassificationmethodsforpredictingheartdisease