Cargando…

Predicting coronary artery disease: a comparison between two data mining algorithms

BACKGROUND: Cardiovascular diseases (CADs) are the first leading cause of death across the world. World Health Organization has estimated that morality rate caused by heart diseases will mount to 23 million cases by 2030. Hence, the use of data mining algorithms could be useful in predicting coronar...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ayatollahi, Haleh, Gholamhosseini, Leila, Salehi, Masoud
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2019
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6489351/ https://www.ncbi.nlm.nih.gov/pubmed/31035958 http://dx.doi.org/10.1186/s12889-019-6721-5

_version_	1783414808656216064
author	Ayatollahi, Haleh Gholamhosseini, Leila Salehi, Masoud
author_facet	Ayatollahi, Haleh Gholamhosseini, Leila Salehi, Masoud
author_sort	Ayatollahi, Haleh
collection	PubMed
description	BACKGROUND: Cardiovascular diseases (CADs) are the first leading cause of death across the world. World Health Organization has estimated that morality rate caused by heart diseases will mount to 23 million cases by 2030. Hence, the use of data mining algorithms could be useful in predicting coronary artery diseases. Therefore, the present study aimed to compare the positive predictive value (PPV) of CAD using artificial neural network (ANN) and SVM algorithms and their distinction in terms of predicting CAD in the selected hospitals. METHODS: The present study was conducted by using data mining techniques. The research sample was the medical records of the patients with coronary artery disease who were hospitalized in three hospitals affiliated to AJA University of Medical Sciences between March 2016 and March 2017 (n = 1324). The dataset and the predicting variables used in this study was the same for both data mining techniques. Totally, 25 variables affecting CAD were selected and related data were extracted. After normalizing and cleaning the data, they were entered into SPSS (V23.0) and Excel 2013. Then, R 3.3.2 was used for statistical computing. RESULTS: The SVM model had lower MAPE (112.03), higher Hosmer-Lemeshow test’s result (16.71), and higher sensitivity (92.23). Moreover, variables affecting CAD (74.42) yielded better goodness of fit in SVM model and provided more accurate result than the ANN model. On the other hand, since the area under the receiver operating characteristic (ROC) curve in the SVM algorithm was more than this area in ANN model, it could be concluded that SVM model had higher accuracy than the ANN model. CONCLUSION: According to the results, the SVM algorithm presented higher accuracy and better performance than the ANN model and was characterized with higher power and sensitivity. Overall, it provided a better classification for the prediction of CAD. The use of other data mining algorithms are suggested to improve the positive predictive value of the disease prediction.
format	Online Article Text
id	pubmed-6489351
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-64893512019-06-04 Predicting coronary artery disease: a comparison between two data mining algorithms Ayatollahi, Haleh Gholamhosseini, Leila Salehi, Masoud BMC Public Health Research Article BACKGROUND: Cardiovascular diseases (CADs) are the first leading cause of death across the world. World Health Organization has estimated that morality rate caused by heart diseases will mount to 23 million cases by 2030. Hence, the use of data mining algorithms could be useful in predicting coronary artery diseases. Therefore, the present study aimed to compare the positive predictive value (PPV) of CAD using artificial neural network (ANN) and SVM algorithms and their distinction in terms of predicting CAD in the selected hospitals. METHODS: The present study was conducted by using data mining techniques. The research sample was the medical records of the patients with coronary artery disease who were hospitalized in three hospitals affiliated to AJA University of Medical Sciences between March 2016 and March 2017 (n = 1324). The dataset and the predicting variables used in this study was the same for both data mining techniques. Totally, 25 variables affecting CAD were selected and related data were extracted. After normalizing and cleaning the data, they were entered into SPSS (V23.0) and Excel 2013. Then, R 3.3.2 was used for statistical computing. RESULTS: The SVM model had lower MAPE (112.03), higher Hosmer-Lemeshow test’s result (16.71), and higher sensitivity (92.23). Moreover, variables affecting CAD (74.42) yielded better goodness of fit in SVM model and provided more accurate result than the ANN model. On the other hand, since the area under the receiver operating characteristic (ROC) curve in the SVM algorithm was more than this area in ANN model, it could be concluded that SVM model had higher accuracy than the ANN model. CONCLUSION: According to the results, the SVM algorithm presented higher accuracy and better performance than the ANN model and was characterized with higher power and sensitivity. Overall, it provided a better classification for the prediction of CAD. The use of other data mining algorithms are suggested to improve the positive predictive value of the disease prediction. BioMed Central 2019-04-29 /pmc/articles/PMC6489351/ /pubmed/31035958 http://dx.doi.org/10.1186/s12889-019-6721-5 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Article Ayatollahi, Haleh Gholamhosseini, Leila Salehi, Masoud Predicting coronary artery disease: a comparison between two data mining algorithms
title	Predicting coronary artery disease: a comparison between two data mining algorithms
title_full	Predicting coronary artery disease: a comparison between two data mining algorithms
title_fullStr	Predicting coronary artery disease: a comparison between two data mining algorithms
title_full_unstemmed	Predicting coronary artery disease: a comparison between two data mining algorithms
title_short	Predicting coronary artery disease: a comparison between two data mining algorithms
title_sort	predicting coronary artery disease: a comparison between two data mining algorithms
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6489351/ https://www.ncbi.nlm.nih.gov/pubmed/31035958 http://dx.doi.org/10.1186/s12889-019-6721-5
work_keys_str_mv	AT ayatollahihaleh predictingcoronaryarterydiseaseacomparisonbetweentwodataminingalgorithms AT gholamhosseinileila predictingcoronaryarterydiseaseacomparisonbetweentwodataminingalgorithms AT salehimasoud predictingcoronaryarterydiseaseacomparisonbetweentwodataminingalgorithms

Predicting coronary artery disease: a comparison between two data mining algorithms

Ejemplares similares