Cargando…
Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk
BACKGROUND: The use of Cardiovascular Disease (CVD) risk estimation scores in primary prevention has long been established. However, their performance still remains a matter of concern. The aim of this study was to explore the potential of using ML methodologies on CVD prediction, especially compare...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6311054/ https://www.ncbi.nlm.nih.gov/pubmed/30594138 http://dx.doi.org/10.1186/s12874-018-0644-1 |
_version_ | 1783383544855265280 |
---|---|
author | Dimopoulos, Alexandros C. Nikolaidou, Mara Caballero, Francisco Félix Engchuan, Worrawat Sanchez-Niubo, Albert Arndt, Holger Ayuso-Mateos, José Luis Haro, Josep Maria Chatterji, Somnath Georgousopoulou, Ekavi N. Pitsavos, Christos Panagiotakos, Demosthenes B. |
author_facet | Dimopoulos, Alexandros C. Nikolaidou, Mara Caballero, Francisco Félix Engchuan, Worrawat Sanchez-Niubo, Albert Arndt, Holger Ayuso-Mateos, José Luis Haro, Josep Maria Chatterji, Somnath Georgousopoulou, Ekavi N. Pitsavos, Christos Panagiotakos, Demosthenes B. |
author_sort | Dimopoulos, Alexandros C. |
collection | PubMed |
description | BACKGROUND: The use of Cardiovascular Disease (CVD) risk estimation scores in primary prevention has long been established. However, their performance still remains a matter of concern. The aim of this study was to explore the potential of using ML methodologies on CVD prediction, especially compared to established risk tool, the HellenicSCORE. METHODS: Data from the ATTICA prospective study (n = 2020 adults), enrolled during 2001–02 and followed-up in 2011–12 were used. Three different machine-learning classifiers (k-NN, random forest, and decision tree) were trained and evaluated against 10-year CVD incidence, in comparison with the HellenicSCORE tool (a calibration of the ESC SCORE). Training datasets, consisting from 16 variables to only 5 variables, were chosen, with or without bootstrapping, in an attempt to achieve the best overall performance for the machine learning classifiers. RESULTS: Depending on the classifier and the training dataset the outcome varied in efficiency but was comparable between the two methodological approaches. In particular, the HellenicSCORE showed accuracy 85%, specificity 20%, sensitivity 97%, positive predictive value 87%, and negative predictive value 58%, whereas for the machine learning methodologies, accuracy ranged from 65 to 84%, specificity from 46 to 56%, sensitivity from 67 to 89%, positive predictive value from 89 to 91%, and negative predictive value from 24 to 45%; random forest gave the best results, while the k-NN gave the poorest results. CONCLUSIONS: The alternative approach of machine learning classification produced results comparable to that of risk prediction scores and, thus, it can be used as a method of CVD prediction, taking into consideration the advantages that machine learning methodologies may offer. |
format | Online Article Text |
id | pubmed-6311054 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-63110542019-01-07 Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk Dimopoulos, Alexandros C. Nikolaidou, Mara Caballero, Francisco Félix Engchuan, Worrawat Sanchez-Niubo, Albert Arndt, Holger Ayuso-Mateos, José Luis Haro, Josep Maria Chatterji, Somnath Georgousopoulou, Ekavi N. Pitsavos, Christos Panagiotakos, Demosthenes B. BMC Med Res Methodol Research Article BACKGROUND: The use of Cardiovascular Disease (CVD) risk estimation scores in primary prevention has long been established. However, their performance still remains a matter of concern. The aim of this study was to explore the potential of using ML methodologies on CVD prediction, especially compared to established risk tool, the HellenicSCORE. METHODS: Data from the ATTICA prospective study (n = 2020 adults), enrolled during 2001–02 and followed-up in 2011–12 were used. Three different machine-learning classifiers (k-NN, random forest, and decision tree) were trained and evaluated against 10-year CVD incidence, in comparison with the HellenicSCORE tool (a calibration of the ESC SCORE). Training datasets, consisting from 16 variables to only 5 variables, were chosen, with or without bootstrapping, in an attempt to achieve the best overall performance for the machine learning classifiers. RESULTS: Depending on the classifier and the training dataset the outcome varied in efficiency but was comparable between the two methodological approaches. In particular, the HellenicSCORE showed accuracy 85%, specificity 20%, sensitivity 97%, positive predictive value 87%, and negative predictive value 58%, whereas for the machine learning methodologies, accuracy ranged from 65 to 84%, specificity from 46 to 56%, sensitivity from 67 to 89%, positive predictive value from 89 to 91%, and negative predictive value from 24 to 45%; random forest gave the best results, while the k-NN gave the poorest results. CONCLUSIONS: The alternative approach of machine learning classification produced results comparable to that of risk prediction scores and, thus, it can be used as a method of CVD prediction, taking into consideration the advantages that machine learning methodologies may offer. BioMed Central 2018-12-29 /pmc/articles/PMC6311054/ /pubmed/30594138 http://dx.doi.org/10.1186/s12874-018-0644-1 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Dimopoulos, Alexandros C. Nikolaidou, Mara Caballero, Francisco Félix Engchuan, Worrawat Sanchez-Niubo, Albert Arndt, Holger Ayuso-Mateos, José Luis Haro, Josep Maria Chatterji, Somnath Georgousopoulou, Ekavi N. Pitsavos, Christos Panagiotakos, Demosthenes B. Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk |
title | Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk |
title_full | Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk |
title_fullStr | Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk |
title_full_unstemmed | Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk |
title_short | Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk |
title_sort | machine learning methodologies versus cardiovascular risk scores, in predicting disease risk |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6311054/ https://www.ncbi.nlm.nih.gov/pubmed/30594138 http://dx.doi.org/10.1186/s12874-018-0644-1 |
work_keys_str_mv | AT dimopoulosalexandrosc machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk AT nikolaidoumara machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk AT caballerofranciscofelix machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk AT engchuanworrawat machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk AT sanchezniuboalbert machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk AT arndtholger machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk AT ayusomateosjoseluis machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk AT harojosepmaria machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk AT chatterjisomnath machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk AT georgousopoulouekavin machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk AT pitsavoschristos machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk AT panagiotakosdemosthenesb machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk |