Cargando…

Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk

BACKGROUND: The use of Cardiovascular Disease (CVD) risk estimation scores in primary prevention has long been established. However, their performance still remains a matter of concern. The aim of this study was to explore the potential of using ML methodologies on CVD prediction, especially compare...

Descripción completa

Detalles Bibliográficos
Autores principales: Dimopoulos, Alexandros C., Nikolaidou, Mara, Caballero, Francisco Félix, Engchuan, Worrawat, Sanchez-Niubo, Albert, Arndt, Holger, Ayuso-Mateos, José Luis, Haro, Josep Maria, Chatterji, Somnath, Georgousopoulou, Ekavi N., Pitsavos, Christos, Panagiotakos, Demosthenes B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6311054/
https://www.ncbi.nlm.nih.gov/pubmed/30594138
http://dx.doi.org/10.1186/s12874-018-0644-1
_version_ 1783383544855265280
author Dimopoulos, Alexandros C.
Nikolaidou, Mara
Caballero, Francisco Félix
Engchuan, Worrawat
Sanchez-Niubo, Albert
Arndt, Holger
Ayuso-Mateos, José Luis
Haro, Josep Maria
Chatterji, Somnath
Georgousopoulou, Ekavi N.
Pitsavos, Christos
Panagiotakos, Demosthenes B.
author_facet Dimopoulos, Alexandros C.
Nikolaidou, Mara
Caballero, Francisco Félix
Engchuan, Worrawat
Sanchez-Niubo, Albert
Arndt, Holger
Ayuso-Mateos, José Luis
Haro, Josep Maria
Chatterji, Somnath
Georgousopoulou, Ekavi N.
Pitsavos, Christos
Panagiotakos, Demosthenes B.
author_sort Dimopoulos, Alexandros C.
collection PubMed
description BACKGROUND: The use of Cardiovascular Disease (CVD) risk estimation scores in primary prevention has long been established. However, their performance still remains a matter of concern. The aim of this study was to explore the potential of using ML methodologies on CVD prediction, especially compared to established risk tool, the HellenicSCORE. METHODS: Data from the ATTICA prospective study (n = 2020 adults), enrolled during 2001–02 and followed-up in 2011–12 were used. Three different machine-learning classifiers (k-NN, random forest, and decision tree) were trained and evaluated against 10-year CVD incidence, in comparison with the HellenicSCORE tool (a calibration of the ESC SCORE). Training datasets, consisting from 16 variables to only 5 variables, were chosen, with or without bootstrapping, in an attempt to achieve the best overall performance for the machine learning classifiers. RESULTS: Depending on the classifier and the training dataset the outcome varied in efficiency but was comparable between the two methodological approaches. In particular, the HellenicSCORE showed accuracy 85%, specificity 20%, sensitivity 97%, positive predictive value 87%, and negative predictive value 58%, whereas for the machine learning methodologies, accuracy ranged from 65 to 84%, specificity from 46 to 56%, sensitivity from 67 to 89%, positive predictive value from 89 to 91%, and negative predictive value from 24 to 45%; random forest gave the best results, while the k-NN gave the poorest results. CONCLUSIONS: The alternative approach of machine learning classification produced results comparable to that of risk prediction scores and, thus, it can be used as a method of CVD prediction, taking into consideration the advantages that machine learning methodologies may offer.
format Online
Article
Text
id pubmed-6311054
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63110542019-01-07 Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk Dimopoulos, Alexandros C. Nikolaidou, Mara Caballero, Francisco Félix Engchuan, Worrawat Sanchez-Niubo, Albert Arndt, Holger Ayuso-Mateos, José Luis Haro, Josep Maria Chatterji, Somnath Georgousopoulou, Ekavi N. Pitsavos, Christos Panagiotakos, Demosthenes B. BMC Med Res Methodol Research Article BACKGROUND: The use of Cardiovascular Disease (CVD) risk estimation scores in primary prevention has long been established. However, their performance still remains a matter of concern. The aim of this study was to explore the potential of using ML methodologies on CVD prediction, especially compared to established risk tool, the HellenicSCORE. METHODS: Data from the ATTICA prospective study (n = 2020 adults), enrolled during 2001–02 and followed-up in 2011–12 were used. Three different machine-learning classifiers (k-NN, random forest, and decision tree) were trained and evaluated against 10-year CVD incidence, in comparison with the HellenicSCORE tool (a calibration of the ESC SCORE). Training datasets, consisting from 16 variables to only 5 variables, were chosen, with or without bootstrapping, in an attempt to achieve the best overall performance for the machine learning classifiers. RESULTS: Depending on the classifier and the training dataset the outcome varied in efficiency but was comparable between the two methodological approaches. In particular, the HellenicSCORE showed accuracy 85%, specificity 20%, sensitivity 97%, positive predictive value 87%, and negative predictive value 58%, whereas for the machine learning methodologies, accuracy ranged from 65 to 84%, specificity from 46 to 56%, sensitivity from 67 to 89%, positive predictive value from 89 to 91%, and negative predictive value from 24 to 45%; random forest gave the best results, while the k-NN gave the poorest results. CONCLUSIONS: The alternative approach of machine learning classification produced results comparable to that of risk prediction scores and, thus, it can be used as a method of CVD prediction, taking into consideration the advantages that machine learning methodologies may offer. BioMed Central 2018-12-29 /pmc/articles/PMC6311054/ /pubmed/30594138 http://dx.doi.org/10.1186/s12874-018-0644-1 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Dimopoulos, Alexandros C.
Nikolaidou, Mara
Caballero, Francisco Félix
Engchuan, Worrawat
Sanchez-Niubo, Albert
Arndt, Holger
Ayuso-Mateos, José Luis
Haro, Josep Maria
Chatterji, Somnath
Georgousopoulou, Ekavi N.
Pitsavos, Christos
Panagiotakos, Demosthenes B.
Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk
title Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk
title_full Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk
title_fullStr Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk
title_full_unstemmed Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk
title_short Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk
title_sort machine learning methodologies versus cardiovascular risk scores, in predicting disease risk
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6311054/
https://www.ncbi.nlm.nih.gov/pubmed/30594138
http://dx.doi.org/10.1186/s12874-018-0644-1
work_keys_str_mv AT dimopoulosalexandrosc machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk
AT nikolaidoumara machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk
AT caballerofranciscofelix machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk
AT engchuanworrawat machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk
AT sanchezniuboalbert machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk
AT arndtholger machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk
AT ayusomateosjoseluis machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk
AT harojosepmaria machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk
AT chatterjisomnath machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk
AT georgousopoulouekavin machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk
AT pitsavoschristos machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk
AT panagiotakosdemosthenesb machinelearningmethodologiesversuscardiovascularriskscoresinpredictingdiseaserisk