Cargando…

Machine learning with sparse nutrition data to improve cardiovascular mortality risk prediction in the USA using nationally randomly sampled data

OBJECTIVES: We aimed to test whether or not adding (1) nutrition predictor variables and/or (2) using machine learning models improves cardiovascular death prediction versus standard Cox models without nutrition predictor variables. DESIGN: Retrospective study. SETTING: Six waves of Survey (NHANES)...

Descripción completa

Detalles Bibliográficos
Autores principales: Rigdon, Joseph, Basu, Sanjay
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6924849/
https://www.ncbi.nlm.nih.gov/pubmed/31784446
http://dx.doi.org/10.1136/bmjopen-2019-032703
_version_ 1783481803718262784
author Rigdon, Joseph
Basu, Sanjay
author_facet Rigdon, Joseph
Basu, Sanjay
author_sort Rigdon, Joseph
collection PubMed
description OBJECTIVES: We aimed to test whether or not adding (1) nutrition predictor variables and/or (2) using machine learning models improves cardiovascular death prediction versus standard Cox models without nutrition predictor variables. DESIGN: Retrospective study. SETTING: Six waves of Survey (NHANES) data collected from 1999 to 2011 linked to the National Death Index (NDI). PARTICIPANTS: 29 390 participants were included in the training set for model derivation and 12 600 were included in the test set for model evaluation. Our study sample was approximately 20% black race and 25% Hispanic ethnicity. PRIMARY AND SECONDARY OUTCOME MEASURES: Time from NHANES interview until the minimum of time of cardiovascular death or censoring. RESULTS: A standard risk model excluding nutrition data overestimated risk nearly two-fold (calibration slope of predicted vs true risk: 0.53 (95% CI: 0.50 to 0.55)) with moderate discrimination (C-statistic: 0.87 (0.86 to 0.89)). Nutrition data alone failed to improve performance while machine learning alone improved calibration to 1.18 (0.92 to 1.44) and discrimination to 0.91 (0.90 to 0.92). Both together substantially improved calibration (slope: 1.01 (0.76 to 1.27)) and discrimination (C-statistic: 0.93 (0.92 to 0.94)). CONCLUSION: Our results indicate that the inclusion of nutrition data with available machine learning algorithms can substantially improve cardiovascular risk prediction.
format Online
Article
Text
id pubmed-6924849
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-69248492020-01-03 Machine learning with sparse nutrition data to improve cardiovascular mortality risk prediction in the USA using nationally randomly sampled data Rigdon, Joseph Basu, Sanjay BMJ Open Cardiovascular Medicine OBJECTIVES: We aimed to test whether or not adding (1) nutrition predictor variables and/or (2) using machine learning models improves cardiovascular death prediction versus standard Cox models without nutrition predictor variables. DESIGN: Retrospective study. SETTING: Six waves of Survey (NHANES) data collected from 1999 to 2011 linked to the National Death Index (NDI). PARTICIPANTS: 29 390 participants were included in the training set for model derivation and 12 600 were included in the test set for model evaluation. Our study sample was approximately 20% black race and 25% Hispanic ethnicity. PRIMARY AND SECONDARY OUTCOME MEASURES: Time from NHANES interview until the minimum of time of cardiovascular death or censoring. RESULTS: A standard risk model excluding nutrition data overestimated risk nearly two-fold (calibration slope of predicted vs true risk: 0.53 (95% CI: 0.50 to 0.55)) with moderate discrimination (C-statistic: 0.87 (0.86 to 0.89)). Nutrition data alone failed to improve performance while machine learning alone improved calibration to 1.18 (0.92 to 1.44) and discrimination to 0.91 (0.90 to 0.92). Both together substantially improved calibration (slope: 1.01 (0.76 to 1.27)) and discrimination (C-statistic: 0.93 (0.92 to 0.94)). CONCLUSION: Our results indicate that the inclusion of nutrition data with available machine learning algorithms can substantially improve cardiovascular risk prediction. BMJ Publishing Group 2019-11-28 /pmc/articles/PMC6924849/ /pubmed/31784446 http://dx.doi.org/10.1136/bmjopen-2019-032703 Text en © Author(s) (or their employer(s)) 2019. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Cardiovascular Medicine
Rigdon, Joseph
Basu, Sanjay
Machine learning with sparse nutrition data to improve cardiovascular mortality risk prediction in the USA using nationally randomly sampled data
title Machine learning with sparse nutrition data to improve cardiovascular mortality risk prediction in the USA using nationally randomly sampled data
title_full Machine learning with sparse nutrition data to improve cardiovascular mortality risk prediction in the USA using nationally randomly sampled data
title_fullStr Machine learning with sparse nutrition data to improve cardiovascular mortality risk prediction in the USA using nationally randomly sampled data
title_full_unstemmed Machine learning with sparse nutrition data to improve cardiovascular mortality risk prediction in the USA using nationally randomly sampled data
title_short Machine learning with sparse nutrition data to improve cardiovascular mortality risk prediction in the USA using nationally randomly sampled data
title_sort machine learning with sparse nutrition data to improve cardiovascular mortality risk prediction in the usa using nationally randomly sampled data
topic Cardiovascular Medicine
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6924849/
https://www.ncbi.nlm.nih.gov/pubmed/31784446
http://dx.doi.org/10.1136/bmjopen-2019-032703
work_keys_str_mv AT rigdonjoseph machinelearningwithsparsenutritiondatatoimprovecardiovascularmortalityriskpredictionintheusausingnationallyrandomlysampleddata
AT basusanjay machinelearningwithsparsenutritiondatatoimprovecardiovascularmortalityriskpredictionintheusausingnationallyrandomlysampleddata