Cargando…

Learning from Longitudinal Data in Electronic Health Record and Genetic Data to Improve Cardiovascular Event Prediction

Current approaches to predicting a cardiovascular disease (CVD) event rely on conventional risk factors and cross-sectional data. In this study, we applied machine learning and deep learning models to 10-year CVD event prediction by using longitudinal electronic health record (EHR) and genetic data....

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Juan, Feng, QiPing, Wu, Patrick, Lupu, Roxana A., Wilke, Russell A., Wells, Quinn S., Denny, Joshua C., Wei, Wei-Qi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6345960/
https://www.ncbi.nlm.nih.gov/pubmed/30679510
http://dx.doi.org/10.1038/s41598-018-36745-x
_version_ 1783389667085778944
author Zhao, Juan
Feng, QiPing
Wu, Patrick
Lupu, Roxana A.
Wilke, Russell A.
Wells, Quinn S.
Denny, Joshua C.
Wei, Wei-Qi
author_facet Zhao, Juan
Feng, QiPing
Wu, Patrick
Lupu, Roxana A.
Wilke, Russell A.
Wells, Quinn S.
Denny, Joshua C.
Wei, Wei-Qi
author_sort Zhao, Juan
collection PubMed
description Current approaches to predicting a cardiovascular disease (CVD) event rely on conventional risk factors and cross-sectional data. In this study, we applied machine learning and deep learning models to 10-year CVD event prediction by using longitudinal electronic health record (EHR) and genetic data. Our study cohort included 109, 490 individuals. In the first experiment, we extracted aggregated and longitudinal features from EHR. We applied logistic regression, random forests, gradient boosting trees, convolutional neural networks (CNN) and recurrent neural networks with long short-term memory (LSTM) units. In the second experiment, we applied a late-fusion approach to incorporate genetic features. We compared the performance with approaches currently utilized in routine clinical practice – American College of Cardiology and the American Heart Association (ACC/AHA) Pooled Cohort Risk Equation. Our results indicated that incorporating longitudinal feature lead to better event prediction. Combining genetic features through a late-fusion approach can further improve CVD prediction, underscoring the importance of integrating relevant genetic data whenever available.
format Online
Article
Text
id pubmed-6345960
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-63459602019-01-29 Learning from Longitudinal Data in Electronic Health Record and Genetic Data to Improve Cardiovascular Event Prediction Zhao, Juan Feng, QiPing Wu, Patrick Lupu, Roxana A. Wilke, Russell A. Wells, Quinn S. Denny, Joshua C. Wei, Wei-Qi Sci Rep Article Current approaches to predicting a cardiovascular disease (CVD) event rely on conventional risk factors and cross-sectional data. In this study, we applied machine learning and deep learning models to 10-year CVD event prediction by using longitudinal electronic health record (EHR) and genetic data. Our study cohort included 109, 490 individuals. In the first experiment, we extracted aggregated and longitudinal features from EHR. We applied logistic regression, random forests, gradient boosting trees, convolutional neural networks (CNN) and recurrent neural networks with long short-term memory (LSTM) units. In the second experiment, we applied a late-fusion approach to incorporate genetic features. We compared the performance with approaches currently utilized in routine clinical practice – American College of Cardiology and the American Heart Association (ACC/AHA) Pooled Cohort Risk Equation. Our results indicated that incorporating longitudinal feature lead to better event prediction. Combining genetic features through a late-fusion approach can further improve CVD prediction, underscoring the importance of integrating relevant genetic data whenever available. Nature Publishing Group UK 2019-01-24 /pmc/articles/PMC6345960/ /pubmed/30679510 http://dx.doi.org/10.1038/s41598-018-36745-x Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Zhao, Juan
Feng, QiPing
Wu, Patrick
Lupu, Roxana A.
Wilke, Russell A.
Wells, Quinn S.
Denny, Joshua C.
Wei, Wei-Qi
Learning from Longitudinal Data in Electronic Health Record and Genetic Data to Improve Cardiovascular Event Prediction
title Learning from Longitudinal Data in Electronic Health Record and Genetic Data to Improve Cardiovascular Event Prediction
title_full Learning from Longitudinal Data in Electronic Health Record and Genetic Data to Improve Cardiovascular Event Prediction
title_fullStr Learning from Longitudinal Data in Electronic Health Record and Genetic Data to Improve Cardiovascular Event Prediction
title_full_unstemmed Learning from Longitudinal Data in Electronic Health Record and Genetic Data to Improve Cardiovascular Event Prediction
title_short Learning from Longitudinal Data in Electronic Health Record and Genetic Data to Improve Cardiovascular Event Prediction
title_sort learning from longitudinal data in electronic health record and genetic data to improve cardiovascular event prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6345960/
https://www.ncbi.nlm.nih.gov/pubmed/30679510
http://dx.doi.org/10.1038/s41598-018-36745-x
work_keys_str_mv AT zhaojuan learningfromlongitudinaldatainelectronichealthrecordandgeneticdatatoimprovecardiovasculareventprediction
AT fengqiping learningfromlongitudinaldatainelectronichealthrecordandgeneticdatatoimprovecardiovasculareventprediction
AT wupatrick learningfromlongitudinaldatainelectronichealthrecordandgeneticdatatoimprovecardiovasculareventprediction
AT lupuroxanaa learningfromlongitudinaldatainelectronichealthrecordandgeneticdatatoimprovecardiovasculareventprediction
AT wilkerussella learningfromlongitudinaldatainelectronichealthrecordandgeneticdatatoimprovecardiovasculareventprediction
AT wellsquinns learningfromlongitudinaldatainelectronichealthrecordandgeneticdatatoimprovecardiovasculareventprediction
AT dennyjoshuac learningfromlongitudinaldatainelectronichealthrecordandgeneticdatatoimprovecardiovasculareventprediction
AT weiweiqi learningfromlongitudinaldatainelectronichealthrecordandgeneticdatatoimprovecardiovasculareventprediction