Cargando…

Development and verification of prediction models for preventing cardiovascular diseases

OBJECTIVES: Cardiovascular disease (CVD) is one of the major causes of death worldwide. For improved accuracy of CVD prediction, risk classification was performed using national time-series health examination data. The data offers an opportunity to access deep learning (RNN-LSTM), which is widely kn...

Descripción completa

Detalles Bibliográficos
Autores principales: Sung, Ji Min, Cho, In-Jeong, Sung, David, Kim, Sunhee, Kim, Hyeon Chang, Chae, Myeong-Hun, Kavousi, Maryam, Rueda-Ochoa, Oscar L., Ikram, M. Arfan, Franco, Oscar H., Chang, Hyuk-Jae
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6752799/
https://www.ncbi.nlm.nih.gov/pubmed/31536581
http://dx.doi.org/10.1371/journal.pone.0222809
_version_ 1783452788650409984
author Sung, Ji Min
Cho, In-Jeong
Sung, David
Kim, Sunhee
Kim, Hyeon Chang
Chae, Myeong-Hun
Kavousi, Maryam
Rueda-Ochoa, Oscar L.
Ikram, M. Arfan
Franco, Oscar H.
Chang, Hyuk-Jae
author_facet Sung, Ji Min
Cho, In-Jeong
Sung, David
Kim, Sunhee
Kim, Hyeon Chang
Chae, Myeong-Hun
Kavousi, Maryam
Rueda-Ochoa, Oscar L.
Ikram, M. Arfan
Franco, Oscar H.
Chang, Hyuk-Jae
author_sort Sung, Ji Min
collection PubMed
description OBJECTIVES: Cardiovascular disease (CVD) is one of the major causes of death worldwide. For improved accuracy of CVD prediction, risk classification was performed using national time-series health examination data. The data offers an opportunity to access deep learning (RNN-LSTM), which is widely known as an outstanding algorithm for analyzing time-series datasets. The objective of this study was to show the improved accuracy of deep learning by comparing the performance of a Cox hazard regression and RNN-LSTM based on survival analysis. METHODS AND FINDINGS: We selected 361,239 subjects (age 40 to 79 years) with more than two health examination records from 2002–2006 using the National Health Insurance System-National Health Screening Cohort (NHIS-HEALS). The average number of health screenings (from 2002–2013) used in the analysis was 2.9 ± 1.0. Two CVD prediction models were developed from the NHIS-HEALS data: a Cox hazard regression model and a deep learning model. In an internal validation of the NHIS-HEALS dataset, the Cox regression model showed a highest time-dependent area under the curve (AUC) of 0.79 (95% CI 0.70 to 0.87) for in females and 0.75 (95% CI 0.70 to 0.80) in males at 2 years. The deep learning model showed a highest time-dependent AUC of 0.94 (95% CI 0.91 to 0.97) for in females and 0.96 (95% CI 0.95 to 0.97) in males at 2 years. Layer-wise Relevance Propagation (LRP) revealed that age was the variable that had the greatest effect on CVD, followed by systolic blood pressure (SBP) and diastolic blood pressure (DBP), in that order. CONCLUSION: The performance of the deep learning model for predicting CVD occurrences was better than that of the Cox regression model. In addition, it was confirmed that the known risk factors shown to be important by previous clinical studies were extracted from the study results using LRP.
format Online
Article
Text
id pubmed-6752799
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-67527992019-09-27 Development and verification of prediction models for preventing cardiovascular diseases Sung, Ji Min Cho, In-Jeong Sung, David Kim, Sunhee Kim, Hyeon Chang Chae, Myeong-Hun Kavousi, Maryam Rueda-Ochoa, Oscar L. Ikram, M. Arfan Franco, Oscar H. Chang, Hyuk-Jae PLoS One Research Article OBJECTIVES: Cardiovascular disease (CVD) is one of the major causes of death worldwide. For improved accuracy of CVD prediction, risk classification was performed using national time-series health examination data. The data offers an opportunity to access deep learning (RNN-LSTM), which is widely known as an outstanding algorithm for analyzing time-series datasets. The objective of this study was to show the improved accuracy of deep learning by comparing the performance of a Cox hazard regression and RNN-LSTM based on survival analysis. METHODS AND FINDINGS: We selected 361,239 subjects (age 40 to 79 years) with more than two health examination records from 2002–2006 using the National Health Insurance System-National Health Screening Cohort (NHIS-HEALS). The average number of health screenings (from 2002–2013) used in the analysis was 2.9 ± 1.0. Two CVD prediction models were developed from the NHIS-HEALS data: a Cox hazard regression model and a deep learning model. In an internal validation of the NHIS-HEALS dataset, the Cox regression model showed a highest time-dependent area under the curve (AUC) of 0.79 (95% CI 0.70 to 0.87) for in females and 0.75 (95% CI 0.70 to 0.80) in males at 2 years. The deep learning model showed a highest time-dependent AUC of 0.94 (95% CI 0.91 to 0.97) for in females and 0.96 (95% CI 0.95 to 0.97) in males at 2 years. Layer-wise Relevance Propagation (LRP) revealed that age was the variable that had the greatest effect on CVD, followed by systolic blood pressure (SBP) and diastolic blood pressure (DBP), in that order. CONCLUSION: The performance of the deep learning model for predicting CVD occurrences was better than that of the Cox regression model. In addition, it was confirmed that the known risk factors shown to be important by previous clinical studies were extracted from the study results using LRP. Public Library of Science 2019-09-19 /pmc/articles/PMC6752799/ /pubmed/31536581 http://dx.doi.org/10.1371/journal.pone.0222809 Text en © 2019 Sung et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Sung, Ji Min
Cho, In-Jeong
Sung, David
Kim, Sunhee
Kim, Hyeon Chang
Chae, Myeong-Hun
Kavousi, Maryam
Rueda-Ochoa, Oscar L.
Ikram, M. Arfan
Franco, Oscar H.
Chang, Hyuk-Jae
Development and verification of prediction models for preventing cardiovascular diseases
title Development and verification of prediction models for preventing cardiovascular diseases
title_full Development and verification of prediction models for preventing cardiovascular diseases
title_fullStr Development and verification of prediction models for preventing cardiovascular diseases
title_full_unstemmed Development and verification of prediction models for preventing cardiovascular diseases
title_short Development and verification of prediction models for preventing cardiovascular diseases
title_sort development and verification of prediction models for preventing cardiovascular diseases
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6752799/
https://www.ncbi.nlm.nih.gov/pubmed/31536581
http://dx.doi.org/10.1371/journal.pone.0222809
work_keys_str_mv AT sungjimin developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT choinjeong developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT sungdavid developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT kimsunhee developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT kimhyeonchang developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT chaemyeonghun developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT kavousimaryam developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT ruedaochoaoscarl developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT ikrammarfan developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT francooscarh developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT changhyukjae developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases