Cargando…

Patient-Level Prediction of Cardio-Cerebrovascular Events in Hypertension Using Nationwide Claims Data

BACKGROUND: Prevention and management of chronic diseases are the main goals of national health maintenance programs. Previously widely used screening tools, such as Health Risk Appraisal, are restricted in their achievement this goal due to their limitations, such as static characteristics, accessi...

Descripción completa

Detalles Bibliográficos
Autores principales: Park, Jaram, Kim, Jeong-Whun, Ryu, Borim, Heo, Eunyoung, Jung, Se Young, Yoo, Sooyoung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6396076/
https://www.ncbi.nlm.nih.gov/pubmed/30767907
http://dx.doi.org/10.2196/11757
_version_ 1783399199310610432
author Park, Jaram
Kim, Jeong-Whun
Ryu, Borim
Heo, Eunyoung
Jung, Se Young
Yoo, Sooyoung
author_facet Park, Jaram
Kim, Jeong-Whun
Ryu, Borim
Heo, Eunyoung
Jung, Se Young
Yoo, Sooyoung
author_sort Park, Jaram
collection PubMed
description BACKGROUND: Prevention and management of chronic diseases are the main goals of national health maintenance programs. Previously widely used screening tools, such as Health Risk Appraisal, are restricted in their achievement this goal due to their limitations, such as static characteristics, accessibility, and generalizability. Hypertension is one of the most important chronic diseases requiring management via the nationwide health maintenance program, and health care providers should inform patients about their risks of a complication caused by hypertension. OBJECTIVE: Our goal was to develop and compare machine learning models predicting high-risk vascular diseases for hypertensive patients so that they can manage their blood pressure based on their risk level. METHODS: We used a 12-year longitudinal dataset of the nationwide sample cohort, which contains the data of 514,866 patients and allows tracking of patients’ medical history across all health care providers in Korea (N=51,920). To ensure the generalizability of our models, we conducted an external validation using another national sample cohort dataset, comprising one million different patients, published by the National Health Insurance Service. From each dataset, we obtained the data of 74,535 and 59,738 patients with essential hypertension and developed machine learning models for predicting cardiovascular and cerebrovascular events. Six machine learning models were developed and compared for evaluating performances based on validation metrics. RESULTS: Machine learning algorithms enabled us to detect high-risk patients based on their medical history. The long short-term memory-based algorithm outperformed in the within test (F1-score=.772, external test F1-score=.613), and the random forest-based algorithm of risk prediction showed better performance over other machine learning algorithms concerning generalization (within test F1-score=.757, external test F1-score=.705). Concerning the number of features, in the within test, the long short-term memory-based algorithms outperformed regardless of the number of features. However, in the external test, the random forest-based algorithm was the best, irrespective of the number of features it encountered. CONCLUSIONS: We developed and compared machine learning models predicting high-risk vascular diseases in hypertensive patients so that they may manage their blood pressure based on their risk level. By relying on the prediction model, a government can predict high-risk patients at the nationwide level and establish health care policies in advance.
format Online
Article
Text
id pubmed-6396076
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-63960762019-03-29 Patient-Level Prediction of Cardio-Cerebrovascular Events in Hypertension Using Nationwide Claims Data Park, Jaram Kim, Jeong-Whun Ryu, Borim Heo, Eunyoung Jung, Se Young Yoo, Sooyoung J Med Internet Res Original Paper BACKGROUND: Prevention and management of chronic diseases are the main goals of national health maintenance programs. Previously widely used screening tools, such as Health Risk Appraisal, are restricted in their achievement this goal due to their limitations, such as static characteristics, accessibility, and generalizability. Hypertension is one of the most important chronic diseases requiring management via the nationwide health maintenance program, and health care providers should inform patients about their risks of a complication caused by hypertension. OBJECTIVE: Our goal was to develop and compare machine learning models predicting high-risk vascular diseases for hypertensive patients so that they can manage their blood pressure based on their risk level. METHODS: We used a 12-year longitudinal dataset of the nationwide sample cohort, which contains the data of 514,866 patients and allows tracking of patients’ medical history across all health care providers in Korea (N=51,920). To ensure the generalizability of our models, we conducted an external validation using another national sample cohort dataset, comprising one million different patients, published by the National Health Insurance Service. From each dataset, we obtained the data of 74,535 and 59,738 patients with essential hypertension and developed machine learning models for predicting cardiovascular and cerebrovascular events. Six machine learning models were developed and compared for evaluating performances based on validation metrics. RESULTS: Machine learning algorithms enabled us to detect high-risk patients based on their medical history. The long short-term memory-based algorithm outperformed in the within test (F1-score=.772, external test F1-score=.613), and the random forest-based algorithm of risk prediction showed better performance over other machine learning algorithms concerning generalization (within test F1-score=.757, external test F1-score=.705). Concerning the number of features, in the within test, the long short-term memory-based algorithms outperformed regardless of the number of features. However, in the external test, the random forest-based algorithm was the best, irrespective of the number of features it encountered. CONCLUSIONS: We developed and compared machine learning models predicting high-risk vascular diseases in hypertensive patients so that they may manage their blood pressure based on their risk level. By relying on the prediction model, a government can predict high-risk patients at the nationwide level and establish health care policies in advance. JMIR Publications 2019-02-15 /pmc/articles/PMC6396076/ /pubmed/30767907 http://dx.doi.org/10.2196/11757 Text en ©Jaram Park, Jeong-Whun Kim, Borim Ryu, Eunyoung Heo, Se Young Jung, Sooyoung Yoo. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 15.02.2019. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Park, Jaram
Kim, Jeong-Whun
Ryu, Borim
Heo, Eunyoung
Jung, Se Young
Yoo, Sooyoung
Patient-Level Prediction of Cardio-Cerebrovascular Events in Hypertension Using Nationwide Claims Data
title Patient-Level Prediction of Cardio-Cerebrovascular Events in Hypertension Using Nationwide Claims Data
title_full Patient-Level Prediction of Cardio-Cerebrovascular Events in Hypertension Using Nationwide Claims Data
title_fullStr Patient-Level Prediction of Cardio-Cerebrovascular Events in Hypertension Using Nationwide Claims Data
title_full_unstemmed Patient-Level Prediction of Cardio-Cerebrovascular Events in Hypertension Using Nationwide Claims Data
title_short Patient-Level Prediction of Cardio-Cerebrovascular Events in Hypertension Using Nationwide Claims Data
title_sort patient-level prediction of cardio-cerebrovascular events in hypertension using nationwide claims data
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6396076/
https://www.ncbi.nlm.nih.gov/pubmed/30767907
http://dx.doi.org/10.2196/11757
work_keys_str_mv AT parkjaram patientlevelpredictionofcardiocerebrovasculareventsinhypertensionusingnationwideclaimsdata
AT kimjeongwhun patientlevelpredictionofcardiocerebrovasculareventsinhypertensionusingnationwideclaimsdata
AT ryuborim patientlevelpredictionofcardiocerebrovasculareventsinhypertensionusingnationwideclaimsdata
AT heoeunyoung patientlevelpredictionofcardiocerebrovasculareventsinhypertensionusingnationwideclaimsdata
AT jungseyoung patientlevelpredictionofcardiocerebrovasculareventsinhypertensionusingnationwideclaimsdata
AT yoosooyoung patientlevelpredictionofcardiocerebrovasculareventsinhypertensionusingnationwideclaimsdata