Cargando…
Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning
BACKGROUND: As a high-prevalence health condition, hypertension is clinically costly, difficult to manage, and often leads to severe and life-threatening diseases such as cardiovascular disease (CVD) and stroke. OBJECTIVE: The aim of this study was to develop and validate prospectively a risk predic...
Autores principales: | , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5811646/ https://www.ncbi.nlm.nih.gov/pubmed/29382633 http://dx.doi.org/10.2196/jmir.9268 |
_version_ | 1783299903118639104 |
---|---|
author | Ye, Chengyin Fu, Tianyun Hao, Shiying Zhang, Yan Wang, Oliver Jin, Bo Xia, Minjie Liu, Modi Zhou, Xin Wu, Qian Guo, Yanting Zhu, Chunqing Li, Yu-Ming Culver, Devore S Alfreds, Shaun T Stearns, Frank Sylvester, Karl G Widen, Eric McElhinney, Doff Ling, Xuefeng |
author_facet | Ye, Chengyin Fu, Tianyun Hao, Shiying Zhang, Yan Wang, Oliver Jin, Bo Xia, Minjie Liu, Modi Zhou, Xin Wu, Qian Guo, Yanting Zhu, Chunqing Li, Yu-Ming Culver, Devore S Alfreds, Shaun T Stearns, Frank Sylvester, Karl G Widen, Eric McElhinney, Doff Ling, Xuefeng |
author_sort | Ye, Chengyin |
collection | PubMed |
description | BACKGROUND: As a high-prevalence health condition, hypertension is clinically costly, difficult to manage, and often leads to severe and life-threatening diseases such as cardiovascular disease (CVD) and stroke. OBJECTIVE: The aim of this study was to develop and validate prospectively a risk prediction model of incident essential hypertension within the following year. METHODS: Data from individual patient electronic health records (EHRs) were extracted from the Maine Health Information Exchange network. Retrospective (N=823,627, calendar year 2013) and prospective (N=680,810, calendar year 2014) cohorts were formed. A machine learning algorithm, XGBoost, was adopted in the process of feature selection and model building. It generated an ensemble of classification trees and assigned a final predictive risk score to each individual. RESULTS: The 1-year incident hypertension risk model attained areas under the curve (AUCs) of 0.917 and 0.870 in the retrospective and prospective cohorts, respectively. Risk scores were calculated and stratified into five risk categories, with 4526 out of 381,544 patients (1.19%) in the lowest risk category (score 0-0.05) and 21,050 out of 41,329 patients (50.93%) in the highest risk category (score 0.4-1) receiving a diagnosis of incident hypertension in the following 1 year. Type 2 diabetes, lipid disorders, CVDs, mental illness, clinical utilization indicators, and socioeconomic determinants were recognized as driving or associated features of incident essential hypertension. The very high risk population mainly comprised elderly (age>50 years) individuals with multiple chronic conditions, especially those receiving medications for mental disorders. Disparities were also found in social determinants, including some community-level factors associated with higher risk and others that were protective against hypertension. CONCLUSIONS: With statewide EHR datasets, our study prospectively validated an accurate 1-year risk prediction model for incident essential hypertension. Our real-time predictive analytic model has been deployed in the state of Maine, providing implications in interventions for hypertension and related diseases and hopefully enhancing hypertension care. |
format | Online Article Text |
id | pubmed-5811646 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-58116462018-02-16 Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning Ye, Chengyin Fu, Tianyun Hao, Shiying Zhang, Yan Wang, Oliver Jin, Bo Xia, Minjie Liu, Modi Zhou, Xin Wu, Qian Guo, Yanting Zhu, Chunqing Li, Yu-Ming Culver, Devore S Alfreds, Shaun T Stearns, Frank Sylvester, Karl G Widen, Eric McElhinney, Doff Ling, Xuefeng J Med Internet Res Original Paper BACKGROUND: As a high-prevalence health condition, hypertension is clinically costly, difficult to manage, and often leads to severe and life-threatening diseases such as cardiovascular disease (CVD) and stroke. OBJECTIVE: The aim of this study was to develop and validate prospectively a risk prediction model of incident essential hypertension within the following year. METHODS: Data from individual patient electronic health records (EHRs) were extracted from the Maine Health Information Exchange network. Retrospective (N=823,627, calendar year 2013) and prospective (N=680,810, calendar year 2014) cohorts were formed. A machine learning algorithm, XGBoost, was adopted in the process of feature selection and model building. It generated an ensemble of classification trees and assigned a final predictive risk score to each individual. RESULTS: The 1-year incident hypertension risk model attained areas under the curve (AUCs) of 0.917 and 0.870 in the retrospective and prospective cohorts, respectively. Risk scores were calculated and stratified into five risk categories, with 4526 out of 381,544 patients (1.19%) in the lowest risk category (score 0-0.05) and 21,050 out of 41,329 patients (50.93%) in the highest risk category (score 0.4-1) receiving a diagnosis of incident hypertension in the following 1 year. Type 2 diabetes, lipid disorders, CVDs, mental illness, clinical utilization indicators, and socioeconomic determinants were recognized as driving or associated features of incident essential hypertension. The very high risk population mainly comprised elderly (age>50 years) individuals with multiple chronic conditions, especially those receiving medications for mental disorders. Disparities were also found in social determinants, including some community-level factors associated with higher risk and others that were protective against hypertension. CONCLUSIONS: With statewide EHR datasets, our study prospectively validated an accurate 1-year risk prediction model for incident essential hypertension. Our real-time predictive analytic model has been deployed in the state of Maine, providing implications in interventions for hypertension and related diseases and hopefully enhancing hypertension care. JMIR Publications 2018-01-30 /pmc/articles/PMC5811646/ /pubmed/29382633 http://dx.doi.org/10.2196/jmir.9268 Text en ©Chengyin Ye, Tianyun Fu, Shiying Hao, Yan Zhang, Oliver Wang, Bo Jin, Minjie Xia, Modi Liu, Xin Zhou, Qian Wu, Yanting Guo, Chunqing Zhu, Yu-Ming Li, Devore S Culver, Shaun T Alfreds, Frank Stearns, Karl G Sylvester, Eric Widen, Doff McElhinney, Xuefeng Ling. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 30.01.2018. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Ye, Chengyin Fu, Tianyun Hao, Shiying Zhang, Yan Wang, Oliver Jin, Bo Xia, Minjie Liu, Modi Zhou, Xin Wu, Qian Guo, Yanting Zhu, Chunqing Li, Yu-Ming Culver, Devore S Alfreds, Shaun T Stearns, Frank Sylvester, Karl G Widen, Eric McElhinney, Doff Ling, Xuefeng Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning |
title | Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning |
title_full | Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning |
title_fullStr | Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning |
title_full_unstemmed | Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning |
title_short | Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning |
title_sort | prediction of incident hypertension within the next year: prospective study using statewide electronic health records and machine learning |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5811646/ https://www.ncbi.nlm.nih.gov/pubmed/29382633 http://dx.doi.org/10.2196/jmir.9268 |
work_keys_str_mv | AT yechengyin predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning AT futianyun predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning AT haoshiying predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning AT zhangyan predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning AT wangoliver predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning AT jinbo predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning AT xiaminjie predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning AT liumodi predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning AT zhouxin predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning AT wuqian predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning AT guoyanting predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning AT zhuchunqing predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning AT liyuming predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning AT culverdevores predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning AT alfredsshaunt predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning AT stearnsfrank predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning AT sylvesterkarlg predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning AT wideneric predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning AT mcelhinneydoff predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning AT lingxuefeng predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning |