Cargando…

Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning

BACKGROUND: As a high-prevalence health condition, hypertension is clinically costly, difficult to manage, and often leads to severe and life-threatening diseases such as cardiovascular disease (CVD) and stroke. OBJECTIVE: The aim of this study was to develop and validate prospectively a risk predic...

Descripción completa

Detalles Bibliográficos
Autores principales: Ye, Chengyin, Fu, Tianyun, Hao, Shiying, Zhang, Yan, Wang, Oliver, Jin, Bo, Xia, Minjie, Liu, Modi, Zhou, Xin, Wu, Qian, Guo, Yanting, Zhu, Chunqing, Li, Yu-Ming, Culver, Devore S, Alfreds, Shaun T, Stearns, Frank, Sylvester, Karl G, Widen, Eric, McElhinney, Doff, Ling, Xuefeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5811646/
https://www.ncbi.nlm.nih.gov/pubmed/29382633
http://dx.doi.org/10.2196/jmir.9268
_version_ 1783299903118639104
author Ye, Chengyin
Fu, Tianyun
Hao, Shiying
Zhang, Yan
Wang, Oliver
Jin, Bo
Xia, Minjie
Liu, Modi
Zhou, Xin
Wu, Qian
Guo, Yanting
Zhu, Chunqing
Li, Yu-Ming
Culver, Devore S
Alfreds, Shaun T
Stearns, Frank
Sylvester, Karl G
Widen, Eric
McElhinney, Doff
Ling, Xuefeng
author_facet Ye, Chengyin
Fu, Tianyun
Hao, Shiying
Zhang, Yan
Wang, Oliver
Jin, Bo
Xia, Minjie
Liu, Modi
Zhou, Xin
Wu, Qian
Guo, Yanting
Zhu, Chunqing
Li, Yu-Ming
Culver, Devore S
Alfreds, Shaun T
Stearns, Frank
Sylvester, Karl G
Widen, Eric
McElhinney, Doff
Ling, Xuefeng
author_sort Ye, Chengyin
collection PubMed
description BACKGROUND: As a high-prevalence health condition, hypertension is clinically costly, difficult to manage, and often leads to severe and life-threatening diseases such as cardiovascular disease (CVD) and stroke. OBJECTIVE: The aim of this study was to develop and validate prospectively a risk prediction model of incident essential hypertension within the following year. METHODS: Data from individual patient electronic health records (EHRs) were extracted from the Maine Health Information Exchange network. Retrospective (N=823,627, calendar year 2013) and prospective (N=680,810, calendar year 2014) cohorts were formed. A machine learning algorithm, XGBoost, was adopted in the process of feature selection and model building. It generated an ensemble of classification trees and assigned a final predictive risk score to each individual. RESULTS: The 1-year incident hypertension risk model attained areas under the curve (AUCs) of 0.917 and 0.870 in the retrospective and prospective cohorts, respectively. Risk scores were calculated and stratified into five risk categories, with 4526 out of 381,544 patients (1.19%) in the lowest risk category (score 0-0.05) and 21,050 out of 41,329 patients (50.93%) in the highest risk category (score 0.4-1) receiving a diagnosis of incident hypertension in the following 1 year. Type 2 diabetes, lipid disorders, CVDs, mental illness, clinical utilization indicators, and socioeconomic determinants were recognized as driving or associated features of incident essential hypertension. The very high risk population mainly comprised elderly (age>50 years) individuals with multiple chronic conditions, especially those receiving medications for mental disorders. Disparities were also found in social determinants, including some community-level factors associated with higher risk and others that were protective against hypertension. CONCLUSIONS: With statewide EHR datasets, our study prospectively validated an accurate 1-year risk prediction model for incident essential hypertension. Our real-time predictive analytic model has been deployed in the state of Maine, providing implications in interventions for hypertension and related diseases and hopefully enhancing hypertension care.
format Online
Article
Text
id pubmed-5811646
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-58116462018-02-16 Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning Ye, Chengyin Fu, Tianyun Hao, Shiying Zhang, Yan Wang, Oliver Jin, Bo Xia, Minjie Liu, Modi Zhou, Xin Wu, Qian Guo, Yanting Zhu, Chunqing Li, Yu-Ming Culver, Devore S Alfreds, Shaun T Stearns, Frank Sylvester, Karl G Widen, Eric McElhinney, Doff Ling, Xuefeng J Med Internet Res Original Paper BACKGROUND: As a high-prevalence health condition, hypertension is clinically costly, difficult to manage, and often leads to severe and life-threatening diseases such as cardiovascular disease (CVD) and stroke. OBJECTIVE: The aim of this study was to develop and validate prospectively a risk prediction model of incident essential hypertension within the following year. METHODS: Data from individual patient electronic health records (EHRs) were extracted from the Maine Health Information Exchange network. Retrospective (N=823,627, calendar year 2013) and prospective (N=680,810, calendar year 2014) cohorts were formed. A machine learning algorithm, XGBoost, was adopted in the process of feature selection and model building. It generated an ensemble of classification trees and assigned a final predictive risk score to each individual. RESULTS: The 1-year incident hypertension risk model attained areas under the curve (AUCs) of 0.917 and 0.870 in the retrospective and prospective cohorts, respectively. Risk scores were calculated and stratified into five risk categories, with 4526 out of 381,544 patients (1.19%) in the lowest risk category (score 0-0.05) and 21,050 out of 41,329 patients (50.93%) in the highest risk category (score 0.4-1) receiving a diagnosis of incident hypertension in the following 1 year. Type 2 diabetes, lipid disorders, CVDs, mental illness, clinical utilization indicators, and socioeconomic determinants were recognized as driving or associated features of incident essential hypertension. The very high risk population mainly comprised elderly (age>50 years) individuals with multiple chronic conditions, especially those receiving medications for mental disorders. Disparities were also found in social determinants, including some community-level factors associated with higher risk and others that were protective against hypertension. CONCLUSIONS: With statewide EHR datasets, our study prospectively validated an accurate 1-year risk prediction model for incident essential hypertension. Our real-time predictive analytic model has been deployed in the state of Maine, providing implications in interventions for hypertension and related diseases and hopefully enhancing hypertension care. JMIR Publications 2018-01-30 /pmc/articles/PMC5811646/ /pubmed/29382633 http://dx.doi.org/10.2196/jmir.9268 Text en ©Chengyin Ye, Tianyun Fu, Shiying Hao, Yan Zhang, Oliver Wang, Bo Jin, Minjie Xia, Modi Liu, Xin Zhou, Qian Wu, Yanting Guo, Chunqing Zhu, Yu-Ming Li, Devore S Culver, Shaun T Alfreds, Frank Stearns, Karl G Sylvester, Eric Widen, Doff McElhinney, Xuefeng Ling. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 30.01.2018. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Ye, Chengyin
Fu, Tianyun
Hao, Shiying
Zhang, Yan
Wang, Oliver
Jin, Bo
Xia, Minjie
Liu, Modi
Zhou, Xin
Wu, Qian
Guo, Yanting
Zhu, Chunqing
Li, Yu-Ming
Culver, Devore S
Alfreds, Shaun T
Stearns, Frank
Sylvester, Karl G
Widen, Eric
McElhinney, Doff
Ling, Xuefeng
Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning
title Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning
title_full Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning
title_fullStr Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning
title_full_unstemmed Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning
title_short Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning
title_sort prediction of incident hypertension within the next year: prospective study using statewide electronic health records and machine learning
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5811646/
https://www.ncbi.nlm.nih.gov/pubmed/29382633
http://dx.doi.org/10.2196/jmir.9268
work_keys_str_mv AT yechengyin predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning
AT futianyun predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning
AT haoshiying predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning
AT zhangyan predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning
AT wangoliver predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning
AT jinbo predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning
AT xiaminjie predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning
AT liumodi predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning
AT zhouxin predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning
AT wuqian predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning
AT guoyanting predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning
AT zhuchunqing predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning
AT liyuming predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning
AT culverdevores predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning
AT alfredsshaunt predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning
AT stearnsfrank predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning
AT sylvesterkarlg predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning
AT wideneric predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning
AT mcelhinneydoff predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning
AT lingxuefeng predictionofincidenthypertensionwithinthenextyearprospectivestudyusingstatewideelectronichealthrecordsandmachinelearning