Cargando…

Prediction of the 1-Year Risk of Incident Lung Cancer: Prospective Study Using Electronic Health Records from the State of Maine

BACKGROUND: Lung cancer is the leading cause of cancer death worldwide. Early detection of individuals at risk of lung cancer is critical to reduce the mortality rate. OBJECTIVE: The aim of this study was to develop and validate a prospective risk prediction model to identify patients at risk of new...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Xiaofang, Zhang, Yan, Hao, Shiying, Zheng, Le, Liao, Jiayu, Ye, Chengyin, Xia, Minjie, Wang, Oliver, Liu, Modi, Weng, Ching Ho, Duong, Son Q, Jin, Bo, Alfreds, Shaun T, Stearns, Frank, Kanov, Laura, Sylvester, Karl G, Widen, Eric, McElhinney, Doff B, Ling, Xuefeng B
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6542253/
https://www.ncbi.nlm.nih.gov/pubmed/31099339
http://dx.doi.org/10.2196/13260
_version_ 1783422907115896832
author Wang, Xiaofang
Zhang, Yan
Hao, Shiying
Zheng, Le
Liao, Jiayu
Ye, Chengyin
Xia, Minjie
Wang, Oliver
Liu, Modi
Weng, Ching Ho
Duong, Son Q
Jin, Bo
Alfreds, Shaun T
Stearns, Frank
Kanov, Laura
Sylvester, Karl G
Widen, Eric
McElhinney, Doff B
Ling, Xuefeng B
author_facet Wang, Xiaofang
Zhang, Yan
Hao, Shiying
Zheng, Le
Liao, Jiayu
Ye, Chengyin
Xia, Minjie
Wang, Oliver
Liu, Modi
Weng, Ching Ho
Duong, Son Q
Jin, Bo
Alfreds, Shaun T
Stearns, Frank
Kanov, Laura
Sylvester, Karl G
Widen, Eric
McElhinney, Doff B
Ling, Xuefeng B
author_sort Wang, Xiaofang
collection PubMed
description BACKGROUND: Lung cancer is the leading cause of cancer death worldwide. Early detection of individuals at risk of lung cancer is critical to reduce the mortality rate. OBJECTIVE: The aim of this study was to develop and validate a prospective risk prediction model to identify patients at risk of new incident lung cancer within the next 1 year in the general population. METHODS: Data from individual patient electronic health records (EHRs) were extracted from the Maine Health Information Exchange network. The study population consisted of patients with at least one EHR between April 1, 2016, and March 31, 2018, who had no history of lung cancer. A retrospective cohort (N=873,598) and a prospective cohort (N=836,659) were formed for model construction and validation. An Extreme Gradient Boosting (XGBoost) algorithm was adopted to build the model. It assigned a score to each individual to quantify the probability of a new incident lung cancer diagnosis from October 1, 2016, to September 31, 2017. The model was trained with the clinical profile in the retrospective cohort from the preceding 6 months and validated with the prospective cohort to predict the risk of incident lung cancer from April 1, 2017, to March 31, 2018. RESULTS: The model had an area under the curve (AUC) of 0.881 (95% CI 0.873-0.889) in the prospective cohort. Two thresholds of 0.0045 and 0.01 were applied to the predictive scores to stratify the population into low-, medium-, and high-risk categories. The incidence of lung cancer in the high-risk category (579/53,922, 1.07%) was 7.7 times higher than that in the overall cohort (1167/836,659, 0.14%). Age, a history of pulmonary diseases and other chronic diseases, medications for mental disorders, and social disparities were found to be associated with new incident lung cancer. CONCLUSIONS: We retrospectively developed and prospectively validated an accurate risk prediction model of new incident lung cancer occurring in the next 1 year. Through statistical learning from the statewide EHR data in the preceding 6 months, our model was able to identify statewide high-risk patients, which will benefit the population health through establishment of preventive interventions or more intensive surveillance.
format Online
Article
Text
id pubmed-6542253
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-65422532019-06-07 Prediction of the 1-Year Risk of Incident Lung Cancer: Prospective Study Using Electronic Health Records from the State of Maine Wang, Xiaofang Zhang, Yan Hao, Shiying Zheng, Le Liao, Jiayu Ye, Chengyin Xia, Minjie Wang, Oliver Liu, Modi Weng, Ching Ho Duong, Son Q Jin, Bo Alfreds, Shaun T Stearns, Frank Kanov, Laura Sylvester, Karl G Widen, Eric McElhinney, Doff B Ling, Xuefeng B J Med Internet Res Original Paper BACKGROUND: Lung cancer is the leading cause of cancer death worldwide. Early detection of individuals at risk of lung cancer is critical to reduce the mortality rate. OBJECTIVE: The aim of this study was to develop and validate a prospective risk prediction model to identify patients at risk of new incident lung cancer within the next 1 year in the general population. METHODS: Data from individual patient electronic health records (EHRs) were extracted from the Maine Health Information Exchange network. The study population consisted of patients with at least one EHR between April 1, 2016, and March 31, 2018, who had no history of lung cancer. A retrospective cohort (N=873,598) and a prospective cohort (N=836,659) were formed for model construction and validation. An Extreme Gradient Boosting (XGBoost) algorithm was adopted to build the model. It assigned a score to each individual to quantify the probability of a new incident lung cancer diagnosis from October 1, 2016, to September 31, 2017. The model was trained with the clinical profile in the retrospective cohort from the preceding 6 months and validated with the prospective cohort to predict the risk of incident lung cancer from April 1, 2017, to March 31, 2018. RESULTS: The model had an area under the curve (AUC) of 0.881 (95% CI 0.873-0.889) in the prospective cohort. Two thresholds of 0.0045 and 0.01 were applied to the predictive scores to stratify the population into low-, medium-, and high-risk categories. The incidence of lung cancer in the high-risk category (579/53,922, 1.07%) was 7.7 times higher than that in the overall cohort (1167/836,659, 0.14%). Age, a history of pulmonary diseases and other chronic diseases, medications for mental disorders, and social disparities were found to be associated with new incident lung cancer. CONCLUSIONS: We retrospectively developed and prospectively validated an accurate risk prediction model of new incident lung cancer occurring in the next 1 year. Through statistical learning from the statewide EHR data in the preceding 6 months, our model was able to identify statewide high-risk patients, which will benefit the population health through establishment of preventive interventions or more intensive surveillance. JMIR Publications 2019-05-16 /pmc/articles/PMC6542253/ /pubmed/31099339 http://dx.doi.org/10.2196/13260 Text en ©Xiaofang Wang, Yan Zhang, Shiying Hao, Le Zheng, Jiayu Liao, Chengyin Ye, Minjie Xia, Oliver Wang, Modi Liu, Ching Ho Weng, Son Q Duong, Bo Jin, Shaun T Alfreds, Frank Stearns, Laura Kanov, Karl G Sylvester, Eric Widen, Doff B McElhinney, Xuefeng B Ling. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 16.05.2019. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Wang, Xiaofang
Zhang, Yan
Hao, Shiying
Zheng, Le
Liao, Jiayu
Ye, Chengyin
Xia, Minjie
Wang, Oliver
Liu, Modi
Weng, Ching Ho
Duong, Son Q
Jin, Bo
Alfreds, Shaun T
Stearns, Frank
Kanov, Laura
Sylvester, Karl G
Widen, Eric
McElhinney, Doff B
Ling, Xuefeng B
Prediction of the 1-Year Risk of Incident Lung Cancer: Prospective Study Using Electronic Health Records from the State of Maine
title Prediction of the 1-Year Risk of Incident Lung Cancer: Prospective Study Using Electronic Health Records from the State of Maine
title_full Prediction of the 1-Year Risk of Incident Lung Cancer: Prospective Study Using Electronic Health Records from the State of Maine
title_fullStr Prediction of the 1-Year Risk of Incident Lung Cancer: Prospective Study Using Electronic Health Records from the State of Maine
title_full_unstemmed Prediction of the 1-Year Risk of Incident Lung Cancer: Prospective Study Using Electronic Health Records from the State of Maine
title_short Prediction of the 1-Year Risk of Incident Lung Cancer: Prospective Study Using Electronic Health Records from the State of Maine
title_sort prediction of the 1-year risk of incident lung cancer: prospective study using electronic health records from the state of maine
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6542253/
https://www.ncbi.nlm.nih.gov/pubmed/31099339
http://dx.doi.org/10.2196/13260
work_keys_str_mv AT wangxiaofang predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine
AT zhangyan predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine
AT haoshiying predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine
AT zhengle predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine
AT liaojiayu predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine
AT yechengyin predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine
AT xiaminjie predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine
AT wangoliver predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine
AT liumodi predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine
AT wengchingho predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine
AT duongsonq predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine
AT jinbo predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine
AT alfredsshaunt predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine
AT stearnsfrank predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine
AT kanovlaura predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine
AT sylvesterkarlg predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine
AT wideneric predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine
AT mcelhinneydoffb predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine
AT lingxuefengb predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine