Cargando…
Prediction of the 1-Year Risk of Incident Lung Cancer: Prospective Study Using Electronic Health Records from the State of Maine
BACKGROUND: Lung cancer is the leading cause of cancer death worldwide. Early detection of individuals at risk of lung cancer is critical to reduce the mortality rate. OBJECTIVE: The aim of this study was to develop and validate a prospective risk prediction model to identify patients at risk of new...
Autores principales: | , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6542253/ https://www.ncbi.nlm.nih.gov/pubmed/31099339 http://dx.doi.org/10.2196/13260 |
_version_ | 1783422907115896832 |
---|---|
author | Wang, Xiaofang Zhang, Yan Hao, Shiying Zheng, Le Liao, Jiayu Ye, Chengyin Xia, Minjie Wang, Oliver Liu, Modi Weng, Ching Ho Duong, Son Q Jin, Bo Alfreds, Shaun T Stearns, Frank Kanov, Laura Sylvester, Karl G Widen, Eric McElhinney, Doff B Ling, Xuefeng B |
author_facet | Wang, Xiaofang Zhang, Yan Hao, Shiying Zheng, Le Liao, Jiayu Ye, Chengyin Xia, Minjie Wang, Oliver Liu, Modi Weng, Ching Ho Duong, Son Q Jin, Bo Alfreds, Shaun T Stearns, Frank Kanov, Laura Sylvester, Karl G Widen, Eric McElhinney, Doff B Ling, Xuefeng B |
author_sort | Wang, Xiaofang |
collection | PubMed |
description | BACKGROUND: Lung cancer is the leading cause of cancer death worldwide. Early detection of individuals at risk of lung cancer is critical to reduce the mortality rate. OBJECTIVE: The aim of this study was to develop and validate a prospective risk prediction model to identify patients at risk of new incident lung cancer within the next 1 year in the general population. METHODS: Data from individual patient electronic health records (EHRs) were extracted from the Maine Health Information Exchange network. The study population consisted of patients with at least one EHR between April 1, 2016, and March 31, 2018, who had no history of lung cancer. A retrospective cohort (N=873,598) and a prospective cohort (N=836,659) were formed for model construction and validation. An Extreme Gradient Boosting (XGBoost) algorithm was adopted to build the model. It assigned a score to each individual to quantify the probability of a new incident lung cancer diagnosis from October 1, 2016, to September 31, 2017. The model was trained with the clinical profile in the retrospective cohort from the preceding 6 months and validated with the prospective cohort to predict the risk of incident lung cancer from April 1, 2017, to March 31, 2018. RESULTS: The model had an area under the curve (AUC) of 0.881 (95% CI 0.873-0.889) in the prospective cohort. Two thresholds of 0.0045 and 0.01 were applied to the predictive scores to stratify the population into low-, medium-, and high-risk categories. The incidence of lung cancer in the high-risk category (579/53,922, 1.07%) was 7.7 times higher than that in the overall cohort (1167/836,659, 0.14%). Age, a history of pulmonary diseases and other chronic diseases, medications for mental disorders, and social disparities were found to be associated with new incident lung cancer. CONCLUSIONS: We retrospectively developed and prospectively validated an accurate risk prediction model of new incident lung cancer occurring in the next 1 year. Through statistical learning from the statewide EHR data in the preceding 6 months, our model was able to identify statewide high-risk patients, which will benefit the population health through establishment of preventive interventions or more intensive surveillance. |
format | Online Article Text |
id | pubmed-6542253 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-65422532019-06-07 Prediction of the 1-Year Risk of Incident Lung Cancer: Prospective Study Using Electronic Health Records from the State of Maine Wang, Xiaofang Zhang, Yan Hao, Shiying Zheng, Le Liao, Jiayu Ye, Chengyin Xia, Minjie Wang, Oliver Liu, Modi Weng, Ching Ho Duong, Son Q Jin, Bo Alfreds, Shaun T Stearns, Frank Kanov, Laura Sylvester, Karl G Widen, Eric McElhinney, Doff B Ling, Xuefeng B J Med Internet Res Original Paper BACKGROUND: Lung cancer is the leading cause of cancer death worldwide. Early detection of individuals at risk of lung cancer is critical to reduce the mortality rate. OBJECTIVE: The aim of this study was to develop and validate a prospective risk prediction model to identify patients at risk of new incident lung cancer within the next 1 year in the general population. METHODS: Data from individual patient electronic health records (EHRs) were extracted from the Maine Health Information Exchange network. The study population consisted of patients with at least one EHR between April 1, 2016, and March 31, 2018, who had no history of lung cancer. A retrospective cohort (N=873,598) and a prospective cohort (N=836,659) were formed for model construction and validation. An Extreme Gradient Boosting (XGBoost) algorithm was adopted to build the model. It assigned a score to each individual to quantify the probability of a new incident lung cancer diagnosis from October 1, 2016, to September 31, 2017. The model was trained with the clinical profile in the retrospective cohort from the preceding 6 months and validated with the prospective cohort to predict the risk of incident lung cancer from April 1, 2017, to March 31, 2018. RESULTS: The model had an area under the curve (AUC) of 0.881 (95% CI 0.873-0.889) in the prospective cohort. Two thresholds of 0.0045 and 0.01 were applied to the predictive scores to stratify the population into low-, medium-, and high-risk categories. The incidence of lung cancer in the high-risk category (579/53,922, 1.07%) was 7.7 times higher than that in the overall cohort (1167/836,659, 0.14%). Age, a history of pulmonary diseases and other chronic diseases, medications for mental disorders, and social disparities were found to be associated with new incident lung cancer. CONCLUSIONS: We retrospectively developed and prospectively validated an accurate risk prediction model of new incident lung cancer occurring in the next 1 year. Through statistical learning from the statewide EHR data in the preceding 6 months, our model was able to identify statewide high-risk patients, which will benefit the population health through establishment of preventive interventions or more intensive surveillance. JMIR Publications 2019-05-16 /pmc/articles/PMC6542253/ /pubmed/31099339 http://dx.doi.org/10.2196/13260 Text en ©Xiaofang Wang, Yan Zhang, Shiying Hao, Le Zheng, Jiayu Liao, Chengyin Ye, Minjie Xia, Oliver Wang, Modi Liu, Ching Ho Weng, Son Q Duong, Bo Jin, Shaun T Alfreds, Frank Stearns, Laura Kanov, Karl G Sylvester, Eric Widen, Doff B McElhinney, Xuefeng B Ling. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 16.05.2019. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Wang, Xiaofang Zhang, Yan Hao, Shiying Zheng, Le Liao, Jiayu Ye, Chengyin Xia, Minjie Wang, Oliver Liu, Modi Weng, Ching Ho Duong, Son Q Jin, Bo Alfreds, Shaun T Stearns, Frank Kanov, Laura Sylvester, Karl G Widen, Eric McElhinney, Doff B Ling, Xuefeng B Prediction of the 1-Year Risk of Incident Lung Cancer: Prospective Study Using Electronic Health Records from the State of Maine |
title | Prediction of the 1-Year Risk of Incident Lung Cancer: Prospective Study Using Electronic Health Records from the State of Maine |
title_full | Prediction of the 1-Year Risk of Incident Lung Cancer: Prospective Study Using Electronic Health Records from the State of Maine |
title_fullStr | Prediction of the 1-Year Risk of Incident Lung Cancer: Prospective Study Using Electronic Health Records from the State of Maine |
title_full_unstemmed | Prediction of the 1-Year Risk of Incident Lung Cancer: Prospective Study Using Electronic Health Records from the State of Maine |
title_short | Prediction of the 1-Year Risk of Incident Lung Cancer: Prospective Study Using Electronic Health Records from the State of Maine |
title_sort | prediction of the 1-year risk of incident lung cancer: prospective study using electronic health records from the state of maine |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6542253/ https://www.ncbi.nlm.nih.gov/pubmed/31099339 http://dx.doi.org/10.2196/13260 |
work_keys_str_mv | AT wangxiaofang predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine AT zhangyan predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine AT haoshiying predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine AT zhengle predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine AT liaojiayu predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine AT yechengyin predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine AT xiaminjie predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine AT wangoliver predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine AT liumodi predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine AT wengchingho predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine AT duongsonq predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine AT jinbo predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine AT alfredsshaunt predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine AT stearnsfrank predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine AT kanovlaura predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine AT sylvesterkarlg predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine AT wideneric predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine AT mcelhinneydoffb predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine AT lingxuefengb predictionofthe1yearriskofincidentlungcancerprospectivestudyusingelectronichealthrecordsfromthestateofmaine |