Cargando…

Machine learning approaches to predict peak demand days of cardiovascular admissions considering environmental exposure

BACKGROUND: Accumulating evidence has linked environmental exposure, such as ambient air pollution and meteorological factors, to the development and severity of cardiovascular diseases (CVDs), resulting in increased healthcare demand. Effective prediction of demand for healthcare services, particul...

Descripción completa

Detalles Bibliográficos
Autores principales: Qiu, Hang, Luo, Lin, Su, Ziqi, Zhou, Li, Wang, Liya, Chen, Yucheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7195717/
https://www.ncbi.nlm.nih.gov/pubmed/32357880
http://dx.doi.org/10.1186/s12911-020-1101-8
_version_ 1783528593881563136
author Qiu, Hang
Luo, Lin
Su, Ziqi
Zhou, Li
Wang, Liya
Chen, Yucheng
author_facet Qiu, Hang
Luo, Lin
Su, Ziqi
Zhou, Li
Wang, Liya
Chen, Yucheng
author_sort Qiu, Hang
collection PubMed
description BACKGROUND: Accumulating evidence has linked environmental exposure, such as ambient air pollution and meteorological factors, to the development and severity of cardiovascular diseases (CVDs), resulting in increased healthcare demand. Effective prediction of demand for healthcare services, particularly those associated with peak events of CVDs, can be useful in optimizing the allocation of medical resources. However, few studies have attempted to adopt machine learning approaches with excellent predictive abilities to forecast the healthcare demand for CVDs. This study aims to develop and compare several machine learning models in predicting the peak demand days of CVDs admissions using the hospital admissions data, air quality data and meteorological data in Chengdu, China from 2015 to 2017. METHODS: Six machine learning algorithms, including logistic regression (LR), support vector machine (SVM), artificial neural network (ANN), random forest (RF), extreme gradient boosting (XGBoost), and light gradient boosting machine (LightGBM) were applied to build the predictive models with a unique feature set. The area under a receiver operating characteristic curve (AUC), logarithmic loss function, accuracy, sensitivity, specificity, precision, and F1 score were used to evaluate the predictive performances of the six models. RESULTS: The LightGBM model exhibited the highest AUC (0.940, 95% CI: 0.900–0.980), which was significantly higher than that of LR (0.842, 95% CI: 0.783–0.901), SVM (0.834, 95% CI: 0.774–0.894) and ANN (0.890, 95% CI: 0.836–0.944), but did not differ significantly from that of RF (0.926, 95% CI: 0.879–0.974) and XGBoost (0.930, 95% CI: 0.878–0.982). In addition, the LightGBM has the optimal logarithmic loss function (0.218), accuracy (91.3%), specificity (94.1%), precision (0.695), and F1 score (0.725). Feature importance identification indicated that the contribution rate of meteorological conditions and air pollutants for the prediction was 32 and 43%, respectively. CONCLUSION: This study suggests that ensemble learning models, especially the LightGBM model, can be used to effectively predict the peak events of CVDs admissions, and therefore could be a very useful decision-making tool for medical resource management.
format Online
Article
Text
id pubmed-7195717
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-71957172020-05-06 Machine learning approaches to predict peak demand days of cardiovascular admissions considering environmental exposure Qiu, Hang Luo, Lin Su, Ziqi Zhou, Li Wang, Liya Chen, Yucheng BMC Med Inform Decis Mak Research Article BACKGROUND: Accumulating evidence has linked environmental exposure, such as ambient air pollution and meteorological factors, to the development and severity of cardiovascular diseases (CVDs), resulting in increased healthcare demand. Effective prediction of demand for healthcare services, particularly those associated with peak events of CVDs, can be useful in optimizing the allocation of medical resources. However, few studies have attempted to adopt machine learning approaches with excellent predictive abilities to forecast the healthcare demand for CVDs. This study aims to develop and compare several machine learning models in predicting the peak demand days of CVDs admissions using the hospital admissions data, air quality data and meteorological data in Chengdu, China from 2015 to 2017. METHODS: Six machine learning algorithms, including logistic regression (LR), support vector machine (SVM), artificial neural network (ANN), random forest (RF), extreme gradient boosting (XGBoost), and light gradient boosting machine (LightGBM) were applied to build the predictive models with a unique feature set. The area under a receiver operating characteristic curve (AUC), logarithmic loss function, accuracy, sensitivity, specificity, precision, and F1 score were used to evaluate the predictive performances of the six models. RESULTS: The LightGBM model exhibited the highest AUC (0.940, 95% CI: 0.900–0.980), which was significantly higher than that of LR (0.842, 95% CI: 0.783–0.901), SVM (0.834, 95% CI: 0.774–0.894) and ANN (0.890, 95% CI: 0.836–0.944), but did not differ significantly from that of RF (0.926, 95% CI: 0.879–0.974) and XGBoost (0.930, 95% CI: 0.878–0.982). In addition, the LightGBM has the optimal logarithmic loss function (0.218), accuracy (91.3%), specificity (94.1%), precision (0.695), and F1 score (0.725). Feature importance identification indicated that the contribution rate of meteorological conditions and air pollutants for the prediction was 32 and 43%, respectively. CONCLUSION: This study suggests that ensemble learning models, especially the LightGBM model, can be used to effectively predict the peak events of CVDs admissions, and therefore could be a very useful decision-making tool for medical resource management. BioMed Central 2020-05-01 /pmc/articles/PMC7195717/ /pubmed/32357880 http://dx.doi.org/10.1186/s12911-020-1101-8 Text en © The Author(s). 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Qiu, Hang
Luo, Lin
Su, Ziqi
Zhou, Li
Wang, Liya
Chen, Yucheng
Machine learning approaches to predict peak demand days of cardiovascular admissions considering environmental exposure
title Machine learning approaches to predict peak demand days of cardiovascular admissions considering environmental exposure
title_full Machine learning approaches to predict peak demand days of cardiovascular admissions considering environmental exposure
title_fullStr Machine learning approaches to predict peak demand days of cardiovascular admissions considering environmental exposure
title_full_unstemmed Machine learning approaches to predict peak demand days of cardiovascular admissions considering environmental exposure
title_short Machine learning approaches to predict peak demand days of cardiovascular admissions considering environmental exposure
title_sort machine learning approaches to predict peak demand days of cardiovascular admissions considering environmental exposure
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7195717/
https://www.ncbi.nlm.nih.gov/pubmed/32357880
http://dx.doi.org/10.1186/s12911-020-1101-8
work_keys_str_mv AT qiuhang machinelearningapproachestopredictpeakdemanddaysofcardiovascularadmissionsconsideringenvironmentalexposure
AT luolin machinelearningapproachestopredictpeakdemanddaysofcardiovascularadmissionsconsideringenvironmentalexposure
AT suziqi machinelearningapproachestopredictpeakdemanddaysofcardiovascularadmissionsconsideringenvironmentalexposure
AT zhouli machinelearningapproachestopredictpeakdemanddaysofcardiovascularadmissionsconsideringenvironmentalexposure
AT wangliya machinelearningapproachestopredictpeakdemanddaysofcardiovascularadmissionsconsideringenvironmentalexposure
AT chenyucheng machinelearningapproachestopredictpeakdemanddaysofcardiovascularadmissionsconsideringenvironmentalexposure