Cargando…
Predicting Coronavirus Disease 2019 Infection Risk and Related Risk Drivers in Nursing Homes: A Machine Learning Approach
OBJECTIVE: Inform coronavirus disease 2019 (COVID-19) infection prevention measures by identifying and assessing risk and possible vectors of infection in nursing homes (NHs) using a machine-learning approach. DESIGN: This retrospective cohort study used a gradient boosting algorithm to evaluate ris...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
AMDA - The Society for Post-Acute and Long-Term Care Medicine.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7451194/ https://www.ncbi.nlm.nih.gov/pubmed/33032935 http://dx.doi.org/10.1016/j.jamda.2020.08.030 |
_version_ | 1783574938164133888 |
---|---|
author | Sun, Christopher L.F. Zuccarelli, Eugenio Zerhouni, El Ghali A. Lee, Jason Muller, James Scott, Karen M. Lujan, Alida M. Levi, Retsef |
author_facet | Sun, Christopher L.F. Zuccarelli, Eugenio Zerhouni, El Ghali A. Lee, Jason Muller, James Scott, Karen M. Lujan, Alida M. Levi, Retsef |
author_sort | Sun, Christopher L.F. |
collection | PubMed |
description | OBJECTIVE: Inform coronavirus disease 2019 (COVID-19) infection prevention measures by identifying and assessing risk and possible vectors of infection in nursing homes (NHs) using a machine-learning approach. DESIGN: This retrospective cohort study used a gradient boosting algorithm to evaluate risk of COVID-19 infection (ie, presence of at least 1 confirmed COVID-19 resident) in NHs. SETTING AND PARTICIPANTS: The model was trained on outcomes from 1146 NHs in Massachusetts, Georgia, and New Jersey, reporting COVID-19 case data on April 20, 2020. Risk indices generated from the model using data from May 4 were prospectively validated against outcomes reported on May 11 from 1021 NHs in California. METHODS: Model features, pertaining to facility and community characteristics, were obtained from a self-constructed dataset based on multiple public and private sources. The model was assessed via out-of-sample area under the receiver operating characteristic curve (AUC), sensitivity, and specificity in the training (via 10-fold cross-validation) and validation datasets. RESULTS: The mean AUC, sensitivity, and specificity of the model over 10-fold cross-validation were 0.729 [95% confidence interval (CI) 0.690‒0.767], 0.670 (95% CI 0.477‒0.862), and 0.611 (95% CI 0.412‒0.809), respectively. Prospective out-of-sample validation yielded similar performance measures (AUC 0.721; sensitivity 0.622; specificity 0.713). The strongest predictors of COVID-19 infection were identified as the NH's county's infection rate and the number of separate units in the NH; other predictors included the county's population density, historical Centers of Medicare and Medicaid Services cited health deficiencies, and the NH's resident density (in persons per 1000 square feet). In addition, the NH's historical percentage of non-Hispanic white residents was identified as a protective factor. CONCLUSIONS AND IMPLICATIONS: A machine-learning model can help quantify and predict NH infection risk. The identified risk factors support the early identification and management of presymptomatic and asymptomatic individuals (eg, staff) entering the NH from the surrounding community and the development of financially sustainable staff testing initiatives in preventing COVID-19 infection. |
format | Online Article Text |
id | pubmed-7451194 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | AMDA - The Society for Post-Acute and Long-Term Care Medicine. |
record_format | MEDLINE/PubMed |
spelling | pubmed-74511942020-08-28 Predicting Coronavirus Disease 2019 Infection Risk and Related Risk Drivers in Nursing Homes: A Machine Learning Approach Sun, Christopher L.F. Zuccarelli, Eugenio Zerhouni, El Ghali A. Lee, Jason Muller, James Scott, Karen M. Lujan, Alida M. Levi, Retsef J Am Med Dir Assoc Covid-19 OBJECTIVE: Inform coronavirus disease 2019 (COVID-19) infection prevention measures by identifying and assessing risk and possible vectors of infection in nursing homes (NHs) using a machine-learning approach. DESIGN: This retrospective cohort study used a gradient boosting algorithm to evaluate risk of COVID-19 infection (ie, presence of at least 1 confirmed COVID-19 resident) in NHs. SETTING AND PARTICIPANTS: The model was trained on outcomes from 1146 NHs in Massachusetts, Georgia, and New Jersey, reporting COVID-19 case data on April 20, 2020. Risk indices generated from the model using data from May 4 were prospectively validated against outcomes reported on May 11 from 1021 NHs in California. METHODS: Model features, pertaining to facility and community characteristics, were obtained from a self-constructed dataset based on multiple public and private sources. The model was assessed via out-of-sample area under the receiver operating characteristic curve (AUC), sensitivity, and specificity in the training (via 10-fold cross-validation) and validation datasets. RESULTS: The mean AUC, sensitivity, and specificity of the model over 10-fold cross-validation were 0.729 [95% confidence interval (CI) 0.690‒0.767], 0.670 (95% CI 0.477‒0.862), and 0.611 (95% CI 0.412‒0.809), respectively. Prospective out-of-sample validation yielded similar performance measures (AUC 0.721; sensitivity 0.622; specificity 0.713). The strongest predictors of COVID-19 infection were identified as the NH's county's infection rate and the number of separate units in the NH; other predictors included the county's population density, historical Centers of Medicare and Medicaid Services cited health deficiencies, and the NH's resident density (in persons per 1000 square feet). In addition, the NH's historical percentage of non-Hispanic white residents was identified as a protective factor. CONCLUSIONS AND IMPLICATIONS: A machine-learning model can help quantify and predict NH infection risk. The identified risk factors support the early identification and management of presymptomatic and asymptomatic individuals (eg, staff) entering the NH from the surrounding community and the development of financially sustainable staff testing initiatives in preventing COVID-19 infection. AMDA - The Society for Post-Acute and Long-Term Care Medicine. 2020-11 2020-08-27 /pmc/articles/PMC7451194/ /pubmed/33032935 http://dx.doi.org/10.1016/j.jamda.2020.08.030 Text en © 2020 AMDA - The Society for Post-Acute and Long-Term Care Medicine. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Covid-19 Sun, Christopher L.F. Zuccarelli, Eugenio Zerhouni, El Ghali A. Lee, Jason Muller, James Scott, Karen M. Lujan, Alida M. Levi, Retsef Predicting Coronavirus Disease 2019 Infection Risk and Related Risk Drivers in Nursing Homes: A Machine Learning Approach |
title | Predicting Coronavirus Disease 2019 Infection Risk and Related Risk Drivers in Nursing Homes: A Machine Learning Approach |
title_full | Predicting Coronavirus Disease 2019 Infection Risk and Related Risk Drivers in Nursing Homes: A Machine Learning Approach |
title_fullStr | Predicting Coronavirus Disease 2019 Infection Risk and Related Risk Drivers in Nursing Homes: A Machine Learning Approach |
title_full_unstemmed | Predicting Coronavirus Disease 2019 Infection Risk and Related Risk Drivers in Nursing Homes: A Machine Learning Approach |
title_short | Predicting Coronavirus Disease 2019 Infection Risk and Related Risk Drivers in Nursing Homes: A Machine Learning Approach |
title_sort | predicting coronavirus disease 2019 infection risk and related risk drivers in nursing homes: a machine learning approach |
topic | Covid-19 |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7451194/ https://www.ncbi.nlm.nih.gov/pubmed/33032935 http://dx.doi.org/10.1016/j.jamda.2020.08.030 |
work_keys_str_mv | AT sunchristopherlf predictingcoronavirusdisease2019infectionriskandrelatedriskdriversinnursinghomesamachinelearningapproach AT zuccarellieugenio predictingcoronavirusdisease2019infectionriskandrelatedriskdriversinnursinghomesamachinelearningapproach AT zerhounielghalia predictingcoronavirusdisease2019infectionriskandrelatedriskdriversinnursinghomesamachinelearningapproach AT leejason predictingcoronavirusdisease2019infectionriskandrelatedriskdriversinnursinghomesamachinelearningapproach AT mullerjames predictingcoronavirusdisease2019infectionriskandrelatedriskdriversinnursinghomesamachinelearningapproach AT scottkarenm predictingcoronavirusdisease2019infectionriskandrelatedriskdriversinnursinghomesamachinelearningapproach AT lujanalidam predictingcoronavirusdisease2019infectionriskandrelatedriskdriversinnursinghomesamachinelearningapproach AT leviretsef predictingcoronavirusdisease2019infectionriskandrelatedriskdriversinnursinghomesamachinelearningapproach |