Cargando…

1380. Machine Learning-based Estimation of Unconfirmed COVID-19 Cases from a 10,000-Household Survey in Gilgit-Baltistan, Pakistan

BACKGROUND: Robust estimates of COVID-19 prevalence during the pandemic are scarce, particularly in settings with limited SARS-CoV-2 testing. Gilgit-Baltistan (GB) is a remote region of Pakistan where healthcare access is limited by underdeveloped facility and road infrastructure. We leveraged a lar...

Descripción completa

Detalles Bibliográficos
Autores principales:	Farrar, Daniel S, Pell, Lisa G, Muhammad, Yasin, Hafiz, Sher, Erdman, Lauren, Bassani, Diego G, Tanner, Zachary, Ahmed, Imran, Muhammad, Karim, Madhani, Falak, Paracha, Shariq, Khan, Masood Ali, Soofi, Sajid B, Taljaard, Monica, Spitzer, Rachel, Abu Fadaleh, Sarah M, Bhutta, Zulfiqar A, Morris, Shaun
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2023
Materias:	Abstract
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10677587/ http://dx.doi.org/10.1093/ofid/ofad500.1217

_version_	1785150165070381056
author	Farrar, Daniel S Pell, Lisa G Muhammad, Yasin Hafiz, Sher Erdman, Lauren Bassani, Diego G Tanner, Zachary Ahmed, Imran Muhammad, Karim Madhani, Falak Paracha, Shariq Khan, Masood Ali Soofi, Sajid B Taljaard, Monica Spitzer, Rachel Abu Fadaleh, Sarah M Bhutta, Zulfiqar A Morris, Shaun
author_facet	Farrar, Daniel S Pell, Lisa G Muhammad, Yasin Hafiz, Sher Erdman, Lauren Bassani, Diego G Tanner, Zachary Ahmed, Imran Muhammad, Karim Madhani, Falak Paracha, Shariq Khan, Masood Ali Soofi, Sajid B Taljaard, Monica Spitzer, Rachel Abu Fadaleh, Sarah M Bhutta, Zulfiqar A Morris, Shaun
author_sort	Farrar, Daniel S
collection	PubMed
description	BACKGROUND: Robust estimates of COVID-19 prevalence during the pandemic are scarce, particularly in settings with limited SARS-CoV-2 testing. Gilgit-Baltistan (GB) is a remote region of Pakistan where healthcare access is limited by underdeveloped facility and road infrastructure. We leveraged a large household survey to describe the burden of confirmed and unconfirmed COVID-19 in GB. METHODS: We conducted a cross-sectional survey in GB from June–August 2021 during the baseline phase of a cluster randomized trial. Households were randomly selected using a stratified, two-stage sampling design. Data regarding SARS-CoV-2 testing, healthcare worker (HCW) diagnoses without testing, symptoms, and outcomes since March 2020 were self-reported for all household members. “Confirmed/probable” COVID-19 was defined as a positive test, HCW diagnosis of COVID-19, or HCW diagnosis of pneumonia with COVID-19 positive contact. Using machine learning (ML) and bootstrap validation, we developed a symptom-based diagnostic model to differentiate confirmed/probable infections from those with negative SARS-CoV-2 tests (Fig. 1). We applied this model to untested respondents to estimate the total prevalence of COVID-19. [Figure: see text] RESULTS: Data were collected from 77924 people in 10264 households. Overall, 314 had confirmed/probable COVID-19, 3263 had negative tests, and 74347 were untested. SARS-CoV-2 testing was less common in females (vs. males; 38 vs. 58 tests per 1000 people) and children (vs. adults; 17 vs. 76 tests per 1000 people). Using an extreme gradient boosting model, area under the receiver operating characteristic curve was 0.92 (95% confidence interval [CI] 0.90–0.93), sensitivity was 0.81 (CI 0.75–0.85), and specificity was 0.88 (CI 0.85–0.90). With this model, total estimated cases were 8–17 times more than the number of individuals with positive tests (Fig. 2). The ratio of estimated to confirmed cases was higher for children (90–213 times) and females (13–25 times). [Figure: see text] CONCLUSION: From March 2020–August 2021, the majority of COVID-19 cases in GB went unconfirmed. Women and children were tested less often, perhaps due to preferences in healthcare seeking and perceptions of lower risk of severe illness. Our approach may be used to estimate COVID-19 prevalence in settings with limited testing capacity. DISCLOSURES: Shaun Morris, MD, MPH, DTM&H, FRCPC, FAAP, GlaxoSmithKline: Honoraria\|JNJ China: Honoraria\|Merck: served on ad hoc advisory board\|Pfizer: Grant/Research Support\|Pfizer: served on ad-hoc advisory board\|Sanofi-Pasteur: served on ad-hoc advisory board
format	Online Article Text
id	pubmed-10677587
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-106775872023-11-27 1380. Machine Learning-based Estimation of Unconfirmed COVID-19 Cases from a 10,000-Household Survey in Gilgit-Baltistan, Pakistan Farrar, Daniel S Pell, Lisa G Muhammad, Yasin Hafiz, Sher Erdman, Lauren Bassani, Diego G Tanner, Zachary Ahmed, Imran Muhammad, Karim Madhani, Falak Paracha, Shariq Khan, Masood Ali Soofi, Sajid B Taljaard, Monica Spitzer, Rachel Abu Fadaleh, Sarah M Bhutta, Zulfiqar A Morris, Shaun Open Forum Infect Dis Abstract BACKGROUND: Robust estimates of COVID-19 prevalence during the pandemic are scarce, particularly in settings with limited SARS-CoV-2 testing. Gilgit-Baltistan (GB) is a remote region of Pakistan where healthcare access is limited by underdeveloped facility and road infrastructure. We leveraged a large household survey to describe the burden of confirmed and unconfirmed COVID-19 in GB. METHODS: We conducted a cross-sectional survey in GB from June–August 2021 during the baseline phase of a cluster randomized trial. Households were randomly selected using a stratified, two-stage sampling design. Data regarding SARS-CoV-2 testing, healthcare worker (HCW) diagnoses without testing, symptoms, and outcomes since March 2020 were self-reported for all household members. “Confirmed/probable” COVID-19 was defined as a positive test, HCW diagnosis of COVID-19, or HCW diagnosis of pneumonia with COVID-19 positive contact. Using machine learning (ML) and bootstrap validation, we developed a symptom-based diagnostic model to differentiate confirmed/probable infections from those with negative SARS-CoV-2 tests (Fig. 1). We applied this model to untested respondents to estimate the total prevalence of COVID-19. [Figure: see text] RESULTS: Data were collected from 77924 people in 10264 households. Overall, 314 had confirmed/probable COVID-19, 3263 had negative tests, and 74347 were untested. SARS-CoV-2 testing was less common in females (vs. males; 38 vs. 58 tests per 1000 people) and children (vs. adults; 17 vs. 76 tests per 1000 people). Using an extreme gradient boosting model, area under the receiver operating characteristic curve was 0.92 (95% confidence interval [CI] 0.90–0.93), sensitivity was 0.81 (CI 0.75–0.85), and specificity was 0.88 (CI 0.85–0.90). With this model, total estimated cases were 8–17 times more than the number of individuals with positive tests (Fig. 2). The ratio of estimated to confirmed cases was higher for children (90–213 times) and females (13–25 times). [Figure: see text] CONCLUSION: From March 2020–August 2021, the majority of COVID-19 cases in GB went unconfirmed. Women and children were tested less often, perhaps due to preferences in healthcare seeking and perceptions of lower risk of severe illness. Our approach may be used to estimate COVID-19 prevalence in settings with limited testing capacity. DISCLOSURES: Shaun Morris, MD, MPH, DTM&H, FRCPC, FAAP, GlaxoSmithKline: Honoraria\|JNJ China: Honoraria\|Merck: served on ad hoc advisory board\|Pfizer: Grant/Research Support\|Pfizer: served on ad-hoc advisory board\|Sanofi-Pasteur: served on ad-hoc advisory board Oxford University Press 2023-11-27 /pmc/articles/PMC10677587/ http://dx.doi.org/10.1093/ofid/ofad500.1217 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Infectious Diseases Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Abstract Farrar, Daniel S Pell, Lisa G Muhammad, Yasin Hafiz, Sher Erdman, Lauren Bassani, Diego G Tanner, Zachary Ahmed, Imran Muhammad, Karim Madhani, Falak Paracha, Shariq Khan, Masood Ali Soofi, Sajid B Taljaard, Monica Spitzer, Rachel Abu Fadaleh, Sarah M Bhutta, Zulfiqar A Morris, Shaun 1380. Machine Learning-based Estimation of Unconfirmed COVID-19 Cases from a 10,000-Household Survey in Gilgit-Baltistan, Pakistan
title	1380. Machine Learning-based Estimation of Unconfirmed COVID-19 Cases from a 10,000-Household Survey in Gilgit-Baltistan, Pakistan
title_full	1380. Machine Learning-based Estimation of Unconfirmed COVID-19 Cases from a 10,000-Household Survey in Gilgit-Baltistan, Pakistan
title_fullStr	1380. Machine Learning-based Estimation of Unconfirmed COVID-19 Cases from a 10,000-Household Survey in Gilgit-Baltistan, Pakistan
title_full_unstemmed	1380. Machine Learning-based Estimation of Unconfirmed COVID-19 Cases from a 10,000-Household Survey in Gilgit-Baltistan, Pakistan
title_short	1380. Machine Learning-based Estimation of Unconfirmed COVID-19 Cases from a 10,000-Household Survey in Gilgit-Baltistan, Pakistan
title_sort	1380. machine learning-based estimation of unconfirmed covid-19 cases from a 10,000-household survey in gilgit-baltistan, pakistan
topic	Abstract
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10677587/ http://dx.doi.org/10.1093/ofid/ofad500.1217
work_keys_str_mv	AT farrardaniels 1380machinelearningbasedestimationofunconfirmedcovid19casesfroma10000householdsurveyingilgitbaltistanpakistan AT pelllisag 1380machinelearningbasedestimationofunconfirmedcovid19casesfroma10000householdsurveyingilgitbaltistanpakistan AT muhammadyasin 1380machinelearningbasedestimationofunconfirmedcovid19casesfroma10000householdsurveyingilgitbaltistanpakistan AT hafizsher 1380machinelearningbasedestimationofunconfirmedcovid19casesfroma10000householdsurveyingilgitbaltistanpakistan AT erdmanlauren 1380machinelearningbasedestimationofunconfirmedcovid19casesfroma10000householdsurveyingilgitbaltistanpakistan AT bassanidiegog 1380machinelearningbasedestimationofunconfirmedcovid19casesfroma10000householdsurveyingilgitbaltistanpakistan AT tannerzachary 1380machinelearningbasedestimationofunconfirmedcovid19casesfroma10000householdsurveyingilgitbaltistanpakistan AT ahmedimran 1380machinelearningbasedestimationofunconfirmedcovid19casesfroma10000householdsurveyingilgitbaltistanpakistan AT muhammadkarim 1380machinelearningbasedestimationofunconfirmedcovid19casesfroma10000householdsurveyingilgitbaltistanpakistan AT madhanifalak 1380machinelearningbasedestimationofunconfirmedcovid19casesfroma10000householdsurveyingilgitbaltistanpakistan AT parachashariq 1380machinelearningbasedestimationofunconfirmedcovid19casesfroma10000householdsurveyingilgitbaltistanpakistan AT khanmasoodali 1380machinelearningbasedestimationofunconfirmedcovid19casesfroma10000householdsurveyingilgitbaltistanpakistan AT soofisajidb 1380machinelearningbasedestimationofunconfirmedcovid19casesfroma10000householdsurveyingilgitbaltistanpakistan AT taljaardmonica 1380machinelearningbasedestimationofunconfirmedcovid19casesfroma10000householdsurveyingilgitbaltistanpakistan AT spitzerrachel 1380machinelearningbasedestimationofunconfirmedcovid19casesfroma10000householdsurveyingilgitbaltistanpakistan AT abufadalehsarahm 1380machinelearningbasedestimationofunconfirmedcovid19casesfroma10000householdsurveyingilgitbaltistanpakistan AT bhuttazulfiqara 1380machinelearningbasedestimationofunconfirmedcovid19casesfroma10000householdsurveyingilgitbaltistanpakistan AT morrisshaun 1380machinelearningbasedestimationofunconfirmedcovid19casesfroma10000householdsurveyingilgitbaltistanpakistan

1380. Machine Learning-based Estimation of Unconfirmed COVID-19 Cases from a 10,000-Household Survey in Gilgit-Baltistan, Pakistan

Ejemplares similares