Cargando…

Data-driven test strategy for COVID-19 using machine learning: A study in Lahore, Pakistan

AIMS: We aimed at giving a preliminary analysis of the weakness of a current test strategy, and proposing a data-driven strategy that was self-adaptive to the dynamic change of pandemic. The effect of driven-data selection over time and space was also within the deep concern. METHODS: A mathematical...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Chuanli, Wang, Min, Rafaqat, Warda, Shabbir, Salman, Lian, Liping, Zhang, Jun, Lo, Siuming, Song, Weiguo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Published by Elsevier Ltd. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8184360/
https://www.ncbi.nlm.nih.gov/pubmed/34121777
http://dx.doi.org/10.1016/j.seps.2021.101091
_version_ 1783704578125987840
author Huang, Chuanli
Wang, Min
Rafaqat, Warda
Shabbir, Salman
Lian, Liping
Zhang, Jun
Lo, Siuming
Song, Weiguo
author_facet Huang, Chuanli
Wang, Min
Rafaqat, Warda
Shabbir, Salman
Lian, Liping
Zhang, Jun
Lo, Siuming
Song, Weiguo
author_sort Huang, Chuanli
collection PubMed
description AIMS: We aimed at giving a preliminary analysis of the weakness of a current test strategy, and proposing a data-driven strategy that was self-adaptive to the dynamic change of pandemic. The effect of driven-data selection over time and space was also within the deep concern. METHODS: A mathematical definition of the test strategy were given. With the real COVID-19 test data from March to July collected in Lahore, a significance analysis of the possible features was conducted. A machine learning method based on logistic regression and priority ranking were proposed for the data-driven test strategy. With performance assessed by the area under the receiver operating characteristic curve (AUC), time series analysis and spatial cross-test were conducted. RESULTS: The transition of risk factors accounted for the failure of the current test strategy. The proposed data-driven strategy could enhance the positive detection rate from 2.54% to 28.18%, and the recall rate from 8.05% to 89.35% under strictly limited test capacity. Much more optimal utilization of test resources could be realized where 89.35% of total positive cases could be detected with merely 48.17% of the original test amount. The strategy showed self-adaptability with the development of pandemic, while the strategy driven by local data was proved to be optimal. CONCLUSIONS: We recommended a generalization of such a data-driven test strategy for a better response to the global developing pandemic. Besides, the construction of the COVID-19 data system should be more refined on space for local applications.
format Online
Article
Text
id pubmed-8184360
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Published by Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-81843602021-06-08 Data-driven test strategy for COVID-19 using machine learning: A study in Lahore, Pakistan Huang, Chuanli Wang, Min Rafaqat, Warda Shabbir, Salman Lian, Liping Zhang, Jun Lo, Siuming Song, Weiguo Socioecon Plann Sci Article AIMS: We aimed at giving a preliminary analysis of the weakness of a current test strategy, and proposing a data-driven strategy that was self-adaptive to the dynamic change of pandemic. The effect of driven-data selection over time and space was also within the deep concern. METHODS: A mathematical definition of the test strategy were given. With the real COVID-19 test data from March to July collected in Lahore, a significance analysis of the possible features was conducted. A machine learning method based on logistic regression and priority ranking were proposed for the data-driven test strategy. With performance assessed by the area under the receiver operating characteristic curve (AUC), time series analysis and spatial cross-test were conducted. RESULTS: The transition of risk factors accounted for the failure of the current test strategy. The proposed data-driven strategy could enhance the positive detection rate from 2.54% to 28.18%, and the recall rate from 8.05% to 89.35% under strictly limited test capacity. Much more optimal utilization of test resources could be realized where 89.35% of total positive cases could be detected with merely 48.17% of the original test amount. The strategy showed self-adaptability with the development of pandemic, while the strategy driven by local data was proved to be optimal. CONCLUSIONS: We recommended a generalization of such a data-driven test strategy for a better response to the global developing pandemic. Besides, the construction of the COVID-19 data system should be more refined on space for local applications. Published by Elsevier Ltd. 2022-03 2021-06-08 /pmc/articles/PMC8184360/ /pubmed/34121777 http://dx.doi.org/10.1016/j.seps.2021.101091 Text en © 2021 Published by Elsevier Ltd. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Huang, Chuanli
Wang, Min
Rafaqat, Warda
Shabbir, Salman
Lian, Liping
Zhang, Jun
Lo, Siuming
Song, Weiguo
Data-driven test strategy for COVID-19 using machine learning: A study in Lahore, Pakistan
title Data-driven test strategy for COVID-19 using machine learning: A study in Lahore, Pakistan
title_full Data-driven test strategy for COVID-19 using machine learning: A study in Lahore, Pakistan
title_fullStr Data-driven test strategy for COVID-19 using machine learning: A study in Lahore, Pakistan
title_full_unstemmed Data-driven test strategy for COVID-19 using machine learning: A study in Lahore, Pakistan
title_short Data-driven test strategy for COVID-19 using machine learning: A study in Lahore, Pakistan
title_sort data-driven test strategy for covid-19 using machine learning: a study in lahore, pakistan
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8184360/
https://www.ncbi.nlm.nih.gov/pubmed/34121777
http://dx.doi.org/10.1016/j.seps.2021.101091
work_keys_str_mv AT huangchuanli datadriventeststrategyforcovid19usingmachinelearningastudyinlahorepakistan
AT wangmin datadriventeststrategyforcovid19usingmachinelearningastudyinlahorepakistan
AT rafaqatwarda datadriventeststrategyforcovid19usingmachinelearningastudyinlahorepakistan
AT shabbirsalman datadriventeststrategyforcovid19usingmachinelearningastudyinlahorepakistan
AT lianliping datadriventeststrategyforcovid19usingmachinelearningastudyinlahorepakistan
AT zhangjun datadriventeststrategyforcovid19usingmachinelearningastudyinlahorepakistan
AT losiuming datadriventeststrategyforcovid19usingmachinelearningastudyinlahorepakistan
AT songweiguo datadriventeststrategyforcovid19usingmachinelearningastudyinlahorepakistan