Cargando…
Data-driven test strategy for COVID-19 using machine learning: A study in Lahore, Pakistan
AIMS: We aimed at giving a preliminary analysis of the weakness of a current test strategy, and proposing a data-driven strategy that was self-adaptive to the dynamic change of pandemic. The effect of driven-data selection over time and space was also within the deep concern. METHODS: A mathematical...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Published by Elsevier Ltd.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8184360/ https://www.ncbi.nlm.nih.gov/pubmed/34121777 http://dx.doi.org/10.1016/j.seps.2021.101091 |
_version_ | 1783704578125987840 |
---|---|
author | Huang, Chuanli Wang, Min Rafaqat, Warda Shabbir, Salman Lian, Liping Zhang, Jun Lo, Siuming Song, Weiguo |
author_facet | Huang, Chuanli Wang, Min Rafaqat, Warda Shabbir, Salman Lian, Liping Zhang, Jun Lo, Siuming Song, Weiguo |
author_sort | Huang, Chuanli |
collection | PubMed |
description | AIMS: We aimed at giving a preliminary analysis of the weakness of a current test strategy, and proposing a data-driven strategy that was self-adaptive to the dynamic change of pandemic. The effect of driven-data selection over time and space was also within the deep concern. METHODS: A mathematical definition of the test strategy were given. With the real COVID-19 test data from March to July collected in Lahore, a significance analysis of the possible features was conducted. A machine learning method based on logistic regression and priority ranking were proposed for the data-driven test strategy. With performance assessed by the area under the receiver operating characteristic curve (AUC), time series analysis and spatial cross-test were conducted. RESULTS: The transition of risk factors accounted for the failure of the current test strategy. The proposed data-driven strategy could enhance the positive detection rate from 2.54% to 28.18%, and the recall rate from 8.05% to 89.35% under strictly limited test capacity. Much more optimal utilization of test resources could be realized where 89.35% of total positive cases could be detected with merely 48.17% of the original test amount. The strategy showed self-adaptability with the development of pandemic, while the strategy driven by local data was proved to be optimal. CONCLUSIONS: We recommended a generalization of such a data-driven test strategy for a better response to the global developing pandemic. Besides, the construction of the COVID-19 data system should be more refined on space for local applications. |
format | Online Article Text |
id | pubmed-8184360 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Published by Elsevier Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-81843602021-06-08 Data-driven test strategy for COVID-19 using machine learning: A study in Lahore, Pakistan Huang, Chuanli Wang, Min Rafaqat, Warda Shabbir, Salman Lian, Liping Zhang, Jun Lo, Siuming Song, Weiguo Socioecon Plann Sci Article AIMS: We aimed at giving a preliminary analysis of the weakness of a current test strategy, and proposing a data-driven strategy that was self-adaptive to the dynamic change of pandemic. The effect of driven-data selection over time and space was also within the deep concern. METHODS: A mathematical definition of the test strategy were given. With the real COVID-19 test data from March to July collected in Lahore, a significance analysis of the possible features was conducted. A machine learning method based on logistic regression and priority ranking were proposed for the data-driven test strategy. With performance assessed by the area under the receiver operating characteristic curve (AUC), time series analysis and spatial cross-test were conducted. RESULTS: The transition of risk factors accounted for the failure of the current test strategy. The proposed data-driven strategy could enhance the positive detection rate from 2.54% to 28.18%, and the recall rate from 8.05% to 89.35% under strictly limited test capacity. Much more optimal utilization of test resources could be realized where 89.35% of total positive cases could be detected with merely 48.17% of the original test amount. The strategy showed self-adaptability with the development of pandemic, while the strategy driven by local data was proved to be optimal. CONCLUSIONS: We recommended a generalization of such a data-driven test strategy for a better response to the global developing pandemic. Besides, the construction of the COVID-19 data system should be more refined on space for local applications. Published by Elsevier Ltd. 2022-03 2021-06-08 /pmc/articles/PMC8184360/ /pubmed/34121777 http://dx.doi.org/10.1016/j.seps.2021.101091 Text en © 2021 Published by Elsevier Ltd. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Huang, Chuanli Wang, Min Rafaqat, Warda Shabbir, Salman Lian, Liping Zhang, Jun Lo, Siuming Song, Weiguo Data-driven test strategy for COVID-19 using machine learning: A study in Lahore, Pakistan |
title | Data-driven test strategy for COVID-19 using machine learning: A study in Lahore, Pakistan |
title_full | Data-driven test strategy for COVID-19 using machine learning: A study in Lahore, Pakistan |
title_fullStr | Data-driven test strategy for COVID-19 using machine learning: A study in Lahore, Pakistan |
title_full_unstemmed | Data-driven test strategy for COVID-19 using machine learning: A study in Lahore, Pakistan |
title_short | Data-driven test strategy for COVID-19 using machine learning: A study in Lahore, Pakistan |
title_sort | data-driven test strategy for covid-19 using machine learning: a study in lahore, pakistan |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8184360/ https://www.ncbi.nlm.nih.gov/pubmed/34121777 http://dx.doi.org/10.1016/j.seps.2021.101091 |
work_keys_str_mv | AT huangchuanli datadriventeststrategyforcovid19usingmachinelearningastudyinlahorepakistan AT wangmin datadriventeststrategyforcovid19usingmachinelearningastudyinlahorepakistan AT rafaqatwarda datadriventeststrategyforcovid19usingmachinelearningastudyinlahorepakistan AT shabbirsalman datadriventeststrategyforcovid19usingmachinelearningastudyinlahorepakistan AT lianliping datadriventeststrategyforcovid19usingmachinelearningastudyinlahorepakistan AT zhangjun datadriventeststrategyforcovid19usingmachinelearningastudyinlahorepakistan AT losiuming datadriventeststrategyforcovid19usingmachinelearningastudyinlahorepakistan AT songweiguo datadriventeststrategyforcovid19usingmachinelearningastudyinlahorepakistan |