Cargando…

Data quality model for assessing public COVID-19 big datasets

For decision-making support and evidence based on healthcare, high quality data are crucial, particularly if the emphasized knowledge is lacking. For public health practitioners and researchers, the reporting of COVID-19 data need to be accurate and easily available. Each nation has a system in plac...

Descripción completa

Detalles Bibliográficos
Autores principales: Ngueilbaye, Alladoumbaye, Huang, Joshua Zhexue, Khan, Mehak, Wang, Hongzhi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10230148/
https://www.ncbi.nlm.nih.gov/pubmed/37359333
http://dx.doi.org/10.1007/s11227-023-05410-0
_version_ 1785051460005789696
author Ngueilbaye, Alladoumbaye
Huang, Joshua Zhexue
Khan, Mehak
Wang, Hongzhi
author_facet Ngueilbaye, Alladoumbaye
Huang, Joshua Zhexue
Khan, Mehak
Wang, Hongzhi
author_sort Ngueilbaye, Alladoumbaye
collection PubMed
description For decision-making support and evidence based on healthcare, high quality data are crucial, particularly if the emphasized knowledge is lacking. For public health practitioners and researchers, the reporting of COVID-19 data need to be accurate and easily available. Each nation has a system in place for reporting COVID-19 data, albeit these systems’ efficacy has not been thoroughly evaluated. However, the current COVID-19 pandemic has shown widespread flaws in data quality. We propose a data quality model (canonical data model, four adequacy levels, and Benford’s law) to assess the quality issue of COVID-19 data reporting carried out by the World Health Organization (WHO) in the six Central African Economic and Monitory Community (CEMAC) region countries between March 6,2020, and June 22, 2022, and suggest potential solutions. These levels of data quality sufficiency can be interpreted as dependability indicators and sufficiency of Big Dataset inspection. This model effectively identified the quality of the entry data for big dataset analytics. The future development of this model requires scholars and institutions from all sectors to deepen their understanding of its core concepts, improve integration with other data processing technologies, and broaden the scope of its applications.
format Online
Article
Text
id pubmed-10230148
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-102301482023-06-01 Data quality model for assessing public COVID-19 big datasets Ngueilbaye, Alladoumbaye Huang, Joshua Zhexue Khan, Mehak Wang, Hongzhi J Supercomput Article For decision-making support and evidence based on healthcare, high quality data are crucial, particularly if the emphasized knowledge is lacking. For public health practitioners and researchers, the reporting of COVID-19 data need to be accurate and easily available. Each nation has a system in place for reporting COVID-19 data, albeit these systems’ efficacy has not been thoroughly evaluated. However, the current COVID-19 pandemic has shown widespread flaws in data quality. We propose a data quality model (canonical data model, four adequacy levels, and Benford’s law) to assess the quality issue of COVID-19 data reporting carried out by the World Health Organization (WHO) in the six Central African Economic and Monitory Community (CEMAC) region countries between March 6,2020, and June 22, 2022, and suggest potential solutions. These levels of data quality sufficiency can be interpreted as dependability indicators and sufficiency of Big Dataset inspection. This model effectively identified the quality of the entry data for big dataset analytics. The future development of this model requires scholars and institutions from all sectors to deepen their understanding of its core concepts, improve integration with other data processing technologies, and broaden the scope of its applications. Springer US 2023-05-31 /pmc/articles/PMC10230148/ /pubmed/37359333 http://dx.doi.org/10.1007/s11227-023-05410-0 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Ngueilbaye, Alladoumbaye
Huang, Joshua Zhexue
Khan, Mehak
Wang, Hongzhi
Data quality model for assessing public COVID-19 big datasets
title Data quality model for assessing public COVID-19 big datasets
title_full Data quality model for assessing public COVID-19 big datasets
title_fullStr Data quality model for assessing public COVID-19 big datasets
title_full_unstemmed Data quality model for assessing public COVID-19 big datasets
title_short Data quality model for assessing public COVID-19 big datasets
title_sort data quality model for assessing public covid-19 big datasets
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10230148/
https://www.ncbi.nlm.nih.gov/pubmed/37359333
http://dx.doi.org/10.1007/s11227-023-05410-0
work_keys_str_mv AT ngueilbayealladoumbaye dataqualitymodelforassessingpubliccovid19bigdatasets
AT huangjoshuazhexue dataqualitymodelforassessingpubliccovid19bigdatasets
AT khanmehak dataqualitymodelforassessingpubliccovid19bigdatasets
AT wanghongzhi dataqualitymodelforassessingpubliccovid19bigdatasets