Cargando…
Data quality model for assessing public COVID-19 big datasets
For decision-making support and evidence based on healthcare, high quality data are crucial, particularly if the emphasized knowledge is lacking. For public health practitioners and researchers, the reporting of COVID-19 data need to be accurate and easily available. Each nation has a system in plac...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10230148/ https://www.ncbi.nlm.nih.gov/pubmed/37359333 http://dx.doi.org/10.1007/s11227-023-05410-0 |
_version_ | 1785051460005789696 |
---|---|
author | Ngueilbaye, Alladoumbaye Huang, Joshua Zhexue Khan, Mehak Wang, Hongzhi |
author_facet | Ngueilbaye, Alladoumbaye Huang, Joshua Zhexue Khan, Mehak Wang, Hongzhi |
author_sort | Ngueilbaye, Alladoumbaye |
collection | PubMed |
description | For decision-making support and evidence based on healthcare, high quality data are crucial, particularly if the emphasized knowledge is lacking. For public health practitioners and researchers, the reporting of COVID-19 data need to be accurate and easily available. Each nation has a system in place for reporting COVID-19 data, albeit these systems’ efficacy has not been thoroughly evaluated. However, the current COVID-19 pandemic has shown widespread flaws in data quality. We propose a data quality model (canonical data model, four adequacy levels, and Benford’s law) to assess the quality issue of COVID-19 data reporting carried out by the World Health Organization (WHO) in the six Central African Economic and Monitory Community (CEMAC) region countries between March 6,2020, and June 22, 2022, and suggest potential solutions. These levels of data quality sufficiency can be interpreted as dependability indicators and sufficiency of Big Dataset inspection. This model effectively identified the quality of the entry data for big dataset analytics. The future development of this model requires scholars and institutions from all sectors to deepen their understanding of its core concepts, improve integration with other data processing technologies, and broaden the scope of its applications. |
format | Online Article Text |
id | pubmed-10230148 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-102301482023-06-01 Data quality model for assessing public COVID-19 big datasets Ngueilbaye, Alladoumbaye Huang, Joshua Zhexue Khan, Mehak Wang, Hongzhi J Supercomput Article For decision-making support and evidence based on healthcare, high quality data are crucial, particularly if the emphasized knowledge is lacking. For public health practitioners and researchers, the reporting of COVID-19 data need to be accurate and easily available. Each nation has a system in place for reporting COVID-19 data, albeit these systems’ efficacy has not been thoroughly evaluated. However, the current COVID-19 pandemic has shown widespread flaws in data quality. We propose a data quality model (canonical data model, four adequacy levels, and Benford’s law) to assess the quality issue of COVID-19 data reporting carried out by the World Health Organization (WHO) in the six Central African Economic and Monitory Community (CEMAC) region countries between March 6,2020, and June 22, 2022, and suggest potential solutions. These levels of data quality sufficiency can be interpreted as dependability indicators and sufficiency of Big Dataset inspection. This model effectively identified the quality of the entry data for big dataset analytics. The future development of this model requires scholars and institutions from all sectors to deepen their understanding of its core concepts, improve integration with other data processing technologies, and broaden the scope of its applications. Springer US 2023-05-31 /pmc/articles/PMC10230148/ /pubmed/37359333 http://dx.doi.org/10.1007/s11227-023-05410-0 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Ngueilbaye, Alladoumbaye Huang, Joshua Zhexue Khan, Mehak Wang, Hongzhi Data quality model for assessing public COVID-19 big datasets |
title | Data quality model for assessing public COVID-19 big datasets |
title_full | Data quality model for assessing public COVID-19 big datasets |
title_fullStr | Data quality model for assessing public COVID-19 big datasets |
title_full_unstemmed | Data quality model for assessing public COVID-19 big datasets |
title_short | Data quality model for assessing public COVID-19 big datasets |
title_sort | data quality model for assessing public covid-19 big datasets |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10230148/ https://www.ncbi.nlm.nih.gov/pubmed/37359333 http://dx.doi.org/10.1007/s11227-023-05410-0 |
work_keys_str_mv | AT ngueilbayealladoumbaye dataqualitymodelforassessingpubliccovid19bigdatasets AT huangjoshuazhexue dataqualitymodelforassessingpubliccovid19bigdatasets AT khanmehak dataqualitymodelforassessingpubliccovid19bigdatasets AT wanghongzhi dataqualitymodelforassessingpubliccovid19bigdatasets |