Cargando…

Healthcare data quality assessment for improving the quality of the Korea Biobank Network

Numerous studies make extensive use of healthcare data, including human materials and clinical information, and acknowledge its significance. However, limitations in data collection methods can impact the quality of healthcare data obtained from multiple institutions. In order to secure high-quality...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Ki-Hoon, Oh, Seol Whan, Ko, Soo Jeong, Lee, Kang Hyuck, Choi, Wona, Choi, In Young
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10659164/
https://www.ncbi.nlm.nih.gov/pubmed/37983215
http://dx.doi.org/10.1371/journal.pone.0294554
_version_ 1785148287324520448
author Kim, Ki-Hoon
Oh, Seol Whan
Ko, Soo Jeong
Lee, Kang Hyuck
Choi, Wona
Choi, In Young
author_facet Kim, Ki-Hoon
Oh, Seol Whan
Ko, Soo Jeong
Lee, Kang Hyuck
Choi, Wona
Choi, In Young
author_sort Kim, Ki-Hoon
collection PubMed
description Numerous studies make extensive use of healthcare data, including human materials and clinical information, and acknowledge its significance. However, limitations in data collection methods can impact the quality of healthcare data obtained from multiple institutions. In order to secure high-quality data related to human materials, research focused on data quality is necessary. This study validated the quality of data collected in 2020 from 16 institutions constituting the Korea Biobank Network using 104 validation rules. The validation rules were developed based on the DQ4HEALTH model and were divided into four dimensions: completeness, validity, accuracy, and uniqueness. Korea Biobank Network collects and manages human materials and clinical information from multiple biobanks, and is in the process of developing a common data model for data integration. The results of the data quality verification revealed an error rate of 0.74%. Furthermore, an analysis of the data from each institution was performed to examine the relationship between the institution’s characteristics and error count. The results from a chi-square test indicated that there was an independent correlation between each institution and its error count. To confirm this correlation between error counts and the characteristics of each institution, a correlation analysis was conducted. The results, shown in a graph, revealed the relationship between factors that had high correlation coefficients and the error count. The findings suggest that the data quality was impacted by biases in the evaluation system, including the institution’s IT environment, infrastructure, and the number of collected samples. These results highlight the need to consider the scalability of research quality when evaluating clinical epidemiological information linked to human materials in future validation studies of data quality.
format Online
Article
Text
id pubmed-10659164
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-106591642023-11-20 Healthcare data quality assessment for improving the quality of the Korea Biobank Network Kim, Ki-Hoon Oh, Seol Whan Ko, Soo Jeong Lee, Kang Hyuck Choi, Wona Choi, In Young PLoS One Research Article Numerous studies make extensive use of healthcare data, including human materials and clinical information, and acknowledge its significance. However, limitations in data collection methods can impact the quality of healthcare data obtained from multiple institutions. In order to secure high-quality data related to human materials, research focused on data quality is necessary. This study validated the quality of data collected in 2020 from 16 institutions constituting the Korea Biobank Network using 104 validation rules. The validation rules were developed based on the DQ4HEALTH model and were divided into four dimensions: completeness, validity, accuracy, and uniqueness. Korea Biobank Network collects and manages human materials and clinical information from multiple biobanks, and is in the process of developing a common data model for data integration. The results of the data quality verification revealed an error rate of 0.74%. Furthermore, an analysis of the data from each institution was performed to examine the relationship between the institution’s characteristics and error count. The results from a chi-square test indicated that there was an independent correlation between each institution and its error count. To confirm this correlation between error counts and the characteristics of each institution, a correlation analysis was conducted. The results, shown in a graph, revealed the relationship between factors that had high correlation coefficients and the error count. The findings suggest that the data quality was impacted by biases in the evaluation system, including the institution’s IT environment, infrastructure, and the number of collected samples. These results highlight the need to consider the scalability of research quality when evaluating clinical epidemiological information linked to human materials in future validation studies of data quality. Public Library of Science 2023-11-20 /pmc/articles/PMC10659164/ /pubmed/37983215 http://dx.doi.org/10.1371/journal.pone.0294554 Text en © 2023 Kim et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Kim, Ki-Hoon
Oh, Seol Whan
Ko, Soo Jeong
Lee, Kang Hyuck
Choi, Wona
Choi, In Young
Healthcare data quality assessment for improving the quality of the Korea Biobank Network
title Healthcare data quality assessment for improving the quality of the Korea Biobank Network
title_full Healthcare data quality assessment for improving the quality of the Korea Biobank Network
title_fullStr Healthcare data quality assessment for improving the quality of the Korea Biobank Network
title_full_unstemmed Healthcare data quality assessment for improving the quality of the Korea Biobank Network
title_short Healthcare data quality assessment for improving the quality of the Korea Biobank Network
title_sort healthcare data quality assessment for improving the quality of the korea biobank network
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10659164/
https://www.ncbi.nlm.nih.gov/pubmed/37983215
http://dx.doi.org/10.1371/journal.pone.0294554
work_keys_str_mv AT kimkihoon healthcaredataqualityassessmentforimprovingthequalityofthekoreabiobanknetwork
AT ohseolwhan healthcaredataqualityassessmentforimprovingthequalityofthekoreabiobanknetwork
AT kosoojeong healthcaredataqualityassessmentforimprovingthequalityofthekoreabiobanknetwork
AT leekanghyuck healthcaredataqualityassessmentforimprovingthequalityofthekoreabiobanknetwork
AT choiwona healthcaredataqualityassessmentforimprovingthequalityofthekoreabiobanknetwork
AT choiinyoung healthcaredataqualityassessmentforimprovingthequalityofthekoreabiobanknetwork