Cargando…

Data Pseudonymization in a Range That Does Not Affect Data Quality: Correlation with the Degree of Participation of Clinicians

Personal medical information is an essential resource for research; however, there are laws that regulate its use, and it typically has to be pseudonymized or anonymized. When data are anonymized, the quantity and quality of extractable information decrease significantly. From the perspective of a c...

Descripción completa

Detalles Bibliográficos
Autores principales: Shin, Soo-Yong, Kim, Hun-Sung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Korean Academy of Medical Sciences 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8593412/
https://www.ncbi.nlm.nih.gov/pubmed/34783216
http://dx.doi.org/10.3346/jkms.2021.36.e299
_version_ 1784599734127689728
author Shin, Soo-Yong
Kim, Hun-Sung
author_facet Shin, Soo-Yong
Kim, Hun-Sung
author_sort Shin, Soo-Yong
collection PubMed
description Personal medical information is an essential resource for research; however, there are laws that regulate its use, and it typically has to be pseudonymized or anonymized. When data are anonymized, the quantity and quality of extractable information decrease significantly. From the perspective of a clinical researcher, a method of achieving pseudonymized data without degrading data quality while also preventing data loss is proposed herein. As the level of pseudonymization varies according to the research purpose, the pseudonymization method applied should be carefully chosen. Therefore, the active participation of clinicians is crucial to transform the data according to the research purpose. This can contribute to data security by simply transforming the data through secondary data processing. Case studies demonstrated that, compared with the initial baseline data, there was a clinically significant difference in the number of datapoints added with the participation of a clinician (from 267,979 to 280,127 points, P < 0.001). Thus, depending on the degree of clinician participation, data anonymization may not affect data quality and quantity, and proper data quality management along with data security are emphasized. Although the pseudonymization level and clinical use of data have a trade-off relationship, it is possible to create pseudonymized data while maintaining the data quality required for a given research purpose. Therefore, rather than relying solely on security guidelines, the active participation of clinicians is important.
format Online
Article
Text
id pubmed-8593412
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher The Korean Academy of Medical Sciences
record_format MEDLINE/PubMed
spelling pubmed-85934122021-11-17 Data Pseudonymization in a Range That Does Not Affect Data Quality: Correlation with the Degree of Participation of Clinicians Shin, Soo-Yong Kim, Hun-Sung J Korean Med Sci Review Article Personal medical information is an essential resource for research; however, there are laws that regulate its use, and it typically has to be pseudonymized or anonymized. When data are anonymized, the quantity and quality of extractable information decrease significantly. From the perspective of a clinical researcher, a method of achieving pseudonymized data without degrading data quality while also preventing data loss is proposed herein. As the level of pseudonymization varies according to the research purpose, the pseudonymization method applied should be carefully chosen. Therefore, the active participation of clinicians is crucial to transform the data according to the research purpose. This can contribute to data security by simply transforming the data through secondary data processing. Case studies demonstrated that, compared with the initial baseline data, there was a clinically significant difference in the number of datapoints added with the participation of a clinician (from 267,979 to 280,127 points, P < 0.001). Thus, depending on the degree of clinician participation, data anonymization may not affect data quality and quantity, and proper data quality management along with data security are emphasized. Although the pseudonymization level and clinical use of data have a trade-off relationship, it is possible to create pseudonymized data while maintaining the data quality required for a given research purpose. Therefore, rather than relying solely on security guidelines, the active participation of clinicians is important. The Korean Academy of Medical Sciences 2021-10-26 /pmc/articles/PMC8593412/ /pubmed/34783216 http://dx.doi.org/10.3346/jkms.2021.36.e299 Text en © 2021 The Korean Academy of Medical Sciences. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Review Article
Shin, Soo-Yong
Kim, Hun-Sung
Data Pseudonymization in a Range That Does Not Affect Data Quality: Correlation with the Degree of Participation of Clinicians
title Data Pseudonymization in a Range That Does Not Affect Data Quality: Correlation with the Degree of Participation of Clinicians
title_full Data Pseudonymization in a Range That Does Not Affect Data Quality: Correlation with the Degree of Participation of Clinicians
title_fullStr Data Pseudonymization in a Range That Does Not Affect Data Quality: Correlation with the Degree of Participation of Clinicians
title_full_unstemmed Data Pseudonymization in a Range That Does Not Affect Data Quality: Correlation with the Degree of Participation of Clinicians
title_short Data Pseudonymization in a Range That Does Not Affect Data Quality: Correlation with the Degree of Participation of Clinicians
title_sort data pseudonymization in a range that does not affect data quality: correlation with the degree of participation of clinicians
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8593412/
https://www.ncbi.nlm.nih.gov/pubmed/34783216
http://dx.doi.org/10.3346/jkms.2021.36.e299
work_keys_str_mv AT shinsooyong datapseudonymizationinarangethatdoesnotaffectdataqualitycorrelationwiththedegreeofparticipationofclinicians
AT kimhunsung datapseudonymizationinarangethatdoesnotaffectdataqualitycorrelationwiththedegreeofparticipationofclinicians