Cargando…

Outsourcing Medical Data Analyses: Can Technology Overcome Legal, Privacy, and Confidentiality Issues?

BACKGROUND: Medical data are gold mines for deriving the knowledge that could change the course of a single patient’s life or even the health of the entire population. A data analyst needs to have full access to relevant data, but full access may be denied by privacy and confidentiality of medical d...

Descripción completa

Detalles Bibliográficos
Autores principales: Brumen, Bostjan, Heričko, Marjan, Sevčnikar, Andrej, Završnik, Jernej, Hölbl, Marko
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications Inc. 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3877744/
https://www.ncbi.nlm.nih.gov/pubmed/24342053
http://dx.doi.org/10.2196/jmir.2471
_version_ 1782297697664892928
author Brumen, Bostjan
Heričko, Marjan
Sevčnikar, Andrej
Završnik, Jernej
Hölbl, Marko
author_facet Brumen, Bostjan
Heričko, Marjan
Sevčnikar, Andrej
Završnik, Jernej
Hölbl, Marko
author_sort Brumen, Bostjan
collection PubMed
description BACKGROUND: Medical data are gold mines for deriving the knowledge that could change the course of a single patient’s life or even the health of the entire population. A data analyst needs to have full access to relevant data, but full access may be denied by privacy and confidentiality of medical data legal regulations, especially when the data analyst is not affiliated with the data owner. OBJECTIVE: Our first objective was to analyze the privacy and confidentiality issues and the associated regulations pertaining to medical data, and to identify technologies to properly address these issues. Our second objective was to develop a procedure to protect medical data in such a way that the outsourced analyst would be capable of doing analyses on protected data and the results would be comparable, if not the same, as if they had been done on the original data. Specifically, our hypothesis was there would not be a difference between the outsourced decision trees built on encrypted data and the ones built on original data. METHODS: Using formal definitions, we developed an algorithm to protect medical data for outsourced analyses. The algorithm was applied to publicly available datasets (N=30) from the medical and life sciences fields. The analyses were performed on the original and the protected datasets and the results of the analyses were compared. Bootstrapped paired t tests for 2 dependent samples were used to test whether the mean differences in size, number of leaves, and the accuracy of the original and the encrypted decision trees were significantly different. RESULTS: The decision trees built on encrypted data were virtually the same as those built on original data. Out of 30 datasets, 100% of the trees had identical accuracy. The size of a tree and the number of leaves was different only once (1/30, 3%, P=.19). CONCLUSIONS: The proposed algorithm encrypts a file with plain text medical data into an encrypted file with the data protected in such a way that external data analyses are still possible. The results show that the results of analyses on original and on protected data are identical or comparably similar. The approach addresses the privacy and confidentiality issues that arise with medical data and is adherent to strict legal rules in the United States and Europe regarding the processing of the medical data.
format Online
Article
Text
id pubmed-3877744
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher JMIR Publications Inc.
record_format MEDLINE/PubMed
spelling pubmed-38777442014-01-02 Outsourcing Medical Data Analyses: Can Technology Overcome Legal, Privacy, and Confidentiality Issues? Brumen, Bostjan Heričko, Marjan Sevčnikar, Andrej Završnik, Jernej Hölbl, Marko J Med Internet Res Original Paper BACKGROUND: Medical data are gold mines for deriving the knowledge that could change the course of a single patient’s life or even the health of the entire population. A data analyst needs to have full access to relevant data, but full access may be denied by privacy and confidentiality of medical data legal regulations, especially when the data analyst is not affiliated with the data owner. OBJECTIVE: Our first objective was to analyze the privacy and confidentiality issues and the associated regulations pertaining to medical data, and to identify technologies to properly address these issues. Our second objective was to develop a procedure to protect medical data in such a way that the outsourced analyst would be capable of doing analyses on protected data and the results would be comparable, if not the same, as if they had been done on the original data. Specifically, our hypothesis was there would not be a difference between the outsourced decision trees built on encrypted data and the ones built on original data. METHODS: Using formal definitions, we developed an algorithm to protect medical data for outsourced analyses. The algorithm was applied to publicly available datasets (N=30) from the medical and life sciences fields. The analyses were performed on the original and the protected datasets and the results of the analyses were compared. Bootstrapped paired t tests for 2 dependent samples were used to test whether the mean differences in size, number of leaves, and the accuracy of the original and the encrypted decision trees were significantly different. RESULTS: The decision trees built on encrypted data were virtually the same as those built on original data. Out of 30 datasets, 100% of the trees had identical accuracy. The size of a tree and the number of leaves was different only once (1/30, 3%, P=.19). CONCLUSIONS: The proposed algorithm encrypts a file with plain text medical data into an encrypted file with the data protected in such a way that external data analyses are still possible. The results show that the results of analyses on original and on protected data are identical or comparably similar. The approach addresses the privacy and confidentiality issues that arise with medical data and is adherent to strict legal rules in the United States and Europe regarding the processing of the medical data. JMIR Publications Inc. 2013-12-16 /pmc/articles/PMC3877744/ /pubmed/24342053 http://dx.doi.org/10.2196/jmir.2471 Text en ©Bostjan Brumen, Marjan Heričko, Andrej Sevčnikar, Jernej Završnik, Marko Hölbl. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 16.12.2013. http://creativecommons.org/licenses/by/2.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Brumen, Bostjan
Heričko, Marjan
Sevčnikar, Andrej
Završnik, Jernej
Hölbl, Marko
Outsourcing Medical Data Analyses: Can Technology Overcome Legal, Privacy, and Confidentiality Issues?
title Outsourcing Medical Data Analyses: Can Technology Overcome Legal, Privacy, and Confidentiality Issues?
title_full Outsourcing Medical Data Analyses: Can Technology Overcome Legal, Privacy, and Confidentiality Issues?
title_fullStr Outsourcing Medical Data Analyses: Can Technology Overcome Legal, Privacy, and Confidentiality Issues?
title_full_unstemmed Outsourcing Medical Data Analyses: Can Technology Overcome Legal, Privacy, and Confidentiality Issues?
title_short Outsourcing Medical Data Analyses: Can Technology Overcome Legal, Privacy, and Confidentiality Issues?
title_sort outsourcing medical data analyses: can technology overcome legal, privacy, and confidentiality issues?
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3877744/
https://www.ncbi.nlm.nih.gov/pubmed/24342053
http://dx.doi.org/10.2196/jmir.2471
work_keys_str_mv AT brumenbostjan outsourcingmedicaldataanalysescantechnologyovercomelegalprivacyandconfidentialityissues
AT herickomarjan outsourcingmedicaldataanalysescantechnologyovercomelegalprivacyandconfidentialityissues
AT sevcnikarandrej outsourcingmedicaldataanalysescantechnologyovercomelegalprivacyandconfidentialityissues
AT zavrsnikjernej outsourcingmedicaldataanalysescantechnologyovercomelegalprivacyandconfidentialityissues
AT holblmarko outsourcingmedicaldataanalysescantechnologyovercomelegalprivacyandconfidentialityissues