Cargando…

Privacy-Preserving Patient Similarity Learning in a Federated Environment: Development and Analysis

BACKGROUND: There is an urgent need for the development of global analytic frameworks that can perform analyses in a privacy-preserving federated environment across multiple institutions without privacy leakage. A few studies on the topic of federated medical analysis have been conducted recently wi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lee, Junghye, Sun, Jimeng, Wang, Fei, Wang, Shuang, Jun, Chi-Hyuck, Jiang, Xiaoqian
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2018
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5924379/ https://www.ncbi.nlm.nih.gov/pubmed/29653917 http://dx.doi.org/10.2196/medinform.7744

_version_	1783318546076401664
author	Lee, Junghye Sun, Jimeng Wang, Fei Wang, Shuang Jun, Chi-Hyuck Jiang, Xiaoqian
author_facet	Lee, Junghye Sun, Jimeng Wang, Fei Wang, Shuang Jun, Chi-Hyuck Jiang, Xiaoqian
author_sort	Lee, Junghye
collection	PubMed
description	BACKGROUND: There is an urgent need for the development of global analytic frameworks that can perform analyses in a privacy-preserving federated environment across multiple institutions without privacy leakage. A few studies on the topic of federated medical analysis have been conducted recently with the focus on several algorithms. However, none of them have solved similar patient matching, which is useful for applications such as cohort construction for cross-institution observational studies, disease surveillance, and clinical trials recruitment. OBJECTIVE: The aim of this study was to present a privacy-preserving platform in a federated setting for patient similarity learning across institutions. Without sharing patient-level information, our model can find similar patients from one hospital to another. METHODS: We proposed a federated patient hashing framework and developed a novel algorithm to learn context-specific hash codes to represent patients across institutions. The similarities between patients can be efficiently computed using the resulting hash codes of corresponding patients. To avoid security attack from reverse engineering on the model, we applied homomorphic encryption to patient similarity search in a federated setting. RESULTS: We used sequential medical events extracted from the Multiparameter Intelligent Monitoring in Intensive Care-III database to evaluate the proposed algorithm in predicting the incidence of five diseases independently. Our algorithm achieved averaged area under the curves of 0.9154 and 0.8012 with balanced and imbalanced data, respectively, in κ-nearest neighbor with κ=3. We also confirmed privacy preservation in similarity search by using homomorphic encryption. CONCLUSIONS: The proposed algorithm can help search similar patients across institutions effectively to support federated data analysis in a privacy-preserving manner.
format	Online Article Text
id	pubmed-5924379
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-59243792018-05-03 Privacy-Preserving Patient Similarity Learning in a Federated Environment: Development and Analysis Lee, Junghye Sun, Jimeng Wang, Fei Wang, Shuang Jun, Chi-Hyuck Jiang, Xiaoqian JMIR Med Inform Original Paper BACKGROUND: There is an urgent need for the development of global analytic frameworks that can perform analyses in a privacy-preserving federated environment across multiple institutions without privacy leakage. A few studies on the topic of federated medical analysis have been conducted recently with the focus on several algorithms. However, none of them have solved similar patient matching, which is useful for applications such as cohort construction for cross-institution observational studies, disease surveillance, and clinical trials recruitment. OBJECTIVE: The aim of this study was to present a privacy-preserving platform in a federated setting for patient similarity learning across institutions. Without sharing patient-level information, our model can find similar patients from one hospital to another. METHODS: We proposed a federated patient hashing framework and developed a novel algorithm to learn context-specific hash codes to represent patients across institutions. The similarities between patients can be efficiently computed using the resulting hash codes of corresponding patients. To avoid security attack from reverse engineering on the model, we applied homomorphic encryption to patient similarity search in a federated setting. RESULTS: We used sequential medical events extracted from the Multiparameter Intelligent Monitoring in Intensive Care-III database to evaluate the proposed algorithm in predicting the incidence of five diseases independently. Our algorithm achieved averaged area under the curves of 0.9154 and 0.8012 with balanced and imbalanced data, respectively, in κ-nearest neighbor with κ=3. We also confirmed privacy preservation in similarity search by using homomorphic encryption. CONCLUSIONS: The proposed algorithm can help search similar patients across institutions effectively to support federated data analysis in a privacy-preserving manner. JMIR Publications 2018-04-13 /pmc/articles/PMC5924379/ /pubmed/29653917 http://dx.doi.org/10.2196/medinform.7744 Text en ©Junghye Lee, Jimeng Sun, Fei Wang, Shuang Wang, Chi-Hyuck Jun, Xiaoqian Jiang. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 13.04.2018. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle	Original Paper Lee, Junghye Sun, Jimeng Wang, Fei Wang, Shuang Jun, Chi-Hyuck Jiang, Xiaoqian Privacy-Preserving Patient Similarity Learning in a Federated Environment: Development and Analysis
title	Privacy-Preserving Patient Similarity Learning in a Federated Environment: Development and Analysis
title_full	Privacy-Preserving Patient Similarity Learning in a Federated Environment: Development and Analysis
title_fullStr	Privacy-Preserving Patient Similarity Learning in a Federated Environment: Development and Analysis
title_full_unstemmed	Privacy-Preserving Patient Similarity Learning in a Federated Environment: Development and Analysis
title_short	Privacy-Preserving Patient Similarity Learning in a Federated Environment: Development and Analysis
title_sort	privacy-preserving patient similarity learning in a federated environment: development and analysis
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5924379/ https://www.ncbi.nlm.nih.gov/pubmed/29653917 http://dx.doi.org/10.2196/medinform.7744
work_keys_str_mv	AT leejunghye privacypreservingpatientsimilaritylearninginafederatedenvironmentdevelopmentandanalysis AT sunjimeng privacypreservingpatientsimilaritylearninginafederatedenvironmentdevelopmentandanalysis AT wangfei privacypreservingpatientsimilaritylearninginafederatedenvironmentdevelopmentandanalysis AT wangshuang privacypreservingpatientsimilaritylearninginafederatedenvironmentdevelopmentandanalysis AT junchihyuck privacypreservingpatientsimilaritylearninginafederatedenvironmentdevelopmentandanalysis AT jiangxiaoqian privacypreservingpatientsimilaritylearninginafederatedenvironmentdevelopmentandanalysis

Privacy-Preserving Patient Similarity Learning in a Federated Environment: Development and Analysis

Ejemplares similares