Cargando…
High performance of privacy-preserving acute myocardial infarction auxiliary diagnosis based on federated learning: a multicenter retrospective study
BACKGROUND: Multicenter medical research is becoming a new trend. However, interagency data privacy security protection has become a major bottleneck of multicenter research. Therefore, to overcome this privacy protection issue, the aim of the present study was to apply a self-developed privacy-pres...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
AME Publishing Company
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9577776/ https://www.ncbi.nlm.nih.gov/pubmed/36267731 http://dx.doi.org/10.21037/atm-22-4331 |
_version_ | 1784811834756300800 |
---|---|
author | Xu, Jie Zhang, Yu Yu, Huamin Lin, Bo Xiang, Peng Lin, Te Lu, Huizhe Zhang, Guiying Wang, Dejian Yuan, Hong Hu, Bin Jiang, Jun |
author_facet | Xu, Jie Zhang, Yu Yu, Huamin Lin, Bo Xiang, Peng Lin, Te Lu, Huizhe Zhang, Guiying Wang, Dejian Yuan, Hong Hu, Bin Jiang, Jun |
author_sort | Xu, Jie |
collection | PubMed |
description | BACKGROUND: Multicenter medical research is becoming a new trend. However, interagency data privacy security protection has become a major bottleneck of multicenter research. Therefore, to overcome this privacy protection issue, the aim of the present study was to apply a self-developed privacy-preserving machine learning framework for researchers who can build models on medical data from multiple sources, while providing privacy protection for both sensitive data and the learned model. METHODS: Based on Arya, a novel privacy computing platform developed by Healink, we constructed a privacy-preserving federated learning (FL) model using the fully connected neural network with datasets from 2–3 individual medical institutions. In the dataset, 80% of records were used for joint modeling on acute myocardial infarction (AMI) diagnosis. Modeling efficacy was evaluated with the remaining 20% of records. As the control, 1,500 medical records from 1 medical institution were used for single-center modeling and efficacy evaluation. During the process, the original data were still kept in individual hospital without moving or transferring out of the hospitial. The diagnostic efficacy (sensitivity, positive predictive value, and accuracy) was evaluated. RESULTS: Our privacy-preserving FL model gives reliable AMI diagnostic efficacy. Three-center modeling (79% sensitivity, 88% positive predictive value, and 82.3% accuracy) and two-center modeling (77.8% or 77.6% sensitivity, 86.7% or 85.30% positive predictive value, and 81% or 79.7% accuracy) achieved relative high diagnostic efficacy; and single-center modeling achieved relative low diagnostic efficacy (76% sensitivity, 84.7% positive predictive value, and 79% accuracy). CONCLUSIONS: The Arya privacy computing platform is efficient and practical for the FL model, which could promote multicenter medical research securely without sacrificing diagnostic efficacy. |
format | Online Article Text |
id | pubmed-9577776 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | AME Publishing Company |
record_format | MEDLINE/PubMed |
spelling | pubmed-95777762022-10-19 High performance of privacy-preserving acute myocardial infarction auxiliary diagnosis based on federated learning: a multicenter retrospective study Xu, Jie Zhang, Yu Yu, Huamin Lin, Bo Xiang, Peng Lin, Te Lu, Huizhe Zhang, Guiying Wang, Dejian Yuan, Hong Hu, Bin Jiang, Jun Ann Transl Med Original Article BACKGROUND: Multicenter medical research is becoming a new trend. However, interagency data privacy security protection has become a major bottleneck of multicenter research. Therefore, to overcome this privacy protection issue, the aim of the present study was to apply a self-developed privacy-preserving machine learning framework for researchers who can build models on medical data from multiple sources, while providing privacy protection for both sensitive data and the learned model. METHODS: Based on Arya, a novel privacy computing platform developed by Healink, we constructed a privacy-preserving federated learning (FL) model using the fully connected neural network with datasets from 2–3 individual medical institutions. In the dataset, 80% of records were used for joint modeling on acute myocardial infarction (AMI) diagnosis. Modeling efficacy was evaluated with the remaining 20% of records. As the control, 1,500 medical records from 1 medical institution were used for single-center modeling and efficacy evaluation. During the process, the original data were still kept in individual hospital without moving or transferring out of the hospitial. The diagnostic efficacy (sensitivity, positive predictive value, and accuracy) was evaluated. RESULTS: Our privacy-preserving FL model gives reliable AMI diagnostic efficacy. Three-center modeling (79% sensitivity, 88% positive predictive value, and 82.3% accuracy) and two-center modeling (77.8% or 77.6% sensitivity, 86.7% or 85.30% positive predictive value, and 81% or 79.7% accuracy) achieved relative high diagnostic efficacy; and single-center modeling achieved relative low diagnostic efficacy (76% sensitivity, 84.7% positive predictive value, and 79% accuracy). CONCLUSIONS: The Arya privacy computing platform is efficient and practical for the FL model, which could promote multicenter medical research securely without sacrificing diagnostic efficacy. AME Publishing Company 2022-09 /pmc/articles/PMC9577776/ /pubmed/36267731 http://dx.doi.org/10.21037/atm-22-4331 Text en 2022 Annals of Translational Medicine. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Original Article Xu, Jie Zhang, Yu Yu, Huamin Lin, Bo Xiang, Peng Lin, Te Lu, Huizhe Zhang, Guiying Wang, Dejian Yuan, Hong Hu, Bin Jiang, Jun High performance of privacy-preserving acute myocardial infarction auxiliary diagnosis based on federated learning: a multicenter retrospective study |
title | High performance of privacy-preserving acute myocardial infarction auxiliary diagnosis based on federated learning: a multicenter retrospective study |
title_full | High performance of privacy-preserving acute myocardial infarction auxiliary diagnosis based on federated learning: a multicenter retrospective study |
title_fullStr | High performance of privacy-preserving acute myocardial infarction auxiliary diagnosis based on federated learning: a multicenter retrospective study |
title_full_unstemmed | High performance of privacy-preserving acute myocardial infarction auxiliary diagnosis based on federated learning: a multicenter retrospective study |
title_short | High performance of privacy-preserving acute myocardial infarction auxiliary diagnosis based on federated learning: a multicenter retrospective study |
title_sort | high performance of privacy-preserving acute myocardial infarction auxiliary diagnosis based on federated learning: a multicenter retrospective study |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9577776/ https://www.ncbi.nlm.nih.gov/pubmed/36267731 http://dx.doi.org/10.21037/atm-22-4331 |
work_keys_str_mv | AT xujie highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy AT zhangyu highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy AT yuhuamin highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy AT linbo highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy AT xiangpeng highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy AT linte highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy AT luhuizhe highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy AT zhangguiying highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy AT wangdejian highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy AT yuanhong highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy AT hubin highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy AT jiangjun highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy |