Cargando…

High performance of privacy-preserving acute myocardial infarction auxiliary diagnosis based on federated learning: a multicenter retrospective study

BACKGROUND: Multicenter medical research is becoming a new trend. However, interagency data privacy security protection has become a major bottleneck of multicenter research. Therefore, to overcome this privacy protection issue, the aim of the present study was to apply a self-developed privacy-pres...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Jie, Zhang, Yu, Yu, Huamin, Lin, Bo, Xiang, Peng, Lin, Te, Lu, Huizhe, Zhang, Guiying, Wang, Dejian, Yuan, Hong, Hu, Bin, Jiang, Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: AME Publishing Company 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9577776/
https://www.ncbi.nlm.nih.gov/pubmed/36267731
http://dx.doi.org/10.21037/atm-22-4331
_version_ 1784811834756300800
author Xu, Jie
Zhang, Yu
Yu, Huamin
Lin, Bo
Xiang, Peng
Lin, Te
Lu, Huizhe
Zhang, Guiying
Wang, Dejian
Yuan, Hong
Hu, Bin
Jiang, Jun
author_facet Xu, Jie
Zhang, Yu
Yu, Huamin
Lin, Bo
Xiang, Peng
Lin, Te
Lu, Huizhe
Zhang, Guiying
Wang, Dejian
Yuan, Hong
Hu, Bin
Jiang, Jun
author_sort Xu, Jie
collection PubMed
description BACKGROUND: Multicenter medical research is becoming a new trend. However, interagency data privacy security protection has become a major bottleneck of multicenter research. Therefore, to overcome this privacy protection issue, the aim of the present study was to apply a self-developed privacy-preserving machine learning framework for researchers who can build models on medical data from multiple sources, while providing privacy protection for both sensitive data and the learned model. METHODS: Based on Arya, a novel privacy computing platform developed by Healink, we constructed a privacy-preserving federated learning (FL) model using the fully connected neural network with datasets from 2–3 individual medical institutions. In the dataset, 80% of records were used for joint modeling on acute myocardial infarction (AMI) diagnosis. Modeling efficacy was evaluated with the remaining 20% of records. As the control, 1,500 medical records from 1 medical institution were used for single-center modeling and efficacy evaluation. During the process, the original data were still kept in individual hospital without moving or transferring out of the hospitial. The diagnostic efficacy (sensitivity, positive predictive value, and accuracy) was evaluated. RESULTS: Our privacy-preserving FL model gives reliable AMI diagnostic efficacy. Three-center modeling (79% sensitivity, 88% positive predictive value, and 82.3% accuracy) and two-center modeling (77.8% or 77.6% sensitivity, 86.7% or 85.30% positive predictive value, and 81% or 79.7% accuracy) achieved relative high diagnostic efficacy; and single-center modeling achieved relative low diagnostic efficacy (76% sensitivity, 84.7% positive predictive value, and 79% accuracy). CONCLUSIONS: The Arya privacy computing platform is efficient and practical for the FL model, which could promote multicenter medical research securely without sacrificing diagnostic efficacy.
format Online
Article
Text
id pubmed-9577776
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher AME Publishing Company
record_format MEDLINE/PubMed
spelling pubmed-95777762022-10-19 High performance of privacy-preserving acute myocardial infarction auxiliary diagnosis based on federated learning: a multicenter retrospective study Xu, Jie Zhang, Yu Yu, Huamin Lin, Bo Xiang, Peng Lin, Te Lu, Huizhe Zhang, Guiying Wang, Dejian Yuan, Hong Hu, Bin Jiang, Jun Ann Transl Med Original Article BACKGROUND: Multicenter medical research is becoming a new trend. However, interagency data privacy security protection has become a major bottleneck of multicenter research. Therefore, to overcome this privacy protection issue, the aim of the present study was to apply a self-developed privacy-preserving machine learning framework for researchers who can build models on medical data from multiple sources, while providing privacy protection for both sensitive data and the learned model. METHODS: Based on Arya, a novel privacy computing platform developed by Healink, we constructed a privacy-preserving federated learning (FL) model using the fully connected neural network with datasets from 2–3 individual medical institutions. In the dataset, 80% of records were used for joint modeling on acute myocardial infarction (AMI) diagnosis. Modeling efficacy was evaluated with the remaining 20% of records. As the control, 1,500 medical records from 1 medical institution were used for single-center modeling and efficacy evaluation. During the process, the original data were still kept in individual hospital without moving or transferring out of the hospitial. The diagnostic efficacy (sensitivity, positive predictive value, and accuracy) was evaluated. RESULTS: Our privacy-preserving FL model gives reliable AMI diagnostic efficacy. Three-center modeling (79% sensitivity, 88% positive predictive value, and 82.3% accuracy) and two-center modeling (77.8% or 77.6% sensitivity, 86.7% or 85.30% positive predictive value, and 81% or 79.7% accuracy) achieved relative high diagnostic efficacy; and single-center modeling achieved relative low diagnostic efficacy (76% sensitivity, 84.7% positive predictive value, and 79% accuracy). CONCLUSIONS: The Arya privacy computing platform is efficient and practical for the FL model, which could promote multicenter medical research securely without sacrificing diagnostic efficacy. AME Publishing Company 2022-09 /pmc/articles/PMC9577776/ /pubmed/36267731 http://dx.doi.org/10.21037/atm-22-4331 Text en 2022 Annals of Translational Medicine. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Original Article
Xu, Jie
Zhang, Yu
Yu, Huamin
Lin, Bo
Xiang, Peng
Lin, Te
Lu, Huizhe
Zhang, Guiying
Wang, Dejian
Yuan, Hong
Hu, Bin
Jiang, Jun
High performance of privacy-preserving acute myocardial infarction auxiliary diagnosis based on federated learning: a multicenter retrospective study
title High performance of privacy-preserving acute myocardial infarction auxiliary diagnosis based on federated learning: a multicenter retrospective study
title_full High performance of privacy-preserving acute myocardial infarction auxiliary diagnosis based on federated learning: a multicenter retrospective study
title_fullStr High performance of privacy-preserving acute myocardial infarction auxiliary diagnosis based on federated learning: a multicenter retrospective study
title_full_unstemmed High performance of privacy-preserving acute myocardial infarction auxiliary diagnosis based on federated learning: a multicenter retrospective study
title_short High performance of privacy-preserving acute myocardial infarction auxiliary diagnosis based on federated learning: a multicenter retrospective study
title_sort high performance of privacy-preserving acute myocardial infarction auxiliary diagnosis based on federated learning: a multicenter retrospective study
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9577776/
https://www.ncbi.nlm.nih.gov/pubmed/36267731
http://dx.doi.org/10.21037/atm-22-4331
work_keys_str_mv AT xujie highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy
AT zhangyu highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy
AT yuhuamin highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy
AT linbo highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy
AT xiangpeng highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy
AT linte highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy
AT luhuizhe highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy
AT zhangguiying highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy
AT wangdejian highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy
AT yuanhong highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy
AT hubin highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy
AT jiangjun highperformanceofprivacypreservingacutemyocardialinfarctionauxiliarydiagnosisbasedonfederatedlearningamulticenterretrospectivestudy