Cargando…

Feasibility study of multi-site split learning for privacy-preserving medical systems under data imbalance constraints in COVID-19, X-ray, and cholesterol dataset

It seems as though progressively more people are in the race to upload content, data, and information online; and hospitals haven’t neglected this trend either. Hospitals are now at the forefront for multi-site medical data sharing to provide ground-breaking advancements in the way health records ar...

Descripción completa

Detalles Bibliográficos
Autores principales: Ha, Yoo Jeong, Lee, Gusang, Yoo, Minjae, Jung, Soyi, Yoo, Seehwan, Kim, Joongheon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8795162/
https://www.ncbi.nlm.nih.gov/pubmed/35087165
http://dx.doi.org/10.1038/s41598-022-05615-y
_version_ 1784640990199414784
author Ha, Yoo Jeong
Lee, Gusang
Yoo, Minjae
Jung, Soyi
Yoo, Seehwan
Kim, Joongheon
author_facet Ha, Yoo Jeong
Lee, Gusang
Yoo, Minjae
Jung, Soyi
Yoo, Seehwan
Kim, Joongheon
author_sort Ha, Yoo Jeong
collection PubMed
description It seems as though progressively more people are in the race to upload content, data, and information online; and hospitals haven’t neglected this trend either. Hospitals are now at the forefront for multi-site medical data sharing to provide ground-breaking advancements in the way health records are shared and patients are diagnosed. Sharing of medical data is essential in modern medical research. Yet, as with all data sharing technology, the challenge is to balance improved treatment with protecting patient’s personal information. This paper provides a novel split learning algorithm coined the term, “multi-site split learning”, which enables a secure transfer of medical data between multiple hospitals without fear of exposing personal data contained in patient records. It also explores the effects of varying the number of end-systems and the ratio of data-imbalance on the deep learning performance. A guideline for the most optimal configuration of split learning that ensures privacy of patient data whilst achieving performance is empirically given. We argue the benefits of our multi-site split learning algorithm, especially regarding the privacy preserving factor, using CT scans of COVID-19 patients, X-ray bone scans, and cholesterol level medical data.
format Online
Article
Text
id pubmed-8795162
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-87951622022-01-28 Feasibility study of multi-site split learning for privacy-preserving medical systems under data imbalance constraints in COVID-19, X-ray, and cholesterol dataset Ha, Yoo Jeong Lee, Gusang Yoo, Minjae Jung, Soyi Yoo, Seehwan Kim, Joongheon Sci Rep Article It seems as though progressively more people are in the race to upload content, data, and information online; and hospitals haven’t neglected this trend either. Hospitals are now at the forefront for multi-site medical data sharing to provide ground-breaking advancements in the way health records are shared and patients are diagnosed. Sharing of medical data is essential in modern medical research. Yet, as with all data sharing technology, the challenge is to balance improved treatment with protecting patient’s personal information. This paper provides a novel split learning algorithm coined the term, “multi-site split learning”, which enables a secure transfer of medical data between multiple hospitals without fear of exposing personal data contained in patient records. It also explores the effects of varying the number of end-systems and the ratio of data-imbalance on the deep learning performance. A guideline for the most optimal configuration of split learning that ensures privacy of patient data whilst achieving performance is empirically given. We argue the benefits of our multi-site split learning algorithm, especially regarding the privacy preserving factor, using CT scans of COVID-19 patients, X-ray bone scans, and cholesterol level medical data. Nature Publishing Group UK 2022-01-27 /pmc/articles/PMC8795162/ /pubmed/35087165 http://dx.doi.org/10.1038/s41598-022-05615-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Ha, Yoo Jeong
Lee, Gusang
Yoo, Minjae
Jung, Soyi
Yoo, Seehwan
Kim, Joongheon
Feasibility study of multi-site split learning for privacy-preserving medical systems under data imbalance constraints in COVID-19, X-ray, and cholesterol dataset
title Feasibility study of multi-site split learning for privacy-preserving medical systems under data imbalance constraints in COVID-19, X-ray, and cholesterol dataset
title_full Feasibility study of multi-site split learning for privacy-preserving medical systems under data imbalance constraints in COVID-19, X-ray, and cholesterol dataset
title_fullStr Feasibility study of multi-site split learning for privacy-preserving medical systems under data imbalance constraints in COVID-19, X-ray, and cholesterol dataset
title_full_unstemmed Feasibility study of multi-site split learning for privacy-preserving medical systems under data imbalance constraints in COVID-19, X-ray, and cholesterol dataset
title_short Feasibility study of multi-site split learning for privacy-preserving medical systems under data imbalance constraints in COVID-19, X-ray, and cholesterol dataset
title_sort feasibility study of multi-site split learning for privacy-preserving medical systems under data imbalance constraints in covid-19, x-ray, and cholesterol dataset
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8795162/
https://www.ncbi.nlm.nih.gov/pubmed/35087165
http://dx.doi.org/10.1038/s41598-022-05615-y
work_keys_str_mv AT hayoojeong feasibilitystudyofmultisitesplitlearningforprivacypreservingmedicalsystemsunderdataimbalanceconstraintsincovid19xrayandcholesteroldataset
AT leegusang feasibilitystudyofmultisitesplitlearningforprivacypreservingmedicalsystemsunderdataimbalanceconstraintsincovid19xrayandcholesteroldataset
AT yoominjae feasibilitystudyofmultisitesplitlearningforprivacypreservingmedicalsystemsunderdataimbalanceconstraintsincovid19xrayandcholesteroldataset
AT jungsoyi feasibilitystudyofmultisitesplitlearningforprivacypreservingmedicalsystemsunderdataimbalanceconstraintsincovid19xrayandcholesteroldataset
AT yooseehwan feasibilitystudyofmultisitesplitlearningforprivacypreservingmedicalsystemsunderdataimbalanceconstraintsincovid19xrayandcholesteroldataset
AT kimjoongheon feasibilitystudyofmultisitesplitlearningforprivacypreservingmedicalsystemsunderdataimbalanceconstraintsincovid19xrayandcholesteroldataset