
Challenges in Implementing the Local Node Infrastructure for a National Federated Machine Learning Network in Radiology

Data-driven machine learning in medical research and diagnostics needs large-scale datasets curated by clinical experts. The generation of large datasets can be challenging in terms of resource consumption and time effort, while generalizability and validation of the developed models significantly b...


Bibliographic Details
Main Authors: Jacobs, Paul-Philipp, Ehrengut, Constantin, Bucher, Andreas Michael, Penzkofer, Tobias, Lukas, Mathias, Kleesiek, Jens, Denecke, Timm
Format: Online Article Text
Language: English
Published: MDPI 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10487228/
https://www.ncbi.nlm.nih.gov/pubmed/37685411
http://dx.doi.org/10.3390/healthcare11172377
_version_ 1785103190094512128
author Jacobs, Paul-Philipp
Ehrengut, Constantin
Bucher, Andreas Michael
Penzkofer, Tobias
Lukas, Mathias
Kleesiek, Jens
Denecke, Timm
author_facet Jacobs, Paul-Philipp
Ehrengut, Constantin
Bucher, Andreas Michael
Penzkofer, Tobias
Lukas, Mathias
Kleesiek, Jens
Denecke, Timm
author_sort Jacobs, Paul-Philipp
collection PubMed
description Data-driven machine learning in medical research and diagnostics needs large-scale datasets curated by clinical experts. The generation of large datasets can be challenging in terms of resource consumption and time, while the generalizability and validation of the developed models benefit significantly from variety in data sources. Training algorithms on smaller decentralized datasets through federated learning can reduce effort, but requires the implementation of a specific and ambitious infrastructure to share data, algorithms and computing time. Additionally, it allows the data to be kept and maintained locally. Thus, data safety issues can be avoided because patient data need not be shared. Machine learning models are trained on local data by sharing the model through an established network. In addition to commercial applications, there are also numerous academic and customized implementations of network infrastructures available. The configurations of these networks differ, yet they adhere to a standard framework composed of fundamental components. In this technical note, we propose basic infrastructure requirements for data governance, data science workflows, and local node set-up, and report on the advantages and pitfalls experienced in implementing the local infrastructure, with the German Radiological Cooperative Network initiative as the use case example. We show how the infrastructure can be built upon some base components to reflect the needs of a federated learning network and how they can be implemented considering both local and global network requirements. After analyzing the deployment process in different settings and scenarios, we recommend integrating the local node into an existing clinical IT infrastructure. This approach offers benefits in terms of maintenance and deployment effort compared to external integration in a separate environment (e.g., the radiology department). This proposed groundwork can be taken as an exemplary development guideline for future applications of federated learning networks in clinical and scientific environments.
format Online
Article
Text
id pubmed-10487228
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-10487228 2023-09-09 Challenges in Implementing the Local Node Infrastructure for a National Federated Machine Learning Network in Radiology Jacobs, Paul-Philipp Ehrengut, Constantin Bucher, Andreas Michael Penzkofer, Tobias Lukas, Mathias Kleesiek, Jens Denecke, Timm Healthcare (Basel) Technical Note Data-driven machine learning in medical research and diagnostics needs large-scale datasets curated by clinical experts. The generation of large datasets can be challenging in terms of resource consumption and time, while the generalizability and validation of the developed models benefit significantly from variety in data sources. Training algorithms on smaller decentralized datasets through federated learning can reduce effort, but requires the implementation of a specific and ambitious infrastructure to share data, algorithms and computing time. Additionally, it allows the data to be kept and maintained locally. Thus, data safety issues can be avoided because patient data need not be shared. Machine learning models are trained on local data by sharing the model through an established network. In addition to commercial applications, there are also numerous academic and customized implementations of network infrastructures available. The configurations of these networks differ, yet they adhere to a standard framework composed of fundamental components. In this technical note, we propose basic infrastructure requirements for data governance, data science workflows, and local node set-up, and report on the advantages and pitfalls experienced in implementing the local infrastructure, with the German Radiological Cooperative Network initiative as the use case example. We show how the infrastructure can be built upon some base components to reflect the needs of a federated learning network and how they can be implemented considering both local and global network requirements. After analyzing the deployment process in different settings and scenarios, we recommend integrating the local node into an existing clinical IT infrastructure. This approach offers benefits in terms of maintenance and deployment effort compared to external integration in a separate environment (e.g., the radiology department). This proposed groundwork can be taken as an exemplary development guideline for future applications of federated learning networks in clinical and scientific environments. MDPI 2023-08-23 /pmc/articles/PMC10487228/ /pubmed/37685411 http://dx.doi.org/10.3390/healthcare11172377 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Technical Note
Jacobs, Paul-Philipp
Ehrengut, Constantin
Bucher, Andreas Michael
Penzkofer, Tobias
Lukas, Mathias
Kleesiek, Jens
Denecke, Timm
Challenges in Implementing the Local Node Infrastructure for a National Federated Machine Learning Network in Radiology
title Challenges in Implementing the Local Node Infrastructure for a National Federated Machine Learning Network in Radiology
title_sort challenges in implementing the local node infrastructure for a national federated machine learning network in radiology
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10487228/
https://www.ncbi.nlm.nih.gov/pubmed/37685411
http://dx.doi.org/10.3390/healthcare11172377