Cargando…

Collaborative training of medical artificial intelligence models with non-uniform labels

Due to the rapid advancements in recent years, medical image analysis is largely dominated by deep learning (DL). However, building powerful and robust DL models requires training with large multi-party datasets. While multiple stakeholders have provided publicly available datasets, the ways in whic...

Descripción completa

Detalles Bibliográficos
Autores principales: Tayebi Arasteh, Soroosh, Isfort, Peter, Saehn, Marwin, Mueller-Franzes, Gustav, Khader, Firas, Kather, Jakob Nikolas, Kuhl, Christiane, Nebelung, Sven, Truhn, Daniel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10102221/
https://www.ncbi.nlm.nih.gov/pubmed/37055456
http://dx.doi.org/10.1038/s41598-023-33303-y
_version_ 1785025652047478784
author Tayebi Arasteh, Soroosh
Isfort, Peter
Saehn, Marwin
Mueller-Franzes, Gustav
Khader, Firas
Kather, Jakob Nikolas
Kuhl, Christiane
Nebelung, Sven
Truhn, Daniel
author_facet Tayebi Arasteh, Soroosh
Isfort, Peter
Saehn, Marwin
Mueller-Franzes, Gustav
Khader, Firas
Kather, Jakob Nikolas
Kuhl, Christiane
Nebelung, Sven
Truhn, Daniel
author_sort Tayebi Arasteh, Soroosh
collection PubMed
description Due to the rapid advancements in recent years, medical image analysis is largely dominated by deep learning (DL). However, building powerful and robust DL models requires training with large multi-party datasets. While multiple stakeholders have provided publicly available datasets, the ways in which these data are labeled vary widely. For Instance, an institution might provide a dataset of chest radiographs containing labels denoting the presence of pneumonia, while another institution might have a focus on determining the presence of metastases in the lung. Training a single AI model utilizing all these data is not feasible with conventional federated learning (FL). This prompts us to propose an extension to the widespread FL process, namely flexible federated learning (FFL) for collaborative training on such data. Using 695,000 chest radiographs from five institutions from across the globe—each with differing labels—we demonstrate that having heterogeneously labeled datasets, FFL-based training leads to significant performance increase compared to conventional FL training, where only the uniformly annotated images are utilized. We believe that our proposed algorithm could accelerate the process of bringing collaborative training methods from research and simulation phase to the real-world applications in healthcare.
format Online
Article
Text
id pubmed-10102221
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-101022212023-04-15 Collaborative training of medical artificial intelligence models with non-uniform labels Tayebi Arasteh, Soroosh Isfort, Peter Saehn, Marwin Mueller-Franzes, Gustav Khader, Firas Kather, Jakob Nikolas Kuhl, Christiane Nebelung, Sven Truhn, Daniel Sci Rep Article Due to the rapid advancements in recent years, medical image analysis is largely dominated by deep learning (DL). However, building powerful and robust DL models requires training with large multi-party datasets. While multiple stakeholders have provided publicly available datasets, the ways in which these data are labeled vary widely. For Instance, an institution might provide a dataset of chest radiographs containing labels denoting the presence of pneumonia, while another institution might have a focus on determining the presence of metastases in the lung. Training a single AI model utilizing all these data is not feasible with conventional federated learning (FL). This prompts us to propose an extension to the widespread FL process, namely flexible federated learning (FFL) for collaborative training on such data. Using 695,000 chest radiographs from five institutions from across the globe—each with differing labels—we demonstrate that having heterogeneously labeled datasets, FFL-based training leads to significant performance increase compared to conventional FL training, where only the uniformly annotated images are utilized. We believe that our proposed algorithm could accelerate the process of bringing collaborative training methods from research and simulation phase to the real-world applications in healthcare. Nature Publishing Group UK 2023-04-13 /pmc/articles/PMC10102221/ /pubmed/37055456 http://dx.doi.org/10.1038/s41598-023-33303-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Tayebi Arasteh, Soroosh
Isfort, Peter
Saehn, Marwin
Mueller-Franzes, Gustav
Khader, Firas
Kather, Jakob Nikolas
Kuhl, Christiane
Nebelung, Sven
Truhn, Daniel
Collaborative training of medical artificial intelligence models with non-uniform labels
title Collaborative training of medical artificial intelligence models with non-uniform labels
title_full Collaborative training of medical artificial intelligence models with non-uniform labels
title_fullStr Collaborative training of medical artificial intelligence models with non-uniform labels
title_full_unstemmed Collaborative training of medical artificial intelligence models with non-uniform labels
title_short Collaborative training of medical artificial intelligence models with non-uniform labels
title_sort collaborative training of medical artificial intelligence models with non-uniform labels
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10102221/
https://www.ncbi.nlm.nih.gov/pubmed/37055456
http://dx.doi.org/10.1038/s41598-023-33303-y
work_keys_str_mv AT tayebiarastehsoroosh collaborativetrainingofmedicalartificialintelligencemodelswithnonuniformlabels
AT isfortpeter collaborativetrainingofmedicalartificialintelligencemodelswithnonuniformlabels
AT saehnmarwin collaborativetrainingofmedicalartificialintelligencemodelswithnonuniformlabels
AT muellerfranzesgustav collaborativetrainingofmedicalartificialintelligencemodelswithnonuniformlabels
AT khaderfiras collaborativetrainingofmedicalartificialintelligencemodelswithnonuniformlabels
AT katherjakobnikolas collaborativetrainingofmedicalartificialintelligencemodelswithnonuniformlabels
AT kuhlchristiane collaborativetrainingofmedicalartificialintelligencemodelswithnonuniformlabels
AT nebelungsven collaborativetrainingofmedicalartificialintelligencemodelswithnonuniformlabels
AT truhndaniel collaborativetrainingofmedicalartificialintelligencemodelswithnonuniformlabels