
Federated learning for computational pathology on gigapixel whole slide images

Deep learning-based computational pathology algorithms have demonstrated a profound ability to excel in a wide array of tasks, ranging from the characterization of well-known morphological phenotypes to the prediction of non-human-identifiable features from histology, such as molecular alterations. However, the...


Bibliographic Details
Main authors: Lu, Ming Y., Chen, Richard J., Kong, Dehan, Lipkova, Jana, Singh, Rajendra, Williamson, Drew F.K., Chen, Tiffany Y., Mahmood, Faisal
Format: Online Article Text
Language: English
Published: 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9340569/
https://www.ncbi.nlm.nih.gov/pubmed/34911013
http://dx.doi.org/10.1016/j.media.2021.102298
author Lu, Ming Y.
Chen, Richard J.
Kong, Dehan
Lipkova, Jana
Singh, Rajendra
Williamson, Drew F.K.
Chen, Tiffany Y.
Mahmood, Faisal
collection PubMed
description Deep learning-based computational pathology algorithms have demonstrated a profound ability to excel in a wide array of tasks, ranging from the characterization of well-known morphological phenotypes to the prediction of non-human-identifiable features from histology, such as molecular alterations. However, the development of robust, adaptable, and accurate deep learning-based models often relies on the collection and time-consuming curation of large, high-quality annotated training datasets that should ideally come from diverse sources and patient populations to account for the heterogeneity that exists in such datasets. Multi-centric, collaborative integration of medical data across multiple institutions can naturally help overcome this challenge and boost model performance, but it is limited by privacy concerns, among other difficulties that may arise in the complex data-sharing process as models scale towards using hundreds of thousands of gigapixel whole slide images. In this paper, we introduce privacy-preserving federated learning for gigapixel whole slide images in computational pathology using weakly-supervised attention multiple instance learning and differential privacy. We evaluated our approach on two different diagnostic problems using thousands of histology whole slide images with only slide-level labels. Additionally, we present a weakly-supervised learning framework for survival prediction and patient stratification from whole slide images and demonstrate its effectiveness in a federated setting. Our results show that, using federated learning, we can effectively develop accurate weakly-supervised deep learning models from distributed data silos without direct data sharing and its associated complexities, while also preserving differential privacy using randomized noise generation. We also make available an easy-to-use federated learning software package for computational pathology: http://github.com/mahmoodlab/HistoFL.
format Online
Article
Text
id pubmed-9340569
institution National Center for Biotechnology Information
language English
publishDate 2022
record_format MEDLINE/PubMed
spelling pubmed-9340569 2022-08-01 Federated learning for computational pathology on gigapixel whole slide images Lu, Ming Y. Chen, Richard J. Kong, Dehan Lipkova, Jana Singh, Rajendra Williamson, Drew F.K. Chen, Tiffany Y. Mahmood, Faisal Med Image Anal Article 2022-02 2021-11-25 /pmc/articles/PMC9340569/ /pubmed/34911013 http://dx.doi.org/10.1016/j.media.2021.102298 Text en This is an open access article under the CC BY-NC-ND license (https://creativecommons.org/licenses/by-nc-nd/4.0/).
title Federated learning for computational pathology on gigapixel whole slide images
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9340569/
https://www.ncbi.nlm.nih.gov/pubmed/34911013
http://dx.doi.org/10.1016/j.media.2021.102298