Cargando…

Quantification of the Immune Content in Neuroblastoma: Deep Learning and Topological Data Analysis in Digital Pathology

We introduce here a novel machine learning (ML) framework to address the issue of the quantitative assessment of the immune content in neuroblastoma (NB) specimens. First, the EUNet, a U-Net with an EfficientNet encoder, is trained to detect lymphocytes on tissue digital slides stained with the CD3...

Descripción completa

Detalles Bibliográficos
Autores principales: Bussola, Nicole, Papa, Bruno, Melaiu, Ombretta, Castellano, Aurora, Fruci, Doriana, Jurman, Giuseppe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8396341/
https://www.ncbi.nlm.nih.gov/pubmed/34445517
http://dx.doi.org/10.3390/ijms22168804
_version_ 1783744351498665984
author Bussola, Nicole
Papa, Bruno
Melaiu, Ombretta
Castellano, Aurora
Fruci, Doriana
Jurman, Giuseppe
author_facet Bussola, Nicole
Papa, Bruno
Melaiu, Ombretta
Castellano, Aurora
Fruci, Doriana
Jurman, Giuseppe
author_sort Bussola, Nicole
collection PubMed
description We introduce here a novel machine learning (ML) framework to address the issue of the quantitative assessment of the immune content in neuroblastoma (NB) specimens. First, the EUNet, a U-Net with an EfficientNet encoder, is trained to detect lymphocytes on tissue digital slides stained with the CD3 T-cell marker. The training set consists of 3782 images extracted from an original collection of 54 whole slide images (WSIs), manually annotated for a total of 73,751 lymphocytes. Resampling strategies, data augmentation, and transfer learning approaches are adopted to warrant reproducibility and to reduce the risk of overfitting and selection bias. Topological data analysis (TDA) is then used to define activation maps from different layers of the neural network at different stages of the training process, described by persistence diagrams (PD) and Betti curves. TDA is further integrated with the uniform manifold approximation and projection (UMAP) dimensionality reduction and the hierarchical density-based spatial clustering of applications with noise (HDBSCAN) algorithm for clustering, by the deep features, the relevant subgroups and structures, across different levels of the neural network. Finally, the recent TwoNN approach is leveraged to study the variation of the intrinsic dimensionality of the U-Net model. As the main task, the proposed pipeline is employed to evaluate the density of lymphocytes over the whole tissue area of the WSIs. The model achieves good results with mean absolute error 3.1 on test set, showing significant agreement between densities estimated by our EUNet model and by trained pathologists, thus indicating the potentialities of a promising new strategy in the quantification of the immune content in NB specimens. Moreover, the UMAP algorithm unveiled interesting patterns compatible with pathological characteristics, also highlighting novel insights into the dynamics of the intrinsic dataset dimensionality at different stages of the training process. All the experiments were run on the Microsoft Azure cloud platform.
format Online
Article
Text
id pubmed-8396341
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-83963412021-08-28 Quantification of the Immune Content in Neuroblastoma: Deep Learning and Topological Data Analysis in Digital Pathology Bussola, Nicole Papa, Bruno Melaiu, Ombretta Castellano, Aurora Fruci, Doriana Jurman, Giuseppe Int J Mol Sci Article We introduce here a novel machine learning (ML) framework to address the issue of the quantitative assessment of the immune content in neuroblastoma (NB) specimens. First, the EUNet, a U-Net with an EfficientNet encoder, is trained to detect lymphocytes on tissue digital slides stained with the CD3 T-cell marker. The training set consists of 3782 images extracted from an original collection of 54 whole slide images (WSIs), manually annotated for a total of 73,751 lymphocytes. Resampling strategies, data augmentation, and transfer learning approaches are adopted to warrant reproducibility and to reduce the risk of overfitting and selection bias. Topological data analysis (TDA) is then used to define activation maps from different layers of the neural network at different stages of the training process, described by persistence diagrams (PD) and Betti curves. TDA is further integrated with the uniform manifold approximation and projection (UMAP) dimensionality reduction and the hierarchical density-based spatial clustering of applications with noise (HDBSCAN) algorithm for clustering, by the deep features, the relevant subgroups and structures, across different levels of the neural network. Finally, the recent TwoNN approach is leveraged to study the variation of the intrinsic dimensionality of the U-Net model. As the main task, the proposed pipeline is employed to evaluate the density of lymphocytes over the whole tissue area of the WSIs. The model achieves good results with mean absolute error 3.1 on test set, showing significant agreement between densities estimated by our EUNet model and by trained pathologists, thus indicating the potentialities of a promising new strategy in the quantification of the immune content in NB specimens. Moreover, the UMAP algorithm unveiled interesting patterns compatible with pathological characteristics, also highlighting novel insights into the dynamics of the intrinsic dataset dimensionality at different stages of the training process. All the experiments were run on the Microsoft Azure cloud platform. MDPI 2021-08-16 /pmc/articles/PMC8396341/ /pubmed/34445517 http://dx.doi.org/10.3390/ijms22168804 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Bussola, Nicole
Papa, Bruno
Melaiu, Ombretta
Castellano, Aurora
Fruci, Doriana
Jurman, Giuseppe
Quantification of the Immune Content in Neuroblastoma: Deep Learning and Topological Data Analysis in Digital Pathology
title Quantification of the Immune Content in Neuroblastoma: Deep Learning and Topological Data Analysis in Digital Pathology
title_full Quantification of the Immune Content in Neuroblastoma: Deep Learning and Topological Data Analysis in Digital Pathology
title_fullStr Quantification of the Immune Content in Neuroblastoma: Deep Learning and Topological Data Analysis in Digital Pathology
title_full_unstemmed Quantification of the Immune Content in Neuroblastoma: Deep Learning and Topological Data Analysis in Digital Pathology
title_short Quantification of the Immune Content in Neuroblastoma: Deep Learning and Topological Data Analysis in Digital Pathology
title_sort quantification of the immune content in neuroblastoma: deep learning and topological data analysis in digital pathology
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8396341/
https://www.ncbi.nlm.nih.gov/pubmed/34445517
http://dx.doi.org/10.3390/ijms22168804
work_keys_str_mv AT bussolanicole quantificationoftheimmunecontentinneuroblastomadeeplearningandtopologicaldataanalysisindigitalpathology
AT papabruno quantificationoftheimmunecontentinneuroblastomadeeplearningandtopologicaldataanalysisindigitalpathology
AT melaiuombretta quantificationoftheimmunecontentinneuroblastomadeeplearningandtopologicaldataanalysisindigitalpathology
AT castellanoaurora quantificationoftheimmunecontentinneuroblastomadeeplearningandtopologicaldataanalysisindigitalpathology
AT frucidoriana quantificationoftheimmunecontentinneuroblastomadeeplearningandtopologicaldataanalysisindigitalpathology
AT jurmangiuseppe quantificationoftheimmunecontentinneuroblastomadeeplearningandtopologicaldataanalysisindigitalpathology