Cargando…

Staining Invariant Features for Improving Generalization of Deep Convolutional Neural Networks in Computational Pathology

One of the main obstacles for the implementation of deep convolutional neural networks (DCNNs) in the clinical pathology workflow is their low capability to overcome variability in slide preparation and scanner configuration, that leads to changes in tissue appearance. Some of these variations may n...

Descripción completa

Detalles Bibliográficos
Autores principales: Otálora, Sebastian, Atzori, Manfredo, Andrearczyk, Vincent, Khan, Amjad, Müller, Henning
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6716536/
https://www.ncbi.nlm.nih.gov/pubmed/31508414
http://dx.doi.org/10.3389/fbioe.2019.00198
_version_ 1783447393064189952
author Otálora, Sebastian
Atzori, Manfredo
Andrearczyk, Vincent
Khan, Amjad
Müller, Henning
author_facet Otálora, Sebastian
Atzori, Manfredo
Andrearczyk, Vincent
Khan, Amjad
Müller, Henning
author_sort Otálora, Sebastian
collection PubMed
description One of the main obstacles for the implementation of deep convolutional neural networks (DCNNs) in the clinical pathology workflow is their low capability to overcome variability in slide preparation and scanner configuration, that leads to changes in tissue appearance. Some of these variations may not be not included in the training data, which means that the models have a risk to not generalize well. Addressing such variations and evaluating them in reproducible scenarios allows understanding of when the models generalize better, which is crucial for performance improvements and better DCNN models. Staining normalization techniques (often based on color deconvolution and deep learning) and color augmentation approaches have shown improvements in the generalization of the classification tasks for several tissue types. Domain-invariant training of DCNN's is also a promising technique to address the problem of training a single model for different domains, since it includes the source domain information to guide the training toward domain-invariant features, achieving state-of-the-art results in classification tasks. In this article, deep domain adaptation in convolutional networks (DANN) is applied to computational pathology and compared with widely used staining normalization and color augmentation methods in two challenging classification tasks. The classification tasks rely on two openly accessible datasets, targeting Gleason grading in prostate cancer, and mitosis classification in breast tissue. The benchmark of the different techniques and their combination in two DCNN architectures allows us to assess the generalization abilities and advantages of each method in the considered classification tasks. The code for reproducing our experiments and preprocessing the data is publicly available. Quantitative and qualitative results show that the use of DANN helps model generalization to external datasets. The combination of several techniques to manage color heterogeneity suggests that several methods together, such as color augmentation methods with DANN training, can generalize even further. The results do not show a single best technique among the considered methods, even when combining them. However, color augmentation and DANN training obtain most often the best results (alone or combined with color normalization and color augmentation). The statistical significance of the results and the embeddings visualizations provide useful insights to design DCNN that generalizes to unseen staining appearances. Furthermore, in this work, we release for the first time code for DANN evaluation in open access datasets for computational pathology. This work opens the possibility for further research on using DANN models together with techniques that can overcome the tissue preparation differences across datasets to tackle limited generalization.
format Online
Article
Text
id pubmed-6716536
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-67165362019-09-10 Staining Invariant Features for Improving Generalization of Deep Convolutional Neural Networks in Computational Pathology Otálora, Sebastian Atzori, Manfredo Andrearczyk, Vincent Khan, Amjad Müller, Henning Front Bioeng Biotechnol Bioengineering and Biotechnology One of the main obstacles for the implementation of deep convolutional neural networks (DCNNs) in the clinical pathology workflow is their low capability to overcome variability in slide preparation and scanner configuration, that leads to changes in tissue appearance. Some of these variations may not be not included in the training data, which means that the models have a risk to not generalize well. Addressing such variations and evaluating them in reproducible scenarios allows understanding of when the models generalize better, which is crucial for performance improvements and better DCNN models. Staining normalization techniques (often based on color deconvolution and deep learning) and color augmentation approaches have shown improvements in the generalization of the classification tasks for several tissue types. Domain-invariant training of DCNN's is also a promising technique to address the problem of training a single model for different domains, since it includes the source domain information to guide the training toward domain-invariant features, achieving state-of-the-art results in classification tasks. In this article, deep domain adaptation in convolutional networks (DANN) is applied to computational pathology and compared with widely used staining normalization and color augmentation methods in two challenging classification tasks. The classification tasks rely on two openly accessible datasets, targeting Gleason grading in prostate cancer, and mitosis classification in breast tissue. The benchmark of the different techniques and their combination in two DCNN architectures allows us to assess the generalization abilities and advantages of each method in the considered classification tasks. The code for reproducing our experiments and preprocessing the data is publicly available. Quantitative and qualitative results show that the use of DANN helps model generalization to external datasets. The combination of several techniques to manage color heterogeneity suggests that several methods together, such as color augmentation methods with DANN training, can generalize even further. The results do not show a single best technique among the considered methods, even when combining them. However, color augmentation and DANN training obtain most often the best results (alone or combined with color normalization and color augmentation). The statistical significance of the results and the embeddings visualizations provide useful insights to design DCNN that generalizes to unseen staining appearances. Furthermore, in this work, we release for the first time code for DANN evaluation in open access datasets for computational pathology. This work opens the possibility for further research on using DANN models together with techniques that can overcome the tissue preparation differences across datasets to tackle limited generalization. Frontiers Media S.A. 2019-08-23 /pmc/articles/PMC6716536/ /pubmed/31508414 http://dx.doi.org/10.3389/fbioe.2019.00198 Text en Copyright © 2019 Otálora, Atzori, Andrearczyk, Khan and Müller. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioengineering and Biotechnology
Otálora, Sebastian
Atzori, Manfredo
Andrearczyk, Vincent
Khan, Amjad
Müller, Henning
Staining Invariant Features for Improving Generalization of Deep Convolutional Neural Networks in Computational Pathology
title Staining Invariant Features for Improving Generalization of Deep Convolutional Neural Networks in Computational Pathology
title_full Staining Invariant Features for Improving Generalization of Deep Convolutional Neural Networks in Computational Pathology
title_fullStr Staining Invariant Features for Improving Generalization of Deep Convolutional Neural Networks in Computational Pathology
title_full_unstemmed Staining Invariant Features for Improving Generalization of Deep Convolutional Neural Networks in Computational Pathology
title_short Staining Invariant Features for Improving Generalization of Deep Convolutional Neural Networks in Computational Pathology
title_sort staining invariant features for improving generalization of deep convolutional neural networks in computational pathology
topic Bioengineering and Biotechnology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6716536/
https://www.ncbi.nlm.nih.gov/pubmed/31508414
http://dx.doi.org/10.3389/fbioe.2019.00198
work_keys_str_mv AT otalorasebastian staininginvariantfeaturesforimprovinggeneralizationofdeepconvolutionalneuralnetworksincomputationalpathology
AT atzorimanfredo staininginvariantfeaturesforimprovinggeneralizationofdeepconvolutionalneuralnetworksincomputationalpathology
AT andrearczykvincent staininginvariantfeaturesforimprovinggeneralizationofdeepconvolutionalneuralnetworksincomputationalpathology
AT khanamjad staininginvariantfeaturesforimprovinggeneralizationofdeepconvolutionalneuralnetworksincomputationalpathology
AT mullerhenning staininginvariantfeaturesforimprovinggeneralizationofdeepconvolutionalneuralnetworksincomputationalpathology