Cargando…

Data-driven color augmentation for H&E stained images in computational pathology

Computational pathology targets the automatic analysis of Whole Slide Images (WSI). WSIs are high-resolution digitized histopathology images, stained with chemical reagents to highlight specific tissue structures and scanned via whole slide scanners. The application of different parameters during WS...

Descripción completa

Detalles Bibliográficos
Autores principales: Marini, Niccolò, Otalora, Sebastian, Wodzinski, Marek, Tomassini, Selene, Dragoni, Aldo Franco, Marchand-Maillet, Stephane, Morales, Juan Pedro Dominguez, Duran-Lopez, Lourdes, Vatrano, Simona, Müller, Henning, Atzori, Manfredo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9852546/
https://www.ncbi.nlm.nih.gov/pubmed/36687531
http://dx.doi.org/10.1016/j.jpi.2022.100183
_version_ 1784872667292106752
author Marini, Niccolò
Otalora, Sebastian
Wodzinski, Marek
Tomassini, Selene
Dragoni, Aldo Franco
Marchand-Maillet, Stephane
Morales, Juan Pedro Dominguez
Duran-Lopez, Lourdes
Vatrano, Simona
Müller, Henning
Atzori, Manfredo
author_facet Marini, Niccolò
Otalora, Sebastian
Wodzinski, Marek
Tomassini, Selene
Dragoni, Aldo Franco
Marchand-Maillet, Stephane
Morales, Juan Pedro Dominguez
Duran-Lopez, Lourdes
Vatrano, Simona
Müller, Henning
Atzori, Manfredo
author_sort Marini, Niccolò
collection PubMed
description Computational pathology targets the automatic analysis of Whole Slide Images (WSI). WSIs are high-resolution digitized histopathology images, stained with chemical reagents to highlight specific tissue structures and scanned via whole slide scanners. The application of different parameters during WSI acquisition may lead to stain color heterogeneity, especially considering samples collected from several medical centers. Dealing with stain color heterogeneity often limits the robustness of methods developed to analyze WSIs, in particular Convolutional Neural Networks (CNN), the state-of-the-art algorithm for most computational pathology tasks. Stain color heterogeneity is still an unsolved problem, although several methods have been developed to alleviate it, such as Hue-Saturation-Contrast (HSC) color augmentation and stain augmentation methods. The goal of this paper is to present Data-Driven Color Augmentation (DDCA), a method to improve the efficiency of color augmentation methods by increasing the reliability of the samples used for training computational pathology models. During CNN training, a database including over 2 million H&E color variations collected from private and public datasets is used as a reference to discard augmented data with color distributions that do not correspond to realistic data. DDCA is applied to HSC color augmentation, stain augmentation and H&E-adversarial networks in colon and prostate cancer classification tasks. DDCA is then compared with 11 state-of-the-art baseline methods to handle color heterogeneity, showing that it can substantially improve classification performance on unseen data including heterogeneous color variations.
format Online
Article
Text
id pubmed-9852546
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-98525462023-01-21 Data-driven color augmentation for H&E stained images in computational pathology Marini, Niccolò Otalora, Sebastian Wodzinski, Marek Tomassini, Selene Dragoni, Aldo Franco Marchand-Maillet, Stephane Morales, Juan Pedro Dominguez Duran-Lopez, Lourdes Vatrano, Simona Müller, Henning Atzori, Manfredo J Pathol Inform Original Research Article Computational pathology targets the automatic analysis of Whole Slide Images (WSI). WSIs are high-resolution digitized histopathology images, stained with chemical reagents to highlight specific tissue structures and scanned via whole slide scanners. The application of different parameters during WSI acquisition may lead to stain color heterogeneity, especially considering samples collected from several medical centers. Dealing with stain color heterogeneity often limits the robustness of methods developed to analyze WSIs, in particular Convolutional Neural Networks (CNN), the state-of-the-art algorithm for most computational pathology tasks. Stain color heterogeneity is still an unsolved problem, although several methods have been developed to alleviate it, such as Hue-Saturation-Contrast (HSC) color augmentation and stain augmentation methods. The goal of this paper is to present Data-Driven Color Augmentation (DDCA), a method to improve the efficiency of color augmentation methods by increasing the reliability of the samples used for training computational pathology models. During CNN training, a database including over 2 million H&E color variations collected from private and public datasets is used as a reference to discard augmented data with color distributions that do not correspond to realistic data. DDCA is applied to HSC color augmentation, stain augmentation and H&E-adversarial networks in colon and prostate cancer classification tasks. DDCA is then compared with 11 state-of-the-art baseline methods to handle color heterogeneity, showing that it can substantially improve classification performance on unseen data including heterogeneous color variations. Elsevier 2023-01-03 /pmc/articles/PMC9852546/ /pubmed/36687531 http://dx.doi.org/10.1016/j.jpi.2022.100183 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Original Research Article
Marini, Niccolò
Otalora, Sebastian
Wodzinski, Marek
Tomassini, Selene
Dragoni, Aldo Franco
Marchand-Maillet, Stephane
Morales, Juan Pedro Dominguez
Duran-Lopez, Lourdes
Vatrano, Simona
Müller, Henning
Atzori, Manfredo
Data-driven color augmentation for H&E stained images in computational pathology
title Data-driven color augmentation for H&E stained images in computational pathology
title_full Data-driven color augmentation for H&E stained images in computational pathology
title_fullStr Data-driven color augmentation for H&E stained images in computational pathology
title_full_unstemmed Data-driven color augmentation for H&E stained images in computational pathology
title_short Data-driven color augmentation for H&E stained images in computational pathology
title_sort data-driven color augmentation for h&e stained images in computational pathology
topic Original Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9852546/
https://www.ncbi.nlm.nih.gov/pubmed/36687531
http://dx.doi.org/10.1016/j.jpi.2022.100183
work_keys_str_mv AT marininiccolo datadrivencoloraugmentationforhestainedimagesincomputationalpathology
AT otalorasebastian datadrivencoloraugmentationforhestainedimagesincomputationalpathology
AT wodzinskimarek datadrivencoloraugmentationforhestainedimagesincomputationalpathology
AT tomassiniselene datadrivencoloraugmentationforhestainedimagesincomputationalpathology
AT dragonialdofranco datadrivencoloraugmentationforhestainedimagesincomputationalpathology
AT marchandmailletstephane datadrivencoloraugmentationforhestainedimagesincomputationalpathology
AT moralesjuanpedrodominguez datadrivencoloraugmentationforhestainedimagesincomputationalpathology
AT duranlopezlourdes datadrivencoloraugmentationforhestainedimagesincomputationalpathology
AT vatranosimona datadrivencoloraugmentationforhestainedimagesincomputationalpathology
AT mullerhenning datadrivencoloraugmentationforhestainedimagesincomputationalpathology
AT atzorimanfredo datadrivencoloraugmentationforhestainedimagesincomputationalpathology