Cargando…

Which data subset should be augmented for deep learning? a simulation study using urothelial cell carcinoma histopathology images

BACKGROUND: Applying deep learning to digital histopathology is hindered by the scarcity of manually annotated datasets. While data augmentation can ameliorate this obstacle, its methods are far from standardized. Our aim was to systematically explore the effects of skipping data augmentation; apply...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ameen, Yusra A., Badary, Dalia M., Abonnoor, Ahmad Elbadry I., Hussain, Khaled F., Sewisy, Adel A.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2023
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9983182/ https://www.ncbi.nlm.nih.gov/pubmed/36869300 http://dx.doi.org/10.1186/s12859-023-05199-y

_version_	1784900491152457728
author	Ameen, Yusra A. Badary, Dalia M. Abonnoor, Ahmad Elbadry I. Hussain, Khaled F. Sewisy, Adel A.
author_facet	Ameen, Yusra A. Badary, Dalia M. Abonnoor, Ahmad Elbadry I. Hussain, Khaled F. Sewisy, Adel A.
author_sort	Ameen, Yusra A.
collection	PubMed
description	BACKGROUND: Applying deep learning to digital histopathology is hindered by the scarcity of manually annotated datasets. While data augmentation can ameliorate this obstacle, its methods are far from standardized. Our aim was to systematically explore the effects of skipping data augmentation; applying data augmentation to different subsets of the whole dataset (training set, validation set, test set, two of them, or all of them); and applying data augmentation at different time points (before, during, or after dividing the dataset into three subsets). Different combinations of the above possibilities resulted in 11 ways to apply augmentation. The literature contains no such comprehensive systematic comparison of these augmentation ways. RESULTS: Non-overlapping photographs of all tissues on 90 hematoxylin-and-eosin-stained urinary bladder slides were obtained. Then, they were manually classified as either inflammation (5948 images), urothelial cell carcinoma (5811 images), or invalid (3132 images; excluded). If done, augmentation was eight-fold by flipping and rotation. Four convolutional neural networks (Inception-v3, ResNet-101, GoogLeNet, and SqueezeNet), pre-trained on the ImageNet dataset, were fine-tuned to binary classify images of our dataset. This task was the benchmark for our experiments. Model testing performance was evaluated using accuracy, sensitivity, specificity, and area under the receiver operating characteristic curve. Model validation accuracy was also estimated. The best testing performance was achieved when augmentation was done to the remaining data after test-set separation, but before division into training and validation sets. This leaked information between the training and the validation sets, as evidenced by the optimistic validation accuracy. However, this leakage did not cause the validation set to malfunction. Augmentation before test-set separation led to optimistic results. Test-set augmentation yielded more accurate evaluation metrics with less uncertainty. Inception-v3 had the best overall testing performance. CONCLUSIONS: In digital histopathology, augmentation should include both the test set (after its allocation), and the remaining combined training/validation set (before being split into separate training and validation sets). Future research should try to generalize our results. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05199-y.
format	Online Article Text
id	pubmed-9983182
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-99831822023-03-04 Which data subset should be augmented for deep learning? a simulation study using urothelial cell carcinoma histopathology images Ameen, Yusra A. Badary, Dalia M. Abonnoor, Ahmad Elbadry I. Hussain, Khaled F. Sewisy, Adel A. BMC Bioinformatics Research BACKGROUND: Applying deep learning to digital histopathology is hindered by the scarcity of manually annotated datasets. While data augmentation can ameliorate this obstacle, its methods are far from standardized. Our aim was to systematically explore the effects of skipping data augmentation; applying data augmentation to different subsets of the whole dataset (training set, validation set, test set, two of them, or all of them); and applying data augmentation at different time points (before, during, or after dividing the dataset into three subsets). Different combinations of the above possibilities resulted in 11 ways to apply augmentation. The literature contains no such comprehensive systematic comparison of these augmentation ways. RESULTS: Non-overlapping photographs of all tissues on 90 hematoxylin-and-eosin-stained urinary bladder slides were obtained. Then, they were manually classified as either inflammation (5948 images), urothelial cell carcinoma (5811 images), or invalid (3132 images; excluded). If done, augmentation was eight-fold by flipping and rotation. Four convolutional neural networks (Inception-v3, ResNet-101, GoogLeNet, and SqueezeNet), pre-trained on the ImageNet dataset, were fine-tuned to binary classify images of our dataset. This task was the benchmark for our experiments. Model testing performance was evaluated using accuracy, sensitivity, specificity, and area under the receiver operating characteristic curve. Model validation accuracy was also estimated. The best testing performance was achieved when augmentation was done to the remaining data after test-set separation, but before division into training and validation sets. This leaked information between the training and the validation sets, as evidenced by the optimistic validation accuracy. However, this leakage did not cause the validation set to malfunction. Augmentation before test-set separation led to optimistic results. Test-set augmentation yielded more accurate evaluation metrics with less uncertainty. Inception-v3 had the best overall testing performance. CONCLUSIONS: In digital histopathology, augmentation should include both the test set (after its allocation), and the remaining combined training/validation set (before being split into separate training and validation sets). Future research should try to generalize our results. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05199-y. BioMed Central 2023-03-03 /pmc/articles/PMC9983182/ /pubmed/36869300 http://dx.doi.org/10.1186/s12859-023-05199-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Ameen, Yusra A. Badary, Dalia M. Abonnoor, Ahmad Elbadry I. Hussain, Khaled F. Sewisy, Adel A. Which data subset should be augmented for deep learning? a simulation study using urothelial cell carcinoma histopathology images
title	Which data subset should be augmented for deep learning? a simulation study using urothelial cell carcinoma histopathology images
title_full	Which data subset should be augmented for deep learning? a simulation study using urothelial cell carcinoma histopathology images
title_fullStr	Which data subset should be augmented for deep learning? a simulation study using urothelial cell carcinoma histopathology images
title_full_unstemmed	Which data subset should be augmented for deep learning? a simulation study using urothelial cell carcinoma histopathology images
title_short	Which data subset should be augmented for deep learning? a simulation study using urothelial cell carcinoma histopathology images
title_sort	which data subset should be augmented for deep learning? a simulation study using urothelial cell carcinoma histopathology images
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9983182/ https://www.ncbi.nlm.nih.gov/pubmed/36869300 http://dx.doi.org/10.1186/s12859-023-05199-y
work_keys_str_mv	AT ameenyusraa whichdatasubsetshouldbeaugmentedfordeeplearningasimulationstudyusingurothelialcellcarcinomahistopathologyimages AT badarydaliam whichdatasubsetshouldbeaugmentedfordeeplearningasimulationstudyusingurothelialcellcarcinomahistopathologyimages AT abonnoorahmadelbadryi whichdatasubsetshouldbeaugmentedfordeeplearningasimulationstudyusingurothelialcellcarcinomahistopathologyimages AT hussainkhaledf whichdatasubsetshouldbeaugmentedfordeeplearningasimulationstudyusingurothelialcellcarcinomahistopathologyimages AT sewisyadela whichdatasubsetshouldbeaugmentedfordeeplearningasimulationstudyusingurothelialcellcarcinomahistopathologyimages

Which data subset should be augmented for deep learning? a simulation study using urothelial cell carcinoma histopathology images

Ejemplares similares