Cargando…

Systematic Evaluation of Image Tiling Adverse Effects on Deep Learning Semantic Segmentation

Convolutional neural network (CNN) models obtain state of the art performance on image classification, localization, and segmentation tasks. Limitations in computer hardware, most notably memory size in deep learning accelerator cards, prevent relatively large images, such as those from medical and...

Descripción completa

Detalles Bibliográficos
Autores principales:	Reina, G. Anthony, Panchumarthy, Ravi, Thakur, Siddhesh Pravin, Bastidas, Alexei, Bakas, Spyridon
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2020
Materias:	Neuroscience
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7020775/ https://www.ncbi.nlm.nih.gov/pubmed/32116512 http://dx.doi.org/10.3389/fnins.2020.00065

_version_	1783497810408112128
author	Reina, G. Anthony Panchumarthy, Ravi Thakur, Siddhesh Pravin Bastidas, Alexei Bakas, Spyridon
author_facet	Reina, G. Anthony Panchumarthy, Ravi Thakur, Siddhesh Pravin Bastidas, Alexei Bakas, Spyridon
author_sort	Reina, G. Anthony
collection	PubMed
description	Convolutional neural network (CNN) models obtain state of the art performance on image classification, localization, and segmentation tasks. Limitations in computer hardware, most notably memory size in deep learning accelerator cards, prevent relatively large images, such as those from medical and satellite imaging, from being processed as a whole in their original resolution. A fully convolutional topology, such as U-Net, is typically trained on down-sampled images and inferred on images of their original size and resolution, by simply dividing the larger image into smaller (typically overlapping) tiles, making predictions on these tiles, and stitching them back together as the prediction for the whole image. In this study, we show that this tiling technique combined with translationally-invariant nature of CNNs causes small, but relevant differences during inference that can be detrimental in the performance of the model. Here we quantify these variations in both medical (i.e., BraTS) and non-medical (i.e., satellite) images and show that training a 2D U-Net model on the whole image substantially improves the overall model performance. Finally, we compare 2D and 3D semantic segmentation models to show that providing CNN models with a wider context of the image in all three dimensions leads to more accurate and consistent predictions. Our results suggest that tiling the input to CNN models—while perhaps necessary to overcome the memory limitations in computer hardware—may lead to undesirable and unpredictable errors in the model's output that can only be adequately mitigated by increasing the input of the model to the largest possible tile size.
format	Online Article Text
id	pubmed-7020775
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-70207752020-02-28 Systematic Evaluation of Image Tiling Adverse Effects on Deep Learning Semantic Segmentation Reina, G. Anthony Panchumarthy, Ravi Thakur, Siddhesh Pravin Bastidas, Alexei Bakas, Spyridon Front Neurosci Neuroscience Convolutional neural network (CNN) models obtain state of the art performance on image classification, localization, and segmentation tasks. Limitations in computer hardware, most notably memory size in deep learning accelerator cards, prevent relatively large images, such as those from medical and satellite imaging, from being processed as a whole in their original resolution. A fully convolutional topology, such as U-Net, is typically trained on down-sampled images and inferred on images of their original size and resolution, by simply dividing the larger image into smaller (typically overlapping) tiles, making predictions on these tiles, and stitching them back together as the prediction for the whole image. In this study, we show that this tiling technique combined with translationally-invariant nature of CNNs causes small, but relevant differences during inference that can be detrimental in the performance of the model. Here we quantify these variations in both medical (i.e., BraTS) and non-medical (i.e., satellite) images and show that training a 2D U-Net model on the whole image substantially improves the overall model performance. Finally, we compare 2D and 3D semantic segmentation models to show that providing CNN models with a wider context of the image in all three dimensions leads to more accurate and consistent predictions. Our results suggest that tiling the input to CNN models—while perhaps necessary to overcome the memory limitations in computer hardware—may lead to undesirable and unpredictable errors in the model's output that can only be adequately mitigated by increasing the input of the model to the largest possible tile size. Frontiers Media S.A. 2020-02-07 /pmc/articles/PMC7020775/ /pubmed/32116512 http://dx.doi.org/10.3389/fnins.2020.00065 Text en Copyright © 2020 Reina, Panchumarthy, Thakur, Bastidas and Bakas. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Neuroscience Reina, G. Anthony Panchumarthy, Ravi Thakur, Siddhesh Pravin Bastidas, Alexei Bakas, Spyridon Systematic Evaluation of Image Tiling Adverse Effects on Deep Learning Semantic Segmentation
title	Systematic Evaluation of Image Tiling Adverse Effects on Deep Learning Semantic Segmentation
title_full	Systematic Evaluation of Image Tiling Adverse Effects on Deep Learning Semantic Segmentation
title_fullStr	Systematic Evaluation of Image Tiling Adverse Effects on Deep Learning Semantic Segmentation
title_full_unstemmed	Systematic Evaluation of Image Tiling Adverse Effects on Deep Learning Semantic Segmentation
title_short	Systematic Evaluation of Image Tiling Adverse Effects on Deep Learning Semantic Segmentation
title_sort	systematic evaluation of image tiling adverse effects on deep learning semantic segmentation
topic	Neuroscience
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7020775/ https://www.ncbi.nlm.nih.gov/pubmed/32116512 http://dx.doi.org/10.3389/fnins.2020.00065
work_keys_str_mv	AT reinaganthony systematicevaluationofimagetilingadverseeffectsondeeplearningsemanticsegmentation AT panchumarthyravi systematicevaluationofimagetilingadverseeffectsondeeplearningsemanticsegmentation AT thakursiddheshpravin systematicevaluationofimagetilingadverseeffectsondeeplearningsemanticsegmentation AT bastidasalexei systematicevaluationofimagetilingadverseeffectsondeeplearningsemanticsegmentation AT bakasspyridon systematicevaluationofimagetilingadverseeffectsondeeplearningsemanticsegmentation

Systematic Evaluation of Image Tiling Adverse Effects on Deep Learning Semantic Segmentation

Ejemplares similares