Cargando…

Histo-fetch – on-the-fly processing of gigapixel whole slide images simplifies and speeds neural network training

BACKGROUND: Training convolutional neural networks using pathology whole slide images (WSIs) is traditionally prefaced by the extraction of a training dataset of image patches. While effective, for large datasets of WSIs, this dataset preparation is inefficient. METHODS: We created a custom pipeline...

Descripción completa

Detalles Bibliográficos
Autores principales: Lutnick, Brendon, Murali, Leema Krishna, Ginley, Brandon, Rosenberg, Avi Z., Sarder, Pinaki
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8794032/
https://www.ncbi.nlm.nih.gov/pubmed/35136674
http://dx.doi.org/10.4103/jpi.jpi_59_20
_version_ 1784640737581727744
author Lutnick, Brendon
Murali, Leema Krishna
Ginley, Brandon
Rosenberg, Avi Z.
Sarder, Pinaki
author_facet Lutnick, Brendon
Murali, Leema Krishna
Ginley, Brandon
Rosenberg, Avi Z.
Sarder, Pinaki
author_sort Lutnick, Brendon
collection PubMed
description BACKGROUND: Training convolutional neural networks using pathology whole slide images (WSIs) is traditionally prefaced by the extraction of a training dataset of image patches. While effective, for large datasets of WSIs, this dataset preparation is inefficient. METHODS: We created a custom pipeline (histo-fetch) to efficiently extract random patches and labels from pathology WSIs for input to a neural network on-the-fly. We prefetch these patches as needed during network training, avoiding the need for WSI preparation such as chopping/tiling. RESULTS & CONCLUSIONS: We demonstrate the utility of this pipeline to perform artificial stain transfer and image generation using the popular networks CycleGAN and ProGAN, respectively. For a large WSI dataset, histo-fetch is 98.6% faster to start training and used 7535x less disk space.
format Online
Article
Text
id pubmed-8794032
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-87940322022-02-07 Histo-fetch – on-the-fly processing of gigapixel whole slide images simplifies and speeds neural network training Lutnick, Brendon Murali, Leema Krishna Ginley, Brandon Rosenberg, Avi Z. Sarder, Pinaki J Pathol Inform Technical Note BACKGROUND: Training convolutional neural networks using pathology whole slide images (WSIs) is traditionally prefaced by the extraction of a training dataset of image patches. While effective, for large datasets of WSIs, this dataset preparation is inefficient. METHODS: We created a custom pipeline (histo-fetch) to efficiently extract random patches and labels from pathology WSIs for input to a neural network on-the-fly. We prefetch these patches as needed during network training, avoiding the need for WSI preparation such as chopping/tiling. RESULTS & CONCLUSIONS: We demonstrate the utility of this pipeline to perform artificial stain transfer and image generation using the popular networks CycleGAN and ProGAN, respectively. For a large WSI dataset, histo-fetch is 98.6% faster to start training and used 7535x less disk space. Elsevier 2022-12-20 /pmc/articles/PMC8794032/ /pubmed/35136674 http://dx.doi.org/10.4103/jpi.jpi_59_20 Text en © 2022 Published by Elsevier Inc. on behalf of Association for Pathology Informatics. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Technical Note
Lutnick, Brendon
Murali, Leema Krishna
Ginley, Brandon
Rosenberg, Avi Z.
Sarder, Pinaki
Histo-fetch – on-the-fly processing of gigapixel whole slide images simplifies and speeds neural network training
title Histo-fetch – on-the-fly processing of gigapixel whole slide images simplifies and speeds neural network training
title_full Histo-fetch – on-the-fly processing of gigapixel whole slide images simplifies and speeds neural network training
title_fullStr Histo-fetch – on-the-fly processing of gigapixel whole slide images simplifies and speeds neural network training
title_full_unstemmed Histo-fetch – on-the-fly processing of gigapixel whole slide images simplifies and speeds neural network training
title_short Histo-fetch – on-the-fly processing of gigapixel whole slide images simplifies and speeds neural network training
title_sort histo-fetch – on-the-fly processing of gigapixel whole slide images simplifies and speeds neural network training
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8794032/
https://www.ncbi.nlm.nih.gov/pubmed/35136674
http://dx.doi.org/10.4103/jpi.jpi_59_20
work_keys_str_mv AT lutnickbrendon histofetchontheflyprocessingofgigapixelwholeslideimagessimplifiesandspeedsneuralnetworktraining
AT muralileemakrishna histofetchontheflyprocessingofgigapixelwholeslideimagessimplifiesandspeedsneuralnetworktraining
AT ginleybrandon histofetchontheflyprocessingofgigapixelwholeslideimagessimplifiesandspeedsneuralnetworktraining
AT rosenbergaviz histofetchontheflyprocessingofgigapixelwholeslideimagessimplifiesandspeedsneuralnetworktraining
AT sarderpinaki histofetchontheflyprocessingofgigapixelwholeslideimagessimplifiesandspeedsneuralnetworktraining