Cargando…

Sparse coding of pathology slides compared to transfer learning with deep neural networks

BACKGROUND: Histopathology images of tumor biopsies present unique challenges for applying machine learning to the diagnosis and treatment of cancer. The pathology slides are high resolution, often exceeding 1GB, have non-uniform dimensions, and often contain multiple tissue slices of varying sizes...

Descripción completa

Detalles Bibliográficos
Autores principales:	Fischer, Will, Moudgalya, Sanketh S., Cohn, Judith D., Nguyen, Nga T. T., Kenyon, Garrett T.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2018
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6302377/ https://www.ncbi.nlm.nih.gov/pubmed/30577746 http://dx.doi.org/10.1186/s12859-018-2504-8

_version_	1783381964653330432
author	Fischer, Will Moudgalya, Sanketh S. Cohn, Judith D. Nguyen, Nga T. T. Kenyon, Garrett T.
author_facet	Fischer, Will Moudgalya, Sanketh S. Cohn, Judith D. Nguyen, Nga T. T. Kenyon, Garrett T.
author_sort	Fischer, Will
collection	PubMed
description	BACKGROUND: Histopathology images of tumor biopsies present unique challenges for applying machine learning to the diagnosis and treatment of cancer. The pathology slides are high resolution, often exceeding 1GB, have non-uniform dimensions, and often contain multiple tissue slices of varying sizes surrounded by large empty regions. The locations of abnormal or cancerous cells, which may constitute a small portion of any given tissue sample, are not annotated. Cancer image datasets are also extremely imbalanced, with most slides being associated with relatively common cancers. Since deep representations trained on natural photographs are unlikely to be optimal for classifying pathology slide images, which have different spectral ranges and spatial structure, we here describe an approach for learning features and inferring representations of cancer pathology slides based on sparse coding. RESULTS: We show that conventional transfer learning using a state-of-the-art deep learning architecture pre-trained on ImageNet (RESNET) and fine tuned for a binary tumor/no-tumor classification task achieved between 85% and 86% accuracy. However, when all layers up to the last convolutional layer in RESNET are replaced with a single feature map inferred via a sparse coding using a dictionary optimized for sparse reconstruction of unlabeled pathology slides, classification performance improves to over 93%, corresponding to a 54% error reduction. CONCLUSIONS: We conclude that a feature dictionary optimized for biomedical imagery may in general support better classification performance than does conventional transfer learning using a dictionary pre-trained on natural images. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2504-8) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-6302377
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-63023772018-12-31 Sparse coding of pathology slides compared to transfer learning with deep neural networks Fischer, Will Moudgalya, Sanketh S. Cohn, Judith D. Nguyen, Nga T. T. Kenyon, Garrett T. BMC Bioinformatics Research BACKGROUND: Histopathology images of tumor biopsies present unique challenges for applying machine learning to the diagnosis and treatment of cancer. The pathology slides are high resolution, often exceeding 1GB, have non-uniform dimensions, and often contain multiple tissue slices of varying sizes surrounded by large empty regions. The locations of abnormal or cancerous cells, which may constitute a small portion of any given tissue sample, are not annotated. Cancer image datasets are also extremely imbalanced, with most slides being associated with relatively common cancers. Since deep representations trained on natural photographs are unlikely to be optimal for classifying pathology slide images, which have different spectral ranges and spatial structure, we here describe an approach for learning features and inferring representations of cancer pathology slides based on sparse coding. RESULTS: We show that conventional transfer learning using a state-of-the-art deep learning architecture pre-trained on ImageNet (RESNET) and fine tuned for a binary tumor/no-tumor classification task achieved between 85% and 86% accuracy. However, when all layers up to the last convolutional layer in RESNET are replaced with a single feature map inferred via a sparse coding using a dictionary optimized for sparse reconstruction of unlabeled pathology slides, classification performance improves to over 93%, corresponding to a 54% error reduction. CONCLUSIONS: We conclude that a feature dictionary optimized for biomedical imagery may in general support better classification performance than does conventional transfer learning using a dictionary pre-trained on natural images. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2504-8) contains supplementary material, which is available to authorized users. BioMed Central 2018-12-21 /pmc/articles/PMC6302377/ /pubmed/30577746 http://dx.doi.org/10.1186/s12859-018-2504-8 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Fischer, Will Moudgalya, Sanketh S. Cohn, Judith D. Nguyen, Nga T. T. Kenyon, Garrett T. Sparse coding of pathology slides compared to transfer learning with deep neural networks
title	Sparse coding of pathology slides compared to transfer learning with deep neural networks
title_full	Sparse coding of pathology slides compared to transfer learning with deep neural networks
title_fullStr	Sparse coding of pathology slides compared to transfer learning with deep neural networks
title_full_unstemmed	Sparse coding of pathology slides compared to transfer learning with deep neural networks
title_short	Sparse coding of pathology slides compared to transfer learning with deep neural networks
title_sort	sparse coding of pathology slides compared to transfer learning with deep neural networks
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6302377/ https://www.ncbi.nlm.nih.gov/pubmed/30577746 http://dx.doi.org/10.1186/s12859-018-2504-8
work_keys_str_mv	AT fischerwill sparsecodingofpathologyslidescomparedtotransferlearningwithdeepneuralnetworks AT moudgalyasankeths sparsecodingofpathologyslidescomparedtotransferlearningwithdeepneuralnetworks AT cohnjudithd sparsecodingofpathologyslidescomparedtotransferlearningwithdeepneuralnetworks AT nguyenngatt sparsecodingofpathologyslidescomparedtotransferlearningwithdeepneuralnetworks AT kenyongarrettt sparsecodingofpathologyslidescomparedtotransferlearningwithdeepneuralnetworks

Sparse coding of pathology slides compared to transfer learning with deep neural networks

Ejemplares similares