Cargando…

Limited Number of Cases May Yield Generalizable Models, a Proof of Concept in Deep Learning for Colon Histology

BACKGROUND: Little is known about the effect of a minimum number of slides required in generating image datasets used to build generalizable machine-learning (ML) models. In addition, the assumption within deep learning is that the increased number of training images will always enhance accuracy and...

Descripción completa

Detalles Bibliográficos
Autores principales:	Holland, Lorne, Wei, Dongguang, Olson, Kristin A., Mitra, Anupam, Graff, John Paul, Jones, Andrew D., Durbin-Johnson, Blythe, Mitra, Ananya Datta, Rashidi, Hooman H.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Wolters Kluwer - Medknow 2020
Materias:	Original Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7047745/ https://www.ncbi.nlm.nih.gov/pubmed/32175170 http://dx.doi.org/10.4103/jpi.jpi_49_19

_version_	1783502172857565184
author	Holland, Lorne Wei, Dongguang Olson, Kristin A. Mitra, Anupam Graff, John Paul Jones, Andrew D. Durbin-Johnson, Blythe Mitra, Ananya Datta Rashidi, Hooman H.
author_facet	Holland, Lorne Wei, Dongguang Olson, Kristin A. Mitra, Anupam Graff, John Paul Jones, Andrew D. Durbin-Johnson, Blythe Mitra, Ananya Datta Rashidi, Hooman H.
author_sort	Holland, Lorne
collection	PubMed
description	BACKGROUND: Little is known about the effect of a minimum number of slides required in generating image datasets used to build generalizable machine-learning (ML) models. In addition, the assumption within deep learning is that the increased number of training images will always enhance accuracy and that the initial validation accuracy of the models correlates well with their generalizability. In this pilot study, we have been able to test the above assumptions to gain a better understanding of such platforms, especially when data resources are limited. METHODS: Using 10 colon histology slides (5 carcinoma and 5 benign), we were able to acquire 1000 partially overlapping images (Dataset A) that were then trained and tested on three convolutional neural networks (CNNs), ResNet50, AlexNet, and SqueezeNet, to build a large number of unique models for a simple task of classifying colon histopathology into benign and malignant. Different quantities of images (10–1000) from Dataset A were used to construct >200 unique CNN models whose performances were individually assessed. The performance of these models was initially assessed using 20% of Dataset A's images (not included in the training phase) to acquire their initial validation accuracy (internal accuracy) followed by their generalization accuracy on Dataset B (a very distinct secondary test set acquired from public domain online sources). RESULTS: All CNNs showed similar peak internal accuracies (>97%) from the Dataset A test set. Peak accuracies for the external novel test set (Dataset B), an assessment of the ability to generalize, showed marked variation (ResNet50: 98%; AlexNet: 92%; and SqueezeNet: 80%). The models with the highest accuracy were not generated using the largest training sets. Further, a model's internal accuracy did not always correlate with its generalization accuracy. The results were obtained using an optimized number of cases and controls. CONCLUSIONS: Increasing the number of images in a training set does not always improve model accuracy, and significant numbers of cases may not always be needed for generalization, especially for simple tasks. Different CNNs reach peak accuracy with different training set sizes. Further studies are required to evaluate the above findings in more complex ML models prior to using such ancillary tools in clinical settings.
format	Online Article Text
id	pubmed-7047745
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Wolters Kluwer - Medknow
record_format	MEDLINE/PubMed
spelling	pubmed-70477452020-03-13 Limited Number of Cases May Yield Generalizable Models, a Proof of Concept in Deep Learning for Colon Histology Holland, Lorne Wei, Dongguang Olson, Kristin A. Mitra, Anupam Graff, John Paul Jones, Andrew D. Durbin-Johnson, Blythe Mitra, Ananya Datta Rashidi, Hooman H. J Pathol Inform Original Article BACKGROUND: Little is known about the effect of a minimum number of slides required in generating image datasets used to build generalizable machine-learning (ML) models. In addition, the assumption within deep learning is that the increased number of training images will always enhance accuracy and that the initial validation accuracy of the models correlates well with their generalizability. In this pilot study, we have been able to test the above assumptions to gain a better understanding of such platforms, especially when data resources are limited. METHODS: Using 10 colon histology slides (5 carcinoma and 5 benign), we were able to acquire 1000 partially overlapping images (Dataset A) that were then trained and tested on three convolutional neural networks (CNNs), ResNet50, AlexNet, and SqueezeNet, to build a large number of unique models for a simple task of classifying colon histopathology into benign and malignant. Different quantities of images (10–1000) from Dataset A were used to construct >200 unique CNN models whose performances were individually assessed. The performance of these models was initially assessed using 20% of Dataset A's images (not included in the training phase) to acquire their initial validation accuracy (internal accuracy) followed by their generalization accuracy on Dataset B (a very distinct secondary test set acquired from public domain online sources). RESULTS: All CNNs showed similar peak internal accuracies (>97%) from the Dataset A test set. Peak accuracies for the external novel test set (Dataset B), an assessment of the ability to generalize, showed marked variation (ResNet50: 98%; AlexNet: 92%; and SqueezeNet: 80%). The models with the highest accuracy were not generated using the largest training sets. Further, a model's internal accuracy did not always correlate with its generalization accuracy. The results were obtained using an optimized number of cases and controls. CONCLUSIONS: Increasing the number of images in a training set does not always improve model accuracy, and significant numbers of cases may not always be needed for generalization, especially for simple tasks. Different CNNs reach peak accuracy with different training set sizes. Further studies are required to evaluate the above findings in more complex ML models prior to using such ancillary tools in clinical settings. Wolters Kluwer - Medknow 2020-02-21 /pmc/articles/PMC7047745/ /pubmed/32175170 http://dx.doi.org/10.4103/jpi.jpi_49_19 Text en Copyright: © 2020 Journal of Pathology Informatics http://creativecommons.org/licenses/by-nc-sa/4.0 This is an open access journal, and articles are distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License, which allows others to remix, tweak, and build upon the work non-commercially, as long as appropriate credit is given and the new creations are licensed under the identical terms.
spellingShingle	Original Article Holland, Lorne Wei, Dongguang Olson, Kristin A. Mitra, Anupam Graff, John Paul Jones, Andrew D. Durbin-Johnson, Blythe Mitra, Ananya Datta Rashidi, Hooman H. Limited Number of Cases May Yield Generalizable Models, a Proof of Concept in Deep Learning for Colon Histology
title	Limited Number of Cases May Yield Generalizable Models, a Proof of Concept in Deep Learning for Colon Histology
title_full	Limited Number of Cases May Yield Generalizable Models, a Proof of Concept in Deep Learning for Colon Histology
title_fullStr	Limited Number of Cases May Yield Generalizable Models, a Proof of Concept in Deep Learning for Colon Histology
title_full_unstemmed	Limited Number of Cases May Yield Generalizable Models, a Proof of Concept in Deep Learning for Colon Histology
title_short	Limited Number of Cases May Yield Generalizable Models, a Proof of Concept in Deep Learning for Colon Histology
title_sort	limited number of cases may yield generalizable models, a proof of concept in deep learning for colon histology
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7047745/ https://www.ncbi.nlm.nih.gov/pubmed/32175170 http://dx.doi.org/10.4103/jpi.jpi_49_19
work_keys_str_mv	AT hollandlorne limitednumberofcasesmayyieldgeneralizablemodelsaproofofconceptindeeplearningforcolonhistology AT weidongguang limitednumberofcasesmayyieldgeneralizablemodelsaproofofconceptindeeplearningforcolonhistology AT olsonkristina limitednumberofcasesmayyieldgeneralizablemodelsaproofofconceptindeeplearningforcolonhistology AT mitraanupam limitednumberofcasesmayyieldgeneralizablemodelsaproofofconceptindeeplearningforcolonhistology AT graffjohnpaul limitednumberofcasesmayyieldgeneralizablemodelsaproofofconceptindeeplearningforcolonhistology AT jonesandrewd limitednumberofcasesmayyieldgeneralizablemodelsaproofofconceptindeeplearningforcolonhistology AT durbinjohnsonblythe limitednumberofcasesmayyieldgeneralizablemodelsaproofofconceptindeeplearningforcolonhistology AT mitraananyadatta limitednumberofcasesmayyieldgeneralizablemodelsaproofofconceptindeeplearningforcolonhistology AT rashidihoomanh limitednumberofcasesmayyieldgeneralizablemodelsaproofofconceptindeeplearningforcolonhistology

Limited Number of Cases May Yield Generalizable Models, a Proof of Concept in Deep Learning for Colon Histology

Ejemplares similares