
Perception without preconception: comparison between the human and machine learner in recognition of tissues from histological sections


Bibliographic Details
Main Authors: Barui, Sanghita, Sanyal, Parikshit, Rajmohan, K. S., Malik, Ajay, Dudani, Sharmila
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2022
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9525725/
https://www.ncbi.nlm.nih.gov/pubmed/36180472
http://dx.doi.org/10.1038/s41598-022-20012-1
_version_ 1784800742074220544
author Barui, Sanghita
Sanyal, Parikshit
Rajmohan, K. S.
Malik, Ajay
Dudani, Sharmila
author_facet Barui, Sanghita
Sanyal, Parikshit
Rajmohan, K. S.
Malik, Ajay
Dudani, Sharmila
author_sort Barui, Sanghita
collection PubMed
description Deep neural networks (DNNs) have shown success in image classification, achieving high accuracy in recognition of everyday objects. Performance of DNNs has traditionally been measured assuming human accuracy is perfect. In specific problem domains, however, human accuracy is less than perfect, and a comparison between humans and machine learning (ML) models can be made. In recognising everyday objects, humans have the advantage of a lifetime of experience, whereas DNN models are trained only on a limited image dataset. We compared the performance of human learners and two DNN models on an image dataset that is novel to both, i.e. histological images, thereby aiming to eliminate the advantage of prior experience that humans have over DNN models in image classification. Ten classes of tissues were randomly selected from the undergraduate first-year histology curriculum of a medical school in North India. Two ML models were developed from the VGG16 (VML) and Inception V2 (IML) DNNs, using transfer learning, to produce 10-class classifiers. One thousand (1000) images belonging to the ten classes (i.e. 100 images from each class) were split into training (700) and validation (300) sets. After training, the VML and IML models achieved 85.67% and 89% accuracy on the validation set, respectively. The training set was also circulated to medical students (MS) of the college for a week. An online quiz, consisting of a random selection of 100 images from the validation set, was then conducted among students who volunteered for the study, after obtaining informed consent. Sixty-six students participated in the quiz, providing 6557 responses. In addition, we prepared a set of 10 images belonging to classes of tissue not present in the training set (out-of-training-scope, or OTS, images). A second quiz was conducted on the medical students with the OTS images, and the ML models were also run on these OTS images. The overall accuracy of the MS in the first quiz was 55.14%. The two ML models were also run on the first quiz questionnaire, producing accuracies between 91% and 93%; the ML models thus scored higher than 80% of the medical students. Analysis of the confusion matrices of both ML models and of all medical students showed dissimilar error profiles. However, for the subset of students who achieved accuracy similar to that of the ML models, the error profile was also similar. Recognition of ‘stomach’ proved difficult for both humans and ML models. On four images in the first quiz set, both the VML model and the medical students produced highly equivocal responses; within these images, a pattern of bias was uncovered: a tendency of medical students to misclassify ‘liver’ tissue. The ‘stomach’ class proved most difficult for both MS and VML, accounting for 34.84% of all MS errors and 41.17% of all VML errors, whereas the IML model committed most of its errors in recognising the ‘skin’ class (27.5% of all errors). Analysis of the convolution layers of the DNN outlined features in the original image which might have led to misclassification by the VML model. On the OTS images, however, the medical students produced a better overall score than both ML models, i.e. they successfully recognised patterns of similarity between tissues and could generalise their training to a novel dataset. Our findings suggest that, within the scope of training, ML models perform better than 80% of medical students, with a distinct error profile. However, students whose accuracy approaches that of the ML models tend to replicate the models' error profile, suggesting a degree of similarity in how machines and humans extract features from an image. When asked to recognise images outside the scope of their training, humans perform better at recognising patterns and likeness between tissues. This suggests that ‘training’ is not the same as ‘learning’: humans can extend their pattern-based learning to domains outside the training set.
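For readers who want a concrete picture of the classifiers described above, the following is a minimal sketch of a transfer-learning setup for a 10-class tissue classifier built on a pretrained VGG16 backbone (the IML model would swap in an Inception backbone). The abstract does not specify the framework, preprocessing, image size, or hyperparameters, so the Keras/TensorFlow API, the 224 × 224 input size, the "tissues/train" and "tissues/val" directory layout, and the training settings below are illustrative assumptions, not the authors' implementation.

```python
# Sketch only: transfer learning a 10-class histology classifier on VGG16.
# Framework, input size, directory names, and hyperparameters are assumptions.

import tensorflow as tf
from tensorflow.keras import layers, models

NUM_CLASSES = 10          # ten tissue classes from the histology curriculum
IMG_SIZE = (224, 224)     # assumed input size; VGG16's ImageNet default

# Pretrained convolutional base, frozen so only the new head is trained.
base = tf.keras.applications.VGG16(
    include_top=False, weights="imagenet", input_shape=IMG_SIZE + (3,))
base.trainable = False

model = models.Sequential([
    base,
    layers.GlobalAveragePooling2D(),
    layers.Dense(256, activation="relu"),
    layers.Dropout(0.5),
    layers.Dense(NUM_CLASSES, activation="softmax"),
])

model.compile(optimizer=tf.keras.optimizers.Adam(1e-4),
              loss="categorical_crossentropy",
              metrics=["accuracy"])

# Hypothetical directory layout: train/ and val/ each hold one sub-folder
# per tissue class (700 and 300 images overall in the study).
train_ds = tf.keras.utils.image_dataset_from_directory(
    "tissues/train", image_size=IMG_SIZE, label_mode="categorical", batch_size=32)
val_ds = tf.keras.utils.image_dataset_from_directory(
    "tissues/val", image_size=IMG_SIZE, label_mode="categorical", batch_size=32)

# Apply VGG16's expected input preprocessing (BGR conversion, mean subtraction).
preprocess = tf.keras.applications.vgg16.preprocess_input
train_ds = train_ds.map(lambda x, y: (preprocess(x), y))
val_ds = val_ds.map(lambda x, y: (preprocess(x), y))

model.fit(train_ds, validation_data=val_ds, epochs=20)
```

Freezing the convolutional base and training only the new classification head is the simplest form of transfer learning; fine-tuning the upper convolutional blocks afterwards is a common refinement that the study may or may not have used.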
format Online
Article
Text
id pubmed-9525725
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-9525725 2022-10-02 Perception without preconception: comparison between the human and machine learner in recognition of tissues from histological sections Barui, Sanghita Sanyal, Parikshit Rajmohan, K. S. Malik, Ajay Dudani, Sharmila Sci Rep Article Nature Publishing Group UK 2022-09-30 /pmc/articles/PMC9525725/ /pubmed/36180472 http://dx.doi.org/10.1038/s41598-022-20012-1 Text en © The Author(s) 2022. Open Access: this article is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Barui, Sanghita
Sanyal, Parikshit
Rajmohan, K. S.
Malik, Ajay
Dudani, Sharmila
Perception without preconception: comparison between the human and machine learner in recognition of tissues from histological sections
title Perception without preconception: comparison between the human and machine learner in recognition of tissues from histological sections
title_full Perception without preconception: comparison between the human and machine learner in recognition of tissues from histological sections
title_fullStr Perception without preconception: comparison between the human and machine learner in recognition of tissues from histological sections
title_full_unstemmed Perception without preconception: comparison between the human and machine learner in recognition of tissues from histological sections
title_short Perception without preconception: comparison between the human and machine learner in recognition of tissues from histological sections
title_sort perception without preconception: comparison between the human and machine learner in recognition of tissues from histological sections
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9525725/
https://www.ncbi.nlm.nih.gov/pubmed/36180472
http://dx.doi.org/10.1038/s41598-022-20012-1
work_keys_str_mv AT baruisanghita perceptionwithoutpreconceptioncomparisonbetweenthehumanandmachinelearnerinrecognitionoftissuesfromhistologicalsections
AT sanyalparikshit perceptionwithoutpreconceptioncomparisonbetweenthehumanandmachinelearnerinrecognitionoftissuesfromhistologicalsections
AT rajmohanks perceptionwithoutpreconceptioncomparisonbetweenthehumanandmachinelearnerinrecognitionoftissuesfromhistologicalsections
AT malikajay perceptionwithoutpreconceptioncomparisonbetweenthehumanandmachinelearnerinrecognitionoftissuesfromhistologicalsections
AT dudanisharmila perceptionwithoutpreconceptioncomparisonbetweenthehumanandmachinelearnerinrecognitionoftissuesfromhistologicalsections