Cargando…

Automated human cell classification in sparse datasets using few-shot learning

Classifying and analyzing human cells is a lengthy procedure, often involving a trained professional. In an attempt to expedite this process, an active area of research involves automating cell classification through use of deep learning-based techniques. In practice, a large amount of data is requi...

Descripción completa

Detalles Bibliográficos
Autores principales: Walsh, Reece, Abdelpakey, Mohamed H., Shehata, Mohamed S., Mohamed, Mostafa M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8861170/
https://www.ncbi.nlm.nih.gov/pubmed/35190567
http://dx.doi.org/10.1038/s41598-022-06718-2
_version_ 1784654830268055552
author Walsh, Reece
Abdelpakey, Mohamed H.
Shehata, Mohamed S.
Mohamed, Mostafa M.
author_facet Walsh, Reece
Abdelpakey, Mohamed H.
Shehata, Mohamed S.
Mohamed, Mostafa M.
author_sort Walsh, Reece
collection PubMed
description Classifying and analyzing human cells is a lengthy procedure, often involving a trained professional. In an attempt to expedite this process, an active area of research involves automating cell classification through use of deep learning-based techniques. In practice, a large amount of data is required to accurately train these deep learning models. However, due to the sparse human cell datasets currently available, the performance of these models is typically low. This study investigates the feasibility of using few-shot learning-based techniques to mitigate the data requirements for accurate training. The study is comprised of three parts: First, current state-of-the-art few-shot learning techniques are evaluated on human cell classification. The selected techniques are trained on a non-medical dataset and then tested on two out-of-domain, human cell datasets. The results indicate that, overall, the test accuracy of state-of-the-art techniques decreased by at least 30% when transitioning from a non-medical dataset to a medical dataset. Reptile and EPNet were the top performing techniques tested on the BCCD dataset and HEp-2 dataset respectively. Second, this study evaluates the potential benefits, if any, to varying the backbone architecture and training schemes in current state-of-the-art few-shot learning techniques when used in human cell classification. To this end, the best technique identified in the first part of this study, EPNet, is used for experimentation. In particular, the study used 6 different network backbones, 5 data augmentation methodologies, and 2 model training schemes. Even with these additions, the overall test accuracy of EPNet decreased from 88.66% on non-medical datasets to 44.13% at best on the medical datasets. Third, this study presents future directions for using few-shot learning in human cell classification. In general, few-shot learning in its current state performs poorly on human cell classification. The study proves that attempts to modify existing network architectures are not effective and concludes that future research effort should be focused on improving robustness towards out-of-domain testing using optimization-based or self-supervised few-shot learning techniques.
format Online
Article
Text
id pubmed-8861170
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-88611702022-02-23 Automated human cell classification in sparse datasets using few-shot learning Walsh, Reece Abdelpakey, Mohamed H. Shehata, Mohamed S. Mohamed, Mostafa M. Sci Rep Article Classifying and analyzing human cells is a lengthy procedure, often involving a trained professional. In an attempt to expedite this process, an active area of research involves automating cell classification through use of deep learning-based techniques. In practice, a large amount of data is required to accurately train these deep learning models. However, due to the sparse human cell datasets currently available, the performance of these models is typically low. This study investigates the feasibility of using few-shot learning-based techniques to mitigate the data requirements for accurate training. The study is comprised of three parts: First, current state-of-the-art few-shot learning techniques are evaluated on human cell classification. The selected techniques are trained on a non-medical dataset and then tested on two out-of-domain, human cell datasets. The results indicate that, overall, the test accuracy of state-of-the-art techniques decreased by at least 30% when transitioning from a non-medical dataset to a medical dataset. Reptile and EPNet were the top performing techniques tested on the BCCD dataset and HEp-2 dataset respectively. Second, this study evaluates the potential benefits, if any, to varying the backbone architecture and training schemes in current state-of-the-art few-shot learning techniques when used in human cell classification. To this end, the best technique identified in the first part of this study, EPNet, is used for experimentation. In particular, the study used 6 different network backbones, 5 data augmentation methodologies, and 2 model training schemes. Even with these additions, the overall test accuracy of EPNet decreased from 88.66% on non-medical datasets to 44.13% at best on the medical datasets. Third, this study presents future directions for using few-shot learning in human cell classification. In general, few-shot learning in its current state performs poorly on human cell classification. The study proves that attempts to modify existing network architectures are not effective and concludes that future research effort should be focused on improving robustness towards out-of-domain testing using optimization-based or self-supervised few-shot learning techniques. Nature Publishing Group UK 2022-02-21 /pmc/articles/PMC8861170/ /pubmed/35190567 http://dx.doi.org/10.1038/s41598-022-06718-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Walsh, Reece
Abdelpakey, Mohamed H.
Shehata, Mohamed S.
Mohamed, Mostafa M.
Automated human cell classification in sparse datasets using few-shot learning
title Automated human cell classification in sparse datasets using few-shot learning
title_full Automated human cell classification in sparse datasets using few-shot learning
title_fullStr Automated human cell classification in sparse datasets using few-shot learning
title_full_unstemmed Automated human cell classification in sparse datasets using few-shot learning
title_short Automated human cell classification in sparse datasets using few-shot learning
title_sort automated human cell classification in sparse datasets using few-shot learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8861170/
https://www.ncbi.nlm.nih.gov/pubmed/35190567
http://dx.doi.org/10.1038/s41598-022-06718-2
work_keys_str_mv AT walshreece automatedhumancellclassificationinsparsedatasetsusingfewshotlearning
AT abdelpakeymohamedh automatedhumancellclassificationinsparsedatasetsusingfewshotlearning
AT shehatamohameds automatedhumancellclassificationinsparsedatasetsusingfewshotlearning
AT mohamedmostafam automatedhumancellclassificationinsparsedatasetsusingfewshotlearning