Cargando…
Automated human cell classification in sparse datasets using few-shot learning
Classifying and analyzing human cells is a lengthy procedure, often involving a trained professional. In an attempt to expedite this process, an active area of research involves automating cell classification through use of deep learning-based techniques. In practice, a large amount of data is requi...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8861170/ https://www.ncbi.nlm.nih.gov/pubmed/35190567 http://dx.doi.org/10.1038/s41598-022-06718-2 |
_version_ | 1784654830268055552 |
---|---|
author | Walsh, Reece Abdelpakey, Mohamed H. Shehata, Mohamed S. Mohamed, Mostafa M. |
author_facet | Walsh, Reece Abdelpakey, Mohamed H. Shehata, Mohamed S. Mohamed, Mostafa M. |
author_sort | Walsh, Reece |
collection | PubMed |
description | Classifying and analyzing human cells is a lengthy procedure, often involving a trained professional. In an attempt to expedite this process, an active area of research involves automating cell classification through use of deep learning-based techniques. In practice, a large amount of data is required to accurately train these deep learning models. However, due to the sparse human cell datasets currently available, the performance of these models is typically low. This study investigates the feasibility of using few-shot learning-based techniques to mitigate the data requirements for accurate training. The study is comprised of three parts: First, current state-of-the-art few-shot learning techniques are evaluated on human cell classification. The selected techniques are trained on a non-medical dataset and then tested on two out-of-domain, human cell datasets. The results indicate that, overall, the test accuracy of state-of-the-art techniques decreased by at least 30% when transitioning from a non-medical dataset to a medical dataset. Reptile and EPNet were the top performing techniques tested on the BCCD dataset and HEp-2 dataset respectively. Second, this study evaluates the potential benefits, if any, to varying the backbone architecture and training schemes in current state-of-the-art few-shot learning techniques when used in human cell classification. To this end, the best technique identified in the first part of this study, EPNet, is used for experimentation. In particular, the study used 6 different network backbones, 5 data augmentation methodologies, and 2 model training schemes. Even with these additions, the overall test accuracy of EPNet decreased from 88.66% on non-medical datasets to 44.13% at best on the medical datasets. Third, this study presents future directions for using few-shot learning in human cell classification. In general, few-shot learning in its current state performs poorly on human cell classification. The study proves that attempts to modify existing network architectures are not effective and concludes that future research effort should be focused on improving robustness towards out-of-domain testing using optimization-based or self-supervised few-shot learning techniques. |
format | Online Article Text |
id | pubmed-8861170 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-88611702022-02-23 Automated human cell classification in sparse datasets using few-shot learning Walsh, Reece Abdelpakey, Mohamed H. Shehata, Mohamed S. Mohamed, Mostafa M. Sci Rep Article Classifying and analyzing human cells is a lengthy procedure, often involving a trained professional. In an attempt to expedite this process, an active area of research involves automating cell classification through use of deep learning-based techniques. In practice, a large amount of data is required to accurately train these deep learning models. However, due to the sparse human cell datasets currently available, the performance of these models is typically low. This study investigates the feasibility of using few-shot learning-based techniques to mitigate the data requirements for accurate training. The study is comprised of three parts: First, current state-of-the-art few-shot learning techniques are evaluated on human cell classification. The selected techniques are trained on a non-medical dataset and then tested on two out-of-domain, human cell datasets. The results indicate that, overall, the test accuracy of state-of-the-art techniques decreased by at least 30% when transitioning from a non-medical dataset to a medical dataset. Reptile and EPNet were the top performing techniques tested on the BCCD dataset and HEp-2 dataset respectively. Second, this study evaluates the potential benefits, if any, to varying the backbone architecture and training schemes in current state-of-the-art few-shot learning techniques when used in human cell classification. To this end, the best technique identified in the first part of this study, EPNet, is used for experimentation. In particular, the study used 6 different network backbones, 5 data augmentation methodologies, and 2 model training schemes. Even with these additions, the overall test accuracy of EPNet decreased from 88.66% on non-medical datasets to 44.13% at best on the medical datasets. Third, this study presents future directions for using few-shot learning in human cell classification. In general, few-shot learning in its current state performs poorly on human cell classification. The study proves that attempts to modify existing network architectures are not effective and concludes that future research effort should be focused on improving robustness towards out-of-domain testing using optimization-based or self-supervised few-shot learning techniques. Nature Publishing Group UK 2022-02-21 /pmc/articles/PMC8861170/ /pubmed/35190567 http://dx.doi.org/10.1038/s41598-022-06718-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Walsh, Reece Abdelpakey, Mohamed H. Shehata, Mohamed S. Mohamed, Mostafa M. Automated human cell classification in sparse datasets using few-shot learning |
title | Automated human cell classification in sparse datasets using few-shot learning |
title_full | Automated human cell classification in sparse datasets using few-shot learning |
title_fullStr | Automated human cell classification in sparse datasets using few-shot learning |
title_full_unstemmed | Automated human cell classification in sparse datasets using few-shot learning |
title_short | Automated human cell classification in sparse datasets using few-shot learning |
title_sort | automated human cell classification in sparse datasets using few-shot learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8861170/ https://www.ncbi.nlm.nih.gov/pubmed/35190567 http://dx.doi.org/10.1038/s41598-022-06718-2 |
work_keys_str_mv | AT walshreece automatedhumancellclassificationinsparsedatasetsusingfewshotlearning AT abdelpakeymohamedh automatedhumancellclassificationinsparsedatasetsusingfewshotlearning AT shehatamohameds automatedhumancellclassificationinsparsedatasetsusingfewshotlearning AT mohamedmostafam automatedhumancellclassificationinsparsedatasetsusingfewshotlearning |