Cargando…

A conditional GAN-based approach for enhancing transfer learning performance in few-shot HCR tasks

Supervised learning with the restriction of a few existing training samples is called Few-Shot Learning. FSL is a subarea that puts deep learning performance in a gap, as building robust deep networks requires big training data. Using transfer learning in FSL tasks is an acceptable way to avoid the...

Descripción completa

Detalles Bibliográficos
Autores principales:	Elaraby, Nagwa, Barakat, Sherif, Rezk, Amira
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9520122/ https://www.ncbi.nlm.nih.gov/pubmed/36175593 http://dx.doi.org/10.1038/s41598-022-20654-1

_version_	1784799554544074752
author	Elaraby, Nagwa Barakat, Sherif Rezk, Amira
author_facet	Elaraby, Nagwa Barakat, Sherif Rezk, Amira
author_sort	Elaraby, Nagwa
collection	PubMed
description	Supervised learning with the restriction of a few existing training samples is called Few-Shot Learning. FSL is a subarea that puts deep learning performance in a gap, as building robust deep networks requires big training data. Using transfer learning in FSL tasks is an acceptable way to avoid the challenge of building new deep models from scratch. Transfer learning methodology considers borrowing the architecture and parameters of a previously trained model on a large-scale dataset and fine-tuning it for low-data target tasks. But practically, fine-tuning pretrained models in target FSL tasks suffers from overfitting. The few existing samples are not enough to correctly adjust the pretrained model’s parameters to provide the best fit for the target task. In this study, we consider mitigating the overfitting problem when applying transfer learning in few-shot Handwritten Character Recognition (HCR) tasks. A data augmentation approach based on Conditional Generative Adversarial Networks is introduced. CGAN is a generative model that can create artificial instances that appear more real and indistinguishable from the original samples. CGAN helps generate extra samples that hold the possible variations of human handwriting instead of applying traditional image transformations. These transformations are low-level, data-independent operations, and only produce augmented samples with limited diversity. The introduced approach was evaluated in fine-tuning the three pretrained models: AlexNet, VGG-16, and GoogleNet. The results show that the samples generated by CGAN can enhance transfer learning performance in few-shot HCR tasks. This is by achieving model fine-tuning with fewer epochs and by increasing the model’s [Formula: see text] and decreasing the Generalization Error [Formula: see text] .
format	Online Article Text
id	pubmed-9520122
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-95201222022-09-29 A conditional GAN-based approach for enhancing transfer learning performance in few-shot HCR tasks Elaraby, Nagwa Barakat, Sherif Rezk, Amira Sci Rep Article Supervised learning with the restriction of a few existing training samples is called Few-Shot Learning. FSL is a subarea that puts deep learning performance in a gap, as building robust deep networks requires big training data. Using transfer learning in FSL tasks is an acceptable way to avoid the challenge of building new deep models from scratch. Transfer learning methodology considers borrowing the architecture and parameters of a previously trained model on a large-scale dataset and fine-tuning it for low-data target tasks. But practically, fine-tuning pretrained models in target FSL tasks suffers from overfitting. The few existing samples are not enough to correctly adjust the pretrained model’s parameters to provide the best fit for the target task. In this study, we consider mitigating the overfitting problem when applying transfer learning in few-shot Handwritten Character Recognition (HCR) tasks. A data augmentation approach based on Conditional Generative Adversarial Networks is introduced. CGAN is a generative model that can create artificial instances that appear more real and indistinguishable from the original samples. CGAN helps generate extra samples that hold the possible variations of human handwriting instead of applying traditional image transformations. These transformations are low-level, data-independent operations, and only produce augmented samples with limited diversity. The introduced approach was evaluated in fine-tuning the three pretrained models: AlexNet, VGG-16, and GoogleNet. The results show that the samples generated by CGAN can enhance transfer learning performance in few-shot HCR tasks. This is by achieving model fine-tuning with fewer epochs and by increasing the model’s [Formula: see text] and decreasing the Generalization Error [Formula: see text] . Nature Publishing Group UK 2022-09-29 /pmc/articles/PMC9520122/ /pubmed/36175593 http://dx.doi.org/10.1038/s41598-022-20654-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Article Elaraby, Nagwa Barakat, Sherif Rezk, Amira A conditional GAN-based approach for enhancing transfer learning performance in few-shot HCR tasks
title	A conditional GAN-based approach for enhancing transfer learning performance in few-shot HCR tasks
title_full	A conditional GAN-based approach for enhancing transfer learning performance in few-shot HCR tasks
title_fullStr	A conditional GAN-based approach for enhancing transfer learning performance in few-shot HCR tasks
title_full_unstemmed	A conditional GAN-based approach for enhancing transfer learning performance in few-shot HCR tasks
title_short	A conditional GAN-based approach for enhancing transfer learning performance in few-shot HCR tasks
title_sort	conditional gan-based approach for enhancing transfer learning performance in few-shot hcr tasks
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9520122/ https://www.ncbi.nlm.nih.gov/pubmed/36175593 http://dx.doi.org/10.1038/s41598-022-20654-1
work_keys_str_mv	AT elarabynagwa aconditionalganbasedapproachforenhancingtransferlearningperformanceinfewshothcrtasks AT barakatsherif aconditionalganbasedapproachforenhancingtransferlearningperformanceinfewshothcrtasks AT rezkamira aconditionalganbasedapproachforenhancingtransferlearningperformanceinfewshothcrtasks AT elarabynagwa conditionalganbasedapproachforenhancingtransferlearningperformanceinfewshothcrtasks AT barakatsherif conditionalganbasedapproachforenhancingtransferlearningperformanceinfewshothcrtasks AT rezkamira conditionalganbasedapproachforenhancingtransferlearningperformanceinfewshothcrtasks

A conditional GAN-based approach for enhancing transfer learning performance in few-shot HCR tasks

Ejemplares similares