Cargando…

On the improvement of reinforcement active learning with the involvement of cross entropy to address one-shot learning problem

As a promising research direction in recent decades, active learning allows an oracle to assign labels to typical examples for performance improvement in learning systems. Existing works mainly focus on designing criteria for screening examples of high value to be labeled in a handcrafted manner. In...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Honglan, Huang, Jincai, Feng, Yanghe, Zhang, Jiarui, Liu, Zhong, Wang, Qi, Chen, Li
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6583946/
https://www.ncbi.nlm.nih.gov/pubmed/31216289
http://dx.doi.org/10.1371/journal.pone.0217408
_version_ 1783428453745295360
author Huang, Honglan
Huang, Jincai
Feng, Yanghe
Zhang, Jiarui
Liu, Zhong
Wang, Qi
Chen, Li
author_facet Huang, Honglan
Huang, Jincai
Feng, Yanghe
Zhang, Jiarui
Liu, Zhong
Wang, Qi
Chen, Li
author_sort Huang, Honglan
collection PubMed
description As a promising research direction in recent decades, active learning allows an oracle to assign labels to typical examples for performance improvement in learning systems. Existing works mainly focus on designing criteria for screening examples of high value to be labeled in a handcrafted manner. Instead of manually developing strategies of querying the user to access labels for the desired examples, we utilized the reinforcement learning algorithm parameterized with the neural network to automatically explore query strategies in active learning when addressing stream-based one-shot classification problems. With the involvement of cross-entropy in the loss function of Q-learning, an efficient policy to decide when and where to predict or query an instance is learned through the developed framework. Compared with a former influential work, the advantages of our method are demonstrated experimentally with two image classification tasks, and it exhibited better performance, quick convergence, relatively good stability and fewer requests for labels.
format Online
Article
Text
id pubmed-6583946
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-65839462019-06-28 On the improvement of reinforcement active learning with the involvement of cross entropy to address one-shot learning problem Huang, Honglan Huang, Jincai Feng, Yanghe Zhang, Jiarui Liu, Zhong Wang, Qi Chen, Li PLoS One Research Article As a promising research direction in recent decades, active learning allows an oracle to assign labels to typical examples for performance improvement in learning systems. Existing works mainly focus on designing criteria for screening examples of high value to be labeled in a handcrafted manner. Instead of manually developing strategies of querying the user to access labels for the desired examples, we utilized the reinforcement learning algorithm parameterized with the neural network to automatically explore query strategies in active learning when addressing stream-based one-shot classification problems. With the involvement of cross-entropy in the loss function of Q-learning, an efficient policy to decide when and where to predict or query an instance is learned through the developed framework. Compared with a former influential work, the advantages of our method are demonstrated experimentally with two image classification tasks, and it exhibited better performance, quick convergence, relatively good stability and fewer requests for labels. Public Library of Science 2019-06-19 /pmc/articles/PMC6583946/ /pubmed/31216289 http://dx.doi.org/10.1371/journal.pone.0217408 Text en © 2019 Huang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Huang, Honglan
Huang, Jincai
Feng, Yanghe
Zhang, Jiarui
Liu, Zhong
Wang, Qi
Chen, Li
On the improvement of reinforcement active learning with the involvement of cross entropy to address one-shot learning problem
title On the improvement of reinforcement active learning with the involvement of cross entropy to address one-shot learning problem
title_full On the improvement of reinforcement active learning with the involvement of cross entropy to address one-shot learning problem
title_fullStr On the improvement of reinforcement active learning with the involvement of cross entropy to address one-shot learning problem
title_full_unstemmed On the improvement of reinforcement active learning with the involvement of cross entropy to address one-shot learning problem
title_short On the improvement of reinforcement active learning with the involvement of cross entropy to address one-shot learning problem
title_sort on the improvement of reinforcement active learning with the involvement of cross entropy to address one-shot learning problem
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6583946/
https://www.ncbi.nlm.nih.gov/pubmed/31216289
http://dx.doi.org/10.1371/journal.pone.0217408
work_keys_str_mv AT huanghonglan ontheimprovementofreinforcementactivelearningwiththeinvolvementofcrossentropytoaddressoneshotlearningproblem
AT huangjincai ontheimprovementofreinforcementactivelearningwiththeinvolvementofcrossentropytoaddressoneshotlearningproblem
AT fengyanghe ontheimprovementofreinforcementactivelearningwiththeinvolvementofcrossentropytoaddressoneshotlearningproblem
AT zhangjiarui ontheimprovementofreinforcementactivelearningwiththeinvolvementofcrossentropytoaddressoneshotlearningproblem
AT liuzhong ontheimprovementofreinforcementactivelearningwiththeinvolvementofcrossentropytoaddressoneshotlearningproblem
AT wangqi ontheimprovementofreinforcementactivelearningwiththeinvolvementofcrossentropytoaddressoneshotlearningproblem
AT chenli ontheimprovementofreinforcementactivelearningwiththeinvolvementofcrossentropytoaddressoneshotlearningproblem