Cargando…
Few-Shot Text Classification with Global–Local Feature Information
Meta-learning frameworks have been proposed to generalize machine learning models for domain adaptation without sufficient label data in computer vision. However, text classification with meta-learning is less investigated. In this paper, we propose SumFS to find global top-ranked sentences by extra...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9229404/ https://www.ncbi.nlm.nih.gov/pubmed/35746202 http://dx.doi.org/10.3390/s22124420 |
_version_ | 1784734739051053056 |
---|---|
author | Wang, Depei Wang, Zhuowei Cheng, Lianglun Zhang, Weiwen |
author_facet | Wang, Depei Wang, Zhuowei Cheng, Lianglun Zhang, Weiwen |
author_sort | Wang, Depei |
collection | PubMed |
description | Meta-learning frameworks have been proposed to generalize machine learning models for domain adaptation without sufficient label data in computer vision. However, text classification with meta-learning is less investigated. In this paper, we propose SumFS to find global top-ranked sentences by extractive summary and improve the local vocabulary category features. The SumFS consists of three modules: (1) an unsupervised text summarizer that removes redundant information; (2) a weighting generator that associates feature words with attention scores to weight the lexical representations of words; (3) a regular meta-learning framework that trains with limited labeled data using a ridge regression classifier. In addition, a marine news dataset was established with limited label data. The performance of the algorithm was tested on THUCnews, Fudan, and marine news datasets. Experiments show that the SumFS can maintain or even improve accuracy while reducing input features. Moreover, the training time of each epoch is reduced by more than 50%. |
format | Online Article Text |
id | pubmed-9229404 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-92294042022-06-25 Few-Shot Text Classification with Global–Local Feature Information Wang, Depei Wang, Zhuowei Cheng, Lianglun Zhang, Weiwen Sensors (Basel) Article Meta-learning frameworks have been proposed to generalize machine learning models for domain adaptation without sufficient label data in computer vision. However, text classification with meta-learning is less investigated. In this paper, we propose SumFS to find global top-ranked sentences by extractive summary and improve the local vocabulary category features. The SumFS consists of three modules: (1) an unsupervised text summarizer that removes redundant information; (2) a weighting generator that associates feature words with attention scores to weight the lexical representations of words; (3) a regular meta-learning framework that trains with limited labeled data using a ridge regression classifier. In addition, a marine news dataset was established with limited label data. The performance of the algorithm was tested on THUCnews, Fudan, and marine news datasets. Experiments show that the SumFS can maintain or even improve accuracy while reducing input features. Moreover, the training time of each epoch is reduced by more than 50%. MDPI 2022-06-11 /pmc/articles/PMC9229404/ /pubmed/35746202 http://dx.doi.org/10.3390/s22124420 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Depei Wang, Zhuowei Cheng, Lianglun Zhang, Weiwen Few-Shot Text Classification with Global–Local Feature Information |
title | Few-Shot Text Classification with Global–Local Feature Information |
title_full | Few-Shot Text Classification with Global–Local Feature Information |
title_fullStr | Few-Shot Text Classification with Global–Local Feature Information |
title_full_unstemmed | Few-Shot Text Classification with Global–Local Feature Information |
title_short | Few-Shot Text Classification with Global–Local Feature Information |
title_sort | few-shot text classification with global–local feature information |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9229404/ https://www.ncbi.nlm.nih.gov/pubmed/35746202 http://dx.doi.org/10.3390/s22124420 |
work_keys_str_mv | AT wangdepei fewshottextclassificationwithgloballocalfeatureinformation AT wangzhuowei fewshottextclassificationwithgloballocalfeatureinformation AT chenglianglun fewshottextclassificationwithgloballocalfeatureinformation AT zhangweiwen fewshottextclassificationwithgloballocalfeatureinformation |