Cargando…
Investigating Multi-Level Semantic Extraction with Squash Capsules for Short Text Classification
At present, short text classification is a hot topic in the area of natural language processing. Due to the sparseness and irregularity of short text, the task of short text classification still faces great challenges. In this paper, we propose a new classification model from the aspects of short te...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9141385/ https://www.ncbi.nlm.nih.gov/pubmed/35626475 http://dx.doi.org/10.3390/e24050590 |
_version_ | 1784715333153587200 |
---|---|
author | Li, Jing Zhang, Dezheng Wulamu, Aziguli |
author_facet | Li, Jing Zhang, Dezheng Wulamu, Aziguli |
author_sort | Li, Jing |
collection | PubMed |
description | At present, short text classification is a hot topic in the area of natural language processing. Due to the sparseness and irregularity of short text, the task of short text classification still faces great challenges. In this paper, we propose a new classification model from the aspects of short text representation, global feature extraction and local feature extraction. We use convolutional networks to extract shallow features from short text vectorization, and introduce a multi-level semantic extraction framework. It uses BiLSTM as the encoding layer while the attention mechanism and normalization are used as the interaction layer. Finally, we concatenate the convolution feature vector and semantic results of the semantic framework. After several rounds of feature integration, the framework improves the quality of the feature representation. Combined with the capsule network, we obtain high-level local information by dynamic routing and then squash them. In addition, we explore the optimal depth of semantic feature extraction for short text based on a multi-level semantic framework. We utilized four benchmark datasets to demonstrate that our model provides comparable results. The experimental results show that the accuracy of SUBJ, TREC, MR and ProcCons are 93.8%, 91.94%, 82.81% and 98.43%, respectively, which verifies that our model has greatly improves classification accuracy and model robustness. |
format | Online Article Text |
id | pubmed-9141385 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-91413852022-05-28 Investigating Multi-Level Semantic Extraction with Squash Capsules for Short Text Classification Li, Jing Zhang, Dezheng Wulamu, Aziguli Entropy (Basel) Article At present, short text classification is a hot topic in the area of natural language processing. Due to the sparseness and irregularity of short text, the task of short text classification still faces great challenges. In this paper, we propose a new classification model from the aspects of short text representation, global feature extraction and local feature extraction. We use convolutional networks to extract shallow features from short text vectorization, and introduce a multi-level semantic extraction framework. It uses BiLSTM as the encoding layer while the attention mechanism and normalization are used as the interaction layer. Finally, we concatenate the convolution feature vector and semantic results of the semantic framework. After several rounds of feature integration, the framework improves the quality of the feature representation. Combined with the capsule network, we obtain high-level local information by dynamic routing and then squash them. In addition, we explore the optimal depth of semantic feature extraction for short text based on a multi-level semantic framework. We utilized four benchmark datasets to demonstrate that our model provides comparable results. The experimental results show that the accuracy of SUBJ, TREC, MR and ProcCons are 93.8%, 91.94%, 82.81% and 98.43%, respectively, which verifies that our model has greatly improves classification accuracy and model robustness. MDPI 2022-04-23 /pmc/articles/PMC9141385/ /pubmed/35626475 http://dx.doi.org/10.3390/e24050590 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Li, Jing Zhang, Dezheng Wulamu, Aziguli Investigating Multi-Level Semantic Extraction with Squash Capsules for Short Text Classification |
title | Investigating Multi-Level Semantic Extraction with Squash Capsules for Short Text Classification |
title_full | Investigating Multi-Level Semantic Extraction with Squash Capsules for Short Text Classification |
title_fullStr | Investigating Multi-Level Semantic Extraction with Squash Capsules for Short Text Classification |
title_full_unstemmed | Investigating Multi-Level Semantic Extraction with Squash Capsules for Short Text Classification |
title_short | Investigating Multi-Level Semantic Extraction with Squash Capsules for Short Text Classification |
title_sort | investigating multi-level semantic extraction with squash capsules for short text classification |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9141385/ https://www.ncbi.nlm.nih.gov/pubmed/35626475 http://dx.doi.org/10.3390/e24050590 |
work_keys_str_mv | AT lijing investigatingmultilevelsemanticextractionwithsquashcapsulesforshorttextclassification AT zhangdezheng investigatingmultilevelsemanticextractionwithsquashcapsulesforshorttextclassification AT wulamuaziguli investigatingmultilevelsemanticextractionwithsquashcapsulesforshorttextclassification |