
Investigating Multi-Level Semantic Extraction with Squash Capsules for Short Text Classification

Short text classification is an active topic in natural language processing, and the sparseness and irregularity of short text still make the task challenging. In this paper, we propose a new classification model from the aspects of short te...


Bibliographic Details
Main Authors:	Li, Jing; Zhang, Dezheng; Wulamu, Aziguli
Format:	Online Article Text
Language:	English
Published:	MDPI 2022
Subjects:	Article
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9141385/ https://www.ncbi.nlm.nih.gov/pubmed/35626475 http://dx.doi.org/10.3390/e24050590

_version_	1784715333153587200
author	Li, Jing; Zhang, Dezheng; Wulamu, Aziguli
author_facet	Li, Jing; Zhang, Dezheng; Wulamu, Aziguli
author_sort	Li, Jing
collection	PubMed
description	Short text classification is an active topic in natural language processing, and the sparseness and irregularity of short text still make the task challenging. In this paper, we propose a new classification model that addresses short text representation, global feature extraction and local feature extraction. We use convolutional networks to extract shallow features from the vectorized short text, and introduce a multi-level semantic extraction framework. It uses BiLSTM as the encoding layer, while the attention mechanism and normalization serve as the interaction layer. Finally, we concatenate the convolutional feature vector with the output of the semantic framework. After several rounds of feature integration, the framework improves the quality of the feature representation. Combined with the capsule network, we obtain high-level local information by dynamic routing and then squash the result. In addition, we explore the optimal depth of semantic feature extraction for short text based on the multi-level semantic framework. We use four benchmark datasets to demonstrate that our model provides comparable results. The experimental results show that the accuracies on SUBJ, TREC, MR and ProcCons are 93.8%, 91.94%, 82.81% and 98.43%, respectively, which verifies that our model greatly improves classification accuracy and model robustness.
format	Online Article Text
id	pubmed-9141385
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-9141385 2022-05-28 Investigating Multi-Level Semantic Extraction with Squash Capsules for Short Text Classification Li, Jing; Zhang, Dezheng; Wulamu, Aziguli Entropy (Basel) Article Short text classification is an active topic in natural language processing, and the sparseness and irregularity of short text still make the task challenging. In this paper, we propose a new classification model that addresses short text representation, global feature extraction and local feature extraction. We use convolutional networks to extract shallow features from the vectorized short text, and introduce a multi-level semantic extraction framework. It uses BiLSTM as the encoding layer, while the attention mechanism and normalization serve as the interaction layer. Finally, we concatenate the convolutional feature vector with the output of the semantic framework. After several rounds of feature integration, the framework improves the quality of the feature representation. Combined with the capsule network, we obtain high-level local information by dynamic routing and then squash the result. In addition, we explore the optimal depth of semantic feature extraction for short text based on the multi-level semantic framework. We use four benchmark datasets to demonstrate that our model provides comparable results. The experimental results show that the accuracies on SUBJ, TREC, MR and ProcCons are 93.8%, 91.94%, 82.81% and 98.43%, respectively, which verifies that our model greatly improves classification accuracy and model robustness. MDPI 2022-04-23 /pmc/articles/PMC9141385/ /pubmed/35626475 http://dx.doi.org/10.3390/e24050590 Text en © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Li, Jing Zhang, Dezheng Wulamu, Aziguli Investigating Multi-Level Semantic Extraction with Squash Capsules for Short Text Classification
title	Investigating Multi-Level Semantic Extraction with Squash Capsules for Short Text Classification
title_full	Investigating Multi-Level Semantic Extraction with Squash Capsules for Short Text Classification
title_fullStr	Investigating Multi-Level Semantic Extraction with Squash Capsules for Short Text Classification
title_full_unstemmed	Investigating Multi-Level Semantic Extraction with Squash Capsules for Short Text Classification
title_short	Investigating Multi-Level Semantic Extraction with Squash Capsules for Short Text Classification
title_sort	investigating multi-level semantic extraction with squash capsules for short text classification
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9141385/ https://www.ncbi.nlm.nih.gov/pubmed/35626475 http://dx.doi.org/10.3390/e24050590
work_keys_str_mv	AT lijing investigatingmultilevelsemanticextractionwithsquashcapsulesforshorttextclassification AT zhangdezheng investigatingmultilevelsemanticextractionwithsquashcapsulesforshorttextclassification AT wulamuaziguli investigatingmultilevelsemanticextractionwithsquashcapsulesforshorttextclassification
