Cargando…
Sentiment Classification of Chinese Tourism Reviews Based on ERNIE-Gram+GCN
Nowadays, tourists increasingly prefer to check the reviews of attractions before traveling to decide whether to visit them or not. To respond to the change in the way tourists choose attractions, it is important to classify the reviews of attractions with high precision. In addition, more and more...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9602456/ https://www.ncbi.nlm.nih.gov/pubmed/36294096 http://dx.doi.org/10.3390/ijerph192013520 |
_version_ | 1784817322023714816 |
---|---|
author | Yang, Senqi Duan, Xuliang Xiao, Zeyan Li, Zhiyao Liu, Yuhai Jie, Zhihao Tang, Dezhao Du, Hui |
author_facet | Yang, Senqi Duan, Xuliang Xiao, Zeyan Li, Zhiyao Liu, Yuhai Jie, Zhihao Tang, Dezhao Du, Hui |
author_sort | Yang, Senqi |
collection | PubMed |
description | Nowadays, tourists increasingly prefer to check the reviews of attractions before traveling to decide whether to visit them or not. To respond to the change in the way tourists choose attractions, it is important to classify the reviews of attractions with high precision. In addition, more and more tourists like to use emojis to express their satisfaction or dissatisfaction with the attractions. In this paper, we built a dataset for Chinese attraction evaluation incorporating emojis (CAEIE) and proposed an explicitly n-gram masking method to enhance the integration of coarse-grained information into a pre-training (ERNIE-Gram) and Text Graph Convolutional Network (textGCN) (E2G) model to classify the dataset with a high accuracy. The E2G preprocesses the text and feeds it to ERNIE-Gram and TextGCN. ERNIE-Gram was trained using its unique mask mechanism to obtain the final probabilities. TextGCN used the dataset to construct heterogeneous graphs with comment text and words, which were trained to obtain a representation of the document output category probabilities. The two probabilities were calculated to obtain the final results. To demonstrate the validity of the E2G model, this paper was compared with advanced models. After experiments, it was shown that E2G had a good classification effect on the CAEIE dataset, and the accuracy of classification was up to 97.37%. Furthermore, the accuracy of E2G was 1.37% and 1.35% ahead of ERNIE-Gram and TextGCN, respectively. In addition, two sets of comparison experiments were conducted to verify the performance of TextGCN and TextGAT on the CAEIE dataset. The final results showed that ERNIE and ERNIE-Gram combined TextGCN and TextGAT, respectively, and TextGCN performed 1.6% and 2.15% ahead. This paper compared the effects of eight activation functions on the second layer of the TextGCN and the activation-function-rectified linear unit 6 (RELU6) with the best results based on experiments. |
format | Online Article Text |
id | pubmed-9602456 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-96024562022-10-27 Sentiment Classification of Chinese Tourism Reviews Based on ERNIE-Gram+GCN Yang, Senqi Duan, Xuliang Xiao, Zeyan Li, Zhiyao Liu, Yuhai Jie, Zhihao Tang, Dezhao Du, Hui Int J Environ Res Public Health Article Nowadays, tourists increasingly prefer to check the reviews of attractions before traveling to decide whether to visit them or not. To respond to the change in the way tourists choose attractions, it is important to classify the reviews of attractions with high precision. In addition, more and more tourists like to use emojis to express their satisfaction or dissatisfaction with the attractions. In this paper, we built a dataset for Chinese attraction evaluation incorporating emojis (CAEIE) and proposed an explicitly n-gram masking method to enhance the integration of coarse-grained information into a pre-training (ERNIE-Gram) and Text Graph Convolutional Network (textGCN) (E2G) model to classify the dataset with a high accuracy. The E2G preprocesses the text and feeds it to ERNIE-Gram and TextGCN. ERNIE-Gram was trained using its unique mask mechanism to obtain the final probabilities. TextGCN used the dataset to construct heterogeneous graphs with comment text and words, which were trained to obtain a representation of the document output category probabilities. The two probabilities were calculated to obtain the final results. To demonstrate the validity of the E2G model, this paper was compared with advanced models. After experiments, it was shown that E2G had a good classification effect on the CAEIE dataset, and the accuracy of classification was up to 97.37%. Furthermore, the accuracy of E2G was 1.37% and 1.35% ahead of ERNIE-Gram and TextGCN, respectively. In addition, two sets of comparison experiments were conducted to verify the performance of TextGCN and TextGAT on the CAEIE dataset. The final results showed that ERNIE and ERNIE-Gram combined TextGCN and TextGAT, respectively, and TextGCN performed 1.6% and 2.15% ahead. This paper compared the effects of eight activation functions on the second layer of the TextGCN and the activation-function-rectified linear unit 6 (RELU6) with the best results based on experiments. MDPI 2022-10-19 /pmc/articles/PMC9602456/ /pubmed/36294096 http://dx.doi.org/10.3390/ijerph192013520 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Yang, Senqi Duan, Xuliang Xiao, Zeyan Li, Zhiyao Liu, Yuhai Jie, Zhihao Tang, Dezhao Du, Hui Sentiment Classification of Chinese Tourism Reviews Based on ERNIE-Gram+GCN |
title | Sentiment Classification of Chinese Tourism Reviews Based on ERNIE-Gram+GCN |
title_full | Sentiment Classification of Chinese Tourism Reviews Based on ERNIE-Gram+GCN |
title_fullStr | Sentiment Classification of Chinese Tourism Reviews Based on ERNIE-Gram+GCN |
title_full_unstemmed | Sentiment Classification of Chinese Tourism Reviews Based on ERNIE-Gram+GCN |
title_short | Sentiment Classification of Chinese Tourism Reviews Based on ERNIE-Gram+GCN |
title_sort | sentiment classification of chinese tourism reviews based on ernie-gram+gcn |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9602456/ https://www.ncbi.nlm.nih.gov/pubmed/36294096 http://dx.doi.org/10.3390/ijerph192013520 |
work_keys_str_mv | AT yangsenqi sentimentclassificationofchinesetourismreviewsbasedonerniegramgcn AT duanxuliang sentimentclassificationofchinesetourismreviewsbasedonerniegramgcn AT xiaozeyan sentimentclassificationofchinesetourismreviewsbasedonerniegramgcn AT lizhiyao sentimentclassificationofchinesetourismreviewsbasedonerniegramgcn AT liuyuhai sentimentclassificationofchinesetourismreviewsbasedonerniegramgcn AT jiezhihao sentimentclassificationofchinesetourismreviewsbasedonerniegramgcn AT tangdezhao sentimentclassificationofchinesetourismreviewsbasedonerniegramgcn AT duhui sentimentclassificationofchinesetourismreviewsbasedonerniegramgcn |