Cargando…

Sentiment Classification of Chinese Tourism Reviews Based on ERNIE-Gram+GCN

Nowadays, tourists increasingly prefer to check the reviews of attractions before traveling to decide whether to visit them or not. To respond to the change in the way tourists choose attractions, it is important to classify the reviews of attractions with high precision. In addition, more and more...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Senqi, Duan, Xuliang, Xiao, Zeyan, Li, Zhiyao, Liu, Yuhai, Jie, Zhihao, Tang, Dezhao, Du, Hui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9602456/
https://www.ncbi.nlm.nih.gov/pubmed/36294096
http://dx.doi.org/10.3390/ijerph192013520
_version_ 1784817322023714816
author Yang, Senqi
Duan, Xuliang
Xiao, Zeyan
Li, Zhiyao
Liu, Yuhai
Jie, Zhihao
Tang, Dezhao
Du, Hui
author_facet Yang, Senqi
Duan, Xuliang
Xiao, Zeyan
Li, Zhiyao
Liu, Yuhai
Jie, Zhihao
Tang, Dezhao
Du, Hui
author_sort Yang, Senqi
collection PubMed
description Nowadays, tourists increasingly prefer to check the reviews of attractions before traveling to decide whether to visit them or not. To respond to the change in the way tourists choose attractions, it is important to classify the reviews of attractions with high precision. In addition, more and more tourists like to use emojis to express their satisfaction or dissatisfaction with the attractions. In this paper, we built a dataset for Chinese attraction evaluation incorporating emojis (CAEIE) and proposed an explicitly n-gram masking method to enhance the integration of coarse-grained information into a pre-training (ERNIE-Gram) and Text Graph Convolutional Network (textGCN) (E2G) model to classify the dataset with a high accuracy. The E2G preprocesses the text and feeds it to ERNIE-Gram and TextGCN. ERNIE-Gram was trained using its unique mask mechanism to obtain the final probabilities. TextGCN used the dataset to construct heterogeneous graphs with comment text and words, which were trained to obtain a representation of the document output category probabilities. The two probabilities were calculated to obtain the final results. To demonstrate the validity of the E2G model, this paper was compared with advanced models. After experiments, it was shown that E2G had a good classification effect on the CAEIE dataset, and the accuracy of classification was up to 97.37%. Furthermore, the accuracy of E2G was 1.37% and 1.35% ahead of ERNIE-Gram and TextGCN, respectively. In addition, two sets of comparison experiments were conducted to verify the performance of TextGCN and TextGAT on the CAEIE dataset. The final results showed that ERNIE and ERNIE-Gram combined TextGCN and TextGAT, respectively, and TextGCN performed 1.6% and 2.15% ahead. This paper compared the effects of eight activation functions on the second layer of the TextGCN and the activation-function-rectified linear unit 6 (RELU6) with the best results based on experiments.
format Online
Article
Text
id pubmed-9602456
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96024562022-10-27 Sentiment Classification of Chinese Tourism Reviews Based on ERNIE-Gram+GCN Yang, Senqi Duan, Xuliang Xiao, Zeyan Li, Zhiyao Liu, Yuhai Jie, Zhihao Tang, Dezhao Du, Hui Int J Environ Res Public Health Article Nowadays, tourists increasingly prefer to check the reviews of attractions before traveling to decide whether to visit them or not. To respond to the change in the way tourists choose attractions, it is important to classify the reviews of attractions with high precision. In addition, more and more tourists like to use emojis to express their satisfaction or dissatisfaction with the attractions. In this paper, we built a dataset for Chinese attraction evaluation incorporating emojis (CAEIE) and proposed an explicitly n-gram masking method to enhance the integration of coarse-grained information into a pre-training (ERNIE-Gram) and Text Graph Convolutional Network (textGCN) (E2G) model to classify the dataset with a high accuracy. The E2G preprocesses the text and feeds it to ERNIE-Gram and TextGCN. ERNIE-Gram was trained using its unique mask mechanism to obtain the final probabilities. TextGCN used the dataset to construct heterogeneous graphs with comment text and words, which were trained to obtain a representation of the document output category probabilities. The two probabilities were calculated to obtain the final results. To demonstrate the validity of the E2G model, this paper was compared with advanced models. After experiments, it was shown that E2G had a good classification effect on the CAEIE dataset, and the accuracy of classification was up to 97.37%. Furthermore, the accuracy of E2G was 1.37% and 1.35% ahead of ERNIE-Gram and TextGCN, respectively. In addition, two sets of comparison experiments were conducted to verify the performance of TextGCN and TextGAT on the CAEIE dataset. The final results showed that ERNIE and ERNIE-Gram combined TextGCN and TextGAT, respectively, and TextGCN performed 1.6% and 2.15% ahead. This paper compared the effects of eight activation functions on the second layer of the TextGCN and the activation-function-rectified linear unit 6 (RELU6) with the best results based on experiments. MDPI 2022-10-19 /pmc/articles/PMC9602456/ /pubmed/36294096 http://dx.doi.org/10.3390/ijerph192013520 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Yang, Senqi
Duan, Xuliang
Xiao, Zeyan
Li, Zhiyao
Liu, Yuhai
Jie, Zhihao
Tang, Dezhao
Du, Hui
Sentiment Classification of Chinese Tourism Reviews Based on ERNIE-Gram+GCN
title Sentiment Classification of Chinese Tourism Reviews Based on ERNIE-Gram+GCN
title_full Sentiment Classification of Chinese Tourism Reviews Based on ERNIE-Gram+GCN
title_fullStr Sentiment Classification of Chinese Tourism Reviews Based on ERNIE-Gram+GCN
title_full_unstemmed Sentiment Classification of Chinese Tourism Reviews Based on ERNIE-Gram+GCN
title_short Sentiment Classification of Chinese Tourism Reviews Based on ERNIE-Gram+GCN
title_sort sentiment classification of chinese tourism reviews based on ernie-gram+gcn
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9602456/
https://www.ncbi.nlm.nih.gov/pubmed/36294096
http://dx.doi.org/10.3390/ijerph192013520
work_keys_str_mv AT yangsenqi sentimentclassificationofchinesetourismreviewsbasedonerniegramgcn
AT duanxuliang sentimentclassificationofchinesetourismreviewsbasedonerniegramgcn
AT xiaozeyan sentimentclassificationofchinesetourismreviewsbasedonerniegramgcn
AT lizhiyao sentimentclassificationofchinesetourismreviewsbasedonerniegramgcn
AT liuyuhai sentimentclassificationofchinesetourismreviewsbasedonerniegramgcn
AT jiezhihao sentimentclassificationofchinesetourismreviewsbasedonerniegramgcn
AT tangdezhao sentimentclassificationofchinesetourismreviewsbasedonerniegramgcn
AT duhui sentimentclassificationofchinesetourismreviewsbasedonerniegramgcn