Cargando…

Modelling sentiments based on objectivity and subjectivity with self-attention mechanisms

Background : The proliferation of digital commerce has allowed merchants to reach out to a wider customer base, prompting a study of customer reviews to gauge service and product quality through sentiment analysis. Sentiment analysis can be enhanced through subjectivity and objectivity classificatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Ng, Hu, Chia, Glenn Jun Weng, Yap, Timothy Tzen Vun, Goh, Vik Tor
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9130759/
https://www.ncbi.nlm.nih.gov/pubmed/35646327
http://dx.doi.org/10.12688/f1000research.73131.2
_version_ 1784713041058725888
author Ng, Hu
Chia, Glenn Jun Weng
Yap, Timothy Tzen Vun
Goh, Vik Tor
author_facet Ng, Hu
Chia, Glenn Jun Weng
Yap, Timothy Tzen Vun
Goh, Vik Tor
author_sort Ng, Hu
collection PubMed
description Background : The proliferation of digital commerce has allowed merchants to reach out to a wider customer base, prompting a study of customer reviews to gauge service and product quality through sentiment analysis. Sentiment analysis can be enhanced through subjectivity and objectivity classification with attention mechanisms. Methods: This research includes input corpora of contrasting levels of subjectivity and objectivity from different databases to perform sentiment analysis on user reviews, incorporating attention mechanisms at the aspect level. Three large corpora are chosen as the subjectivity and objectivity datasets, the Shopee user review dataset (ShopeeRD) for subjectivity, together with the Wikipedia English dataset (Wiki-en) and Internet Movie Database (IMDb) for objectivity. Word embeddings are created using Word2Vec with Skip-Gram. Then, a bidirectional LSTM with an attention layer (LSTM-ATT) imposed on word vectors. The performance of the model is evaluated and benchmarked against classification models of Logistics Regression (LR) and Linear SVC (L-SVC). Three models are trained with subjectivity (70% of ShopeeRD) and the objectivity (Wiki-en) embeddings, with ten-fold cross-validation. Next, the three models are evaluated against two datasets (IMDb and 20% of ShopeeRD). The experiments are based on benchmark comparisons, embedding comparison and model comparison with 70-10-20 train-validation-test splits. Data augmentation using AUG-BERT is performed and selected models incorporating AUG-BERT, are compared. Results: L-SVC scored the highest accuracy with 56.9% for objective embeddings (Wiki-en) while the LSTM-ATT scored 69.0% on subjective embeddings (ShopeeRD).  Improved performances were observed with data augmentation using AUG-BERT, where the LSTM-ATT+AUG-BERT model scored the highest accuracy at 60.0% for objective embeddings and 70.0% for subjective embeddings, compared to 57% (objective) and 69% (subjective) for L-SVC+AUG-BERT, and 56% (objective) and 68% (subjective) for L-SVC. Conclusions: Utilizing attention layers with subjectivity and objectivity notions has shown improvement to the accuracy of sentiment analysis models.
format Online
Article
Text
id pubmed-9130759
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-91307592022-05-27 Modelling sentiments based on objectivity and subjectivity with self-attention mechanisms Ng, Hu Chia, Glenn Jun Weng Yap, Timothy Tzen Vun Goh, Vik Tor F1000Res Research Article Background : The proliferation of digital commerce has allowed merchants to reach out to a wider customer base, prompting a study of customer reviews to gauge service and product quality through sentiment analysis. Sentiment analysis can be enhanced through subjectivity and objectivity classification with attention mechanisms. Methods: This research includes input corpora of contrasting levels of subjectivity and objectivity from different databases to perform sentiment analysis on user reviews, incorporating attention mechanisms at the aspect level. Three large corpora are chosen as the subjectivity and objectivity datasets, the Shopee user review dataset (ShopeeRD) for subjectivity, together with the Wikipedia English dataset (Wiki-en) and Internet Movie Database (IMDb) for objectivity. Word embeddings are created using Word2Vec with Skip-Gram. Then, a bidirectional LSTM with an attention layer (LSTM-ATT) imposed on word vectors. The performance of the model is evaluated and benchmarked against classification models of Logistics Regression (LR) and Linear SVC (L-SVC). Three models are trained with subjectivity (70% of ShopeeRD) and the objectivity (Wiki-en) embeddings, with ten-fold cross-validation. Next, the three models are evaluated against two datasets (IMDb and 20% of ShopeeRD). The experiments are based on benchmark comparisons, embedding comparison and model comparison with 70-10-20 train-validation-test splits. Data augmentation using AUG-BERT is performed and selected models incorporating AUG-BERT, are compared. Results: L-SVC scored the highest accuracy with 56.9% for objective embeddings (Wiki-en) while the LSTM-ATT scored 69.0% on subjective embeddings (ShopeeRD).  Improved performances were observed with data augmentation using AUG-BERT, where the LSTM-ATT+AUG-BERT model scored the highest accuracy at 60.0% for objective embeddings and 70.0% for subjective embeddings, compared to 57% (objective) and 69% (subjective) for L-SVC+AUG-BERT, and 56% (objective) and 68% (subjective) for L-SVC. Conclusions: Utilizing attention layers with subjectivity and objectivity notions has shown improvement to the accuracy of sentiment analysis models. F1000 Research Limited 2022-05-17 /pmc/articles/PMC9130759/ /pubmed/35646327 http://dx.doi.org/10.12688/f1000research.73131.2 Text en Copyright: © 2022 Ng H et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Ng, Hu
Chia, Glenn Jun Weng
Yap, Timothy Tzen Vun
Goh, Vik Tor
Modelling sentiments based on objectivity and subjectivity with self-attention mechanisms
title Modelling sentiments based on objectivity and subjectivity with self-attention mechanisms
title_full Modelling sentiments based on objectivity and subjectivity with self-attention mechanisms
title_fullStr Modelling sentiments based on objectivity and subjectivity with self-attention mechanisms
title_full_unstemmed Modelling sentiments based on objectivity and subjectivity with self-attention mechanisms
title_short Modelling sentiments based on objectivity and subjectivity with self-attention mechanisms
title_sort modelling sentiments based on objectivity and subjectivity with self-attention mechanisms
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9130759/
https://www.ncbi.nlm.nih.gov/pubmed/35646327
http://dx.doi.org/10.12688/f1000research.73131.2
work_keys_str_mv AT nghu modellingsentimentsbasedonobjectivityandsubjectivitywithselfattentionmechanisms
AT chiaglennjunweng modellingsentimentsbasedonobjectivityandsubjectivitywithselfattentionmechanisms
AT yaptimothytzenvun modellingsentimentsbasedonobjectivityandsubjectivitywithselfattentionmechanisms
AT gohviktor modellingsentimentsbasedonobjectivityandsubjectivitywithselfattentionmechanisms