Cargando…

Modelling sentiments based on objectivity and subjectivity with self-attention mechanisms

Background : The proliferation of digital commerce has allowed merchants to reach out to a wider customer base, prompting a study of customer reviews to gauge service and product quality through sentiment analysis. Sentiment analysis can be enhanced through subjectivity and objectivity classificatio...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ng, Hu, Chia, Glenn Jun Weng, Yap, Timothy Tzen Vun, Goh, Vik Tor
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	F1000 Research Limited 2022
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9130759/ https://www.ncbi.nlm.nih.gov/pubmed/35646327 http://dx.doi.org/10.12688/f1000research.73131.2

_version_	1784713041058725888
author	Ng, Hu Chia, Glenn Jun Weng Yap, Timothy Tzen Vun Goh, Vik Tor
author_facet	Ng, Hu Chia, Glenn Jun Weng Yap, Timothy Tzen Vun Goh, Vik Tor
author_sort	Ng, Hu
collection	PubMed
description	Background : The proliferation of digital commerce has allowed merchants to reach out to a wider customer base, prompting a study of customer reviews to gauge service and product quality through sentiment analysis. Sentiment analysis can be enhanced through subjectivity and objectivity classification with attention mechanisms. Methods: This research includes input corpora of contrasting levels of subjectivity and objectivity from different databases to perform sentiment analysis on user reviews, incorporating attention mechanisms at the aspect level. Three large corpora are chosen as the subjectivity and objectivity datasets, the Shopee user review dataset (ShopeeRD) for subjectivity, together with the Wikipedia English dataset (Wiki-en) and Internet Movie Database (IMDb) for objectivity. Word embeddings are created using Word2Vec with Skip-Gram. Then, a bidirectional LSTM with an attention layer (LSTM-ATT) imposed on word vectors. The performance of the model is evaluated and benchmarked against classification models of Logistics Regression (LR) and Linear SVC (L-SVC). Three models are trained with subjectivity (70% of ShopeeRD) and the objectivity (Wiki-en) embeddings, with ten-fold cross-validation. Next, the three models are evaluated against two datasets (IMDb and 20% of ShopeeRD). The experiments are based on benchmark comparisons, embedding comparison and model comparison with 70-10-20 train-validation-test splits. Data augmentation using AUG-BERT is performed and selected models incorporating AUG-BERT, are compared. Results: L-SVC scored the highest accuracy with 56.9% for objective embeddings (Wiki-en) while the LSTM-ATT scored 69.0% on subjective embeddings (ShopeeRD). Improved performances were observed with data augmentation using AUG-BERT, where the LSTM-ATT+AUG-BERT model scored the highest accuracy at 60.0% for objective embeddings and 70.0% for subjective embeddings, compared to 57% (objective) and 69% (subjective) for L-SVC+AUG-BERT, and 56% (objective) and 68% (subjective) for L-SVC. Conclusions: Utilizing attention layers with subjectivity and objectivity notions has shown improvement to the accuracy of sentiment analysis models.
format	Online Article Text
id	pubmed-9130759
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	F1000 Research Limited
record_format	MEDLINE/PubMed
spelling	pubmed-91307592022-05-27 Modelling sentiments based on objectivity and subjectivity with self-attention mechanisms Ng, Hu Chia, Glenn Jun Weng Yap, Timothy Tzen Vun Goh, Vik Tor F1000Res Research Article Background : The proliferation of digital commerce has allowed merchants to reach out to a wider customer base, prompting a study of customer reviews to gauge service and product quality through sentiment analysis. Sentiment analysis can be enhanced through subjectivity and objectivity classification with attention mechanisms. Methods: This research includes input corpora of contrasting levels of subjectivity and objectivity from different databases to perform sentiment analysis on user reviews, incorporating attention mechanisms at the aspect level. Three large corpora are chosen as the subjectivity and objectivity datasets, the Shopee user review dataset (ShopeeRD) for subjectivity, together with the Wikipedia English dataset (Wiki-en) and Internet Movie Database (IMDb) for objectivity. Word embeddings are created using Word2Vec with Skip-Gram. Then, a bidirectional LSTM with an attention layer (LSTM-ATT) imposed on word vectors. The performance of the model is evaluated and benchmarked against classification models of Logistics Regression (LR) and Linear SVC (L-SVC). Three models are trained with subjectivity (70% of ShopeeRD) and the objectivity (Wiki-en) embeddings, with ten-fold cross-validation. Next, the three models are evaluated against two datasets (IMDb and 20% of ShopeeRD). The experiments are based on benchmark comparisons, embedding comparison and model comparison with 70-10-20 train-validation-test splits. Data augmentation using AUG-BERT is performed and selected models incorporating AUG-BERT, are compared. Results: L-SVC scored the highest accuracy with 56.9% for objective embeddings (Wiki-en) while the LSTM-ATT scored 69.0% on subjective embeddings (ShopeeRD). Improved performances were observed with data augmentation using AUG-BERT, where the LSTM-ATT+AUG-BERT model scored the highest accuracy at 60.0% for objective embeddings and 70.0% for subjective embeddings, compared to 57% (objective) and 69% (subjective) for L-SVC+AUG-BERT, and 56% (objective) and 68% (subjective) for L-SVC. Conclusions: Utilizing attention layers with subjectivity and objectivity notions has shown improvement to the accuracy of sentiment analysis models. F1000 Research Limited 2022-05-17 /pmc/articles/PMC9130759/ /pubmed/35646327 http://dx.doi.org/10.12688/f1000research.73131.2 Text en Copyright: © 2022 Ng H et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Ng, Hu Chia, Glenn Jun Weng Yap, Timothy Tzen Vun Goh, Vik Tor Modelling sentiments based on objectivity and subjectivity with self-attention mechanisms
title	Modelling sentiments based on objectivity and subjectivity with self-attention mechanisms
title_full	Modelling sentiments based on objectivity and subjectivity with self-attention mechanisms
title_fullStr	Modelling sentiments based on objectivity and subjectivity with self-attention mechanisms
title_full_unstemmed	Modelling sentiments based on objectivity and subjectivity with self-attention mechanisms
title_short	Modelling sentiments based on objectivity and subjectivity with self-attention mechanisms
title_sort	modelling sentiments based on objectivity and subjectivity with self-attention mechanisms
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9130759/ https://www.ncbi.nlm.nih.gov/pubmed/35646327 http://dx.doi.org/10.12688/f1000research.73131.2
work_keys_str_mv	AT nghu modellingsentimentsbasedonobjectivityandsubjectivitywithselfattentionmechanisms AT chiaglennjunweng modellingsentimentsbasedonobjectivityandsubjectivitywithselfattentionmechanisms AT yaptimothytzenvun modellingsentimentsbasedonobjectivityandsubjectivitywithselfattentionmechanisms AT gohviktor modellingsentimentsbasedonobjectivityandsubjectivitywithselfattentionmechanisms

Modelling sentiments based on objectivity and subjectivity with self-attention mechanisms

Ejemplares similares