Cargando…

Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias

Sentiment analysis is widely studied to extract opinions from user generated content (UGC), and various methods have been proposed in recent literature. However, these methods are likely to introduce sentiment bias, and the classification results tend to be positive or negative, especially for the l...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Hongyu, Zhang, Yongshi, Zhang, Jianpei, Yang, Jing, Zou, Xiaomei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6108458/
https://www.ncbi.nlm.nih.gov/pubmed/30142154
http://dx.doi.org/10.1371/journal.pone.0202523
_version_ 1783350146837250048
author Han, Hongyu
Zhang, Yongshi
Zhang, Jianpei
Yang, Jing
Zou, Xiaomei
author_facet Han, Hongyu
Zhang, Yongshi
Zhang, Jianpei
Yang, Jing
Zou, Xiaomei
author_sort Han, Hongyu
collection PubMed
description Sentiment analysis is widely studied to extract opinions from user generated content (UGC), and various methods have been proposed in recent literature. However, these methods are likely to introduce sentiment bias, and the classification results tend to be positive or negative, especially for the lexicon-based sentiment classification methods. The existence of sentiment bias leads to poor performance of sentiment analysis. To deal with this problem, we propose a novel sentiment bias processing strategy which can be applied to the lexicon-based sentiment analysis method. Weight and threshold parameters learned from a small training set are introduced into the lexicon-based sentiment scoring formula, and then the formula is used to classify the reviews. In this paper, a completed sentiment classification framework is proposed. SentiWordNet (SWN) is used as the experimental sentiment lexicon, and review data of four products collected from Amazon are used as the experimental datasets. Experimental results show that the bias processing strategy reduces polarity bias rate (PBR) and improves performance of the lexicon-based sentiment analysis method.
format Online
Article
Text
id pubmed-6108458
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-61084582018-09-18 Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias Han, Hongyu Zhang, Yongshi Zhang, Jianpei Yang, Jing Zou, Xiaomei PLoS One Research Article Sentiment analysis is widely studied to extract opinions from user generated content (UGC), and various methods have been proposed in recent literature. However, these methods are likely to introduce sentiment bias, and the classification results tend to be positive or negative, especially for the lexicon-based sentiment classification methods. The existence of sentiment bias leads to poor performance of sentiment analysis. To deal with this problem, we propose a novel sentiment bias processing strategy which can be applied to the lexicon-based sentiment analysis method. Weight and threshold parameters learned from a small training set are introduced into the lexicon-based sentiment scoring formula, and then the formula is used to classify the reviews. In this paper, a completed sentiment classification framework is proposed. SentiWordNet (SWN) is used as the experimental sentiment lexicon, and review data of four products collected from Amazon are used as the experimental datasets. Experimental results show that the bias processing strategy reduces polarity bias rate (PBR) and improves performance of the lexicon-based sentiment analysis method. Public Library of Science 2018-08-24 /pmc/articles/PMC6108458/ /pubmed/30142154 http://dx.doi.org/10.1371/journal.pone.0202523 Text en © 2018 Han et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Han, Hongyu
Zhang, Yongshi
Zhang, Jianpei
Yang, Jing
Zou, Xiaomei
Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias
title Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias
title_full Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias
title_fullStr Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias
title_full_unstemmed Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias
title_short Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias
title_sort improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6108458/
https://www.ncbi.nlm.nih.gov/pubmed/30142154
http://dx.doi.org/10.1371/journal.pone.0202523
work_keys_str_mv AT hanhongyu improvingtheperformanceoflexiconbasedreviewsentimentanalysismethodbyreducingadditionalintroducedsentimentbias
AT zhangyongshi improvingtheperformanceoflexiconbasedreviewsentimentanalysismethodbyreducingadditionalintroducedsentimentbias
AT zhangjianpei improvingtheperformanceoflexiconbasedreviewsentimentanalysismethodbyreducingadditionalintroducedsentimentbias
AT yangjing improvingtheperformanceoflexiconbasedreviewsentimentanalysismethodbyreducingadditionalintroducedsentimentbias
AT zouxiaomei improvingtheperformanceoflexiconbasedreviewsentimentanalysismethodbyreducingadditionalintroducedsentimentbias