Cargando…
Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias
Sentiment analysis is widely studied to extract opinions from user generated content (UGC), and various methods have been proposed in recent literature. However, these methods are likely to introduce sentiment bias, and the classification results tend to be positive or negative, especially for the l...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6108458/ https://www.ncbi.nlm.nih.gov/pubmed/30142154 http://dx.doi.org/10.1371/journal.pone.0202523 |
_version_ | 1783350146837250048 |
---|---|
author | Han, Hongyu Zhang, Yongshi Zhang, Jianpei Yang, Jing Zou, Xiaomei |
author_facet | Han, Hongyu Zhang, Yongshi Zhang, Jianpei Yang, Jing Zou, Xiaomei |
author_sort | Han, Hongyu |
collection | PubMed |
description | Sentiment analysis is widely studied to extract opinions from user generated content (UGC), and various methods have been proposed in recent literature. However, these methods are likely to introduce sentiment bias, and the classification results tend to be positive or negative, especially for the lexicon-based sentiment classification methods. The existence of sentiment bias leads to poor performance of sentiment analysis. To deal with this problem, we propose a novel sentiment bias processing strategy which can be applied to the lexicon-based sentiment analysis method. Weight and threshold parameters learned from a small training set are introduced into the lexicon-based sentiment scoring formula, and then the formula is used to classify the reviews. In this paper, a completed sentiment classification framework is proposed. SentiWordNet (SWN) is used as the experimental sentiment lexicon, and review data of four products collected from Amazon are used as the experimental datasets. Experimental results show that the bias processing strategy reduces polarity bias rate (PBR) and improves performance of the lexicon-based sentiment analysis method. |
format | Online Article Text |
id | pubmed-6108458 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-61084582018-09-18 Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias Han, Hongyu Zhang, Yongshi Zhang, Jianpei Yang, Jing Zou, Xiaomei PLoS One Research Article Sentiment analysis is widely studied to extract opinions from user generated content (UGC), and various methods have been proposed in recent literature. However, these methods are likely to introduce sentiment bias, and the classification results tend to be positive or negative, especially for the lexicon-based sentiment classification methods. The existence of sentiment bias leads to poor performance of sentiment analysis. To deal with this problem, we propose a novel sentiment bias processing strategy which can be applied to the lexicon-based sentiment analysis method. Weight and threshold parameters learned from a small training set are introduced into the lexicon-based sentiment scoring formula, and then the formula is used to classify the reviews. In this paper, a completed sentiment classification framework is proposed. SentiWordNet (SWN) is used as the experimental sentiment lexicon, and review data of four products collected from Amazon are used as the experimental datasets. Experimental results show that the bias processing strategy reduces polarity bias rate (PBR) and improves performance of the lexicon-based sentiment analysis method. Public Library of Science 2018-08-24 /pmc/articles/PMC6108458/ /pubmed/30142154 http://dx.doi.org/10.1371/journal.pone.0202523 Text en © 2018 Han et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Han, Hongyu Zhang, Yongshi Zhang, Jianpei Yang, Jing Zou, Xiaomei Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias |
title | Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias |
title_full | Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias |
title_fullStr | Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias |
title_full_unstemmed | Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias |
title_short | Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias |
title_sort | improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6108458/ https://www.ncbi.nlm.nih.gov/pubmed/30142154 http://dx.doi.org/10.1371/journal.pone.0202523 |
work_keys_str_mv | AT hanhongyu improvingtheperformanceoflexiconbasedreviewsentimentanalysismethodbyreducingadditionalintroducedsentimentbias AT zhangyongshi improvingtheperformanceoflexiconbasedreviewsentimentanalysismethodbyreducingadditionalintroducedsentimentbias AT zhangjianpei improvingtheperformanceoflexiconbasedreviewsentimentanalysismethodbyreducingadditionalintroducedsentimentbias AT yangjing improvingtheperformanceoflexiconbasedreviewsentimentanalysismethodbyreducingadditionalintroducedsentimentbias AT zouxiaomei improvingtheperformanceoflexiconbasedreviewsentimentanalysismethodbyreducingadditionalintroducedsentimentbias |