Cargando…
A supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas
Due to the massive progression of the Web, people post their reviews for any product, movies and places they visit on social media. The reviews available on social media are helpful to customers as well as the product owners to evaluate their products based on different reviews. Analyzing structured...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959606/ https://www.ncbi.nlm.nih.gov/pubmed/33816997 http://dx.doi.org/10.7717/peerj-cs.347 |
_version_ | 1783664986046857216 |
---|---|
author | Bhamare, Bhavana R. Prabhu, Jeyanthi |
author_facet | Bhamare, Bhavana R. Prabhu, Jeyanthi |
author_sort | Bhamare, Bhavana R. |
collection | PubMed |
description | Due to the massive progression of the Web, people post their reviews for any product, movies and places they visit on social media. The reviews available on social media are helpful to customers as well as the product owners to evaluate their products based on different reviews. Analyzing structured data is easy as compared to unstructured data. The reviews are available in an unstructured format. Aspect-Based Sentiment Analysis mines the aspects of a product from the reviews and further determines sentiment for each aspect. In this work, two methods for aspect extraction are proposed. The datasets used for this work are SemEval restaurant review dataset, Yelp and Kaggle datasets. In the first method a multivariate filter-based approach for feature selection is proposed. This method support to select significant features and reduces redundancy among selected features. It shows improvement in F1-score compared to a method that uses only relevant features selected using Term Frequency weight. In another method, selective dependency relations are used to extract features. This is done using Stanford NLP parser. The results gained using features extracted by selective dependency rules are better as compared to features extracted by using all dependency rules. In the hybrid approach, both lemma features and selective dependency relation based features are extracted. Using the hybrid feature set, 94.78% accuracy and 85.24% F1-score is achieved in the aspect category prediction task. |
format | Online Article Text |
id | pubmed-7959606 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-79596062021-04-02 A supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas Bhamare, Bhavana R. Prabhu, Jeyanthi PeerJ Comput Sci Artificial Intelligence Due to the massive progression of the Web, people post their reviews for any product, movies and places they visit on social media. The reviews available on social media are helpful to customers as well as the product owners to evaluate their products based on different reviews. Analyzing structured data is easy as compared to unstructured data. The reviews are available in an unstructured format. Aspect-Based Sentiment Analysis mines the aspects of a product from the reviews and further determines sentiment for each aspect. In this work, two methods for aspect extraction are proposed. The datasets used for this work are SemEval restaurant review dataset, Yelp and Kaggle datasets. In the first method a multivariate filter-based approach for feature selection is proposed. This method support to select significant features and reduces redundancy among selected features. It shows improvement in F1-score compared to a method that uses only relevant features selected using Term Frequency weight. In another method, selective dependency relations are used to extract features. This is done using Stanford NLP parser. The results gained using features extracted by selective dependency rules are better as compared to features extracted by using all dependency rules. In the hybrid approach, both lemma features and selective dependency relation based features are extracted. Using the hybrid feature set, 94.78% accuracy and 85.24% F1-score is achieved in the aspect category prediction task. PeerJ Inc. 2021-02-05 /pmc/articles/PMC7959606/ /pubmed/33816997 http://dx.doi.org/10.7717/peerj-cs.347 Text en © 2021 Bhamare and Prabhu https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Artificial Intelligence Bhamare, Bhavana R. Prabhu, Jeyanthi A supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas |
title | A supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas |
title_full | A supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas |
title_fullStr | A supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas |
title_full_unstemmed | A supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas |
title_short | A supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas |
title_sort | supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas |
topic | Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959606/ https://www.ncbi.nlm.nih.gov/pubmed/33816997 http://dx.doi.org/10.7717/peerj-cs.347 |
work_keys_str_mv | AT bhamarebhavanar asupervisedschemeforaspectextractioninsentimentanalysisusingthehybridfeaturesetofworddependencyrelationsandlemmas AT prabhujeyanthi asupervisedschemeforaspectextractioninsentimentanalysisusingthehybridfeaturesetofworddependencyrelationsandlemmas AT bhamarebhavanar supervisedschemeforaspectextractioninsentimentanalysisusingthehybridfeaturesetofworddependencyrelationsandlemmas AT prabhujeyanthi supervisedschemeforaspectextractioninsentimentanalysisusingthehybridfeaturesetofworddependencyrelationsandlemmas |