Cargando…

A supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas

Due to the massive progression of the Web, people post their reviews for any product, movies and places they visit on social media. The reviews available on social media are helpful to customers as well as the product owners to evaluate their products based on different reviews. Analyzing structured...

Descripción completa

Detalles Bibliográficos
Autores principales: Bhamare, Bhavana R., Prabhu, Jeyanthi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959606/
https://www.ncbi.nlm.nih.gov/pubmed/33816997
http://dx.doi.org/10.7717/peerj-cs.347
_version_ 1783664986046857216
author Bhamare, Bhavana R.
Prabhu, Jeyanthi
author_facet Bhamare, Bhavana R.
Prabhu, Jeyanthi
author_sort Bhamare, Bhavana R.
collection PubMed
description Due to the massive progression of the Web, people post their reviews for any product, movies and places they visit on social media. The reviews available on social media are helpful to customers as well as the product owners to evaluate their products based on different reviews. Analyzing structured data is easy as compared to unstructured data. The reviews are available in an unstructured format. Aspect-Based Sentiment Analysis mines the aspects of a product from the reviews and further determines sentiment for each aspect. In this work, two methods for aspect extraction are proposed. The datasets used for this work are SemEval restaurant review dataset, Yelp and Kaggle datasets. In the first method a multivariate filter-based approach for feature selection is proposed. This method support to select significant features and reduces redundancy among selected features. It shows improvement in F1-score compared to a method that uses only relevant features selected using Term Frequency weight. In another method, selective dependency relations are used to extract features. This is done using Stanford NLP parser. The results gained using features extracted by selective dependency rules are better as compared to features extracted by using all dependency rules. In the hybrid approach, both lemma features and selective dependency relation based features are extracted. Using the hybrid feature set, 94.78% accuracy and 85.24% F1-score is achieved in the aspect category prediction task.
format Online
Article
Text
id pubmed-7959606
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-79596062021-04-02 A supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas Bhamare, Bhavana R. Prabhu, Jeyanthi PeerJ Comput Sci Artificial Intelligence Due to the massive progression of the Web, people post their reviews for any product, movies and places they visit on social media. The reviews available on social media are helpful to customers as well as the product owners to evaluate their products based on different reviews. Analyzing structured data is easy as compared to unstructured data. The reviews are available in an unstructured format. Aspect-Based Sentiment Analysis mines the aspects of a product from the reviews and further determines sentiment for each aspect. In this work, two methods for aspect extraction are proposed. The datasets used for this work are SemEval restaurant review dataset, Yelp and Kaggle datasets. In the first method a multivariate filter-based approach for feature selection is proposed. This method support to select significant features and reduces redundancy among selected features. It shows improvement in F1-score compared to a method that uses only relevant features selected using Term Frequency weight. In another method, selective dependency relations are used to extract features. This is done using Stanford NLP parser. The results gained using features extracted by selective dependency rules are better as compared to features extracted by using all dependency rules. In the hybrid approach, both lemma features and selective dependency relation based features are extracted. Using the hybrid feature set, 94.78% accuracy and 85.24% F1-score is achieved in the aspect category prediction task. PeerJ Inc. 2021-02-05 /pmc/articles/PMC7959606/ /pubmed/33816997 http://dx.doi.org/10.7717/peerj-cs.347 Text en © 2021 Bhamare and Prabhu https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Artificial Intelligence
Bhamare, Bhavana R.
Prabhu, Jeyanthi
A supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas
title A supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas
title_full A supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas
title_fullStr A supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas
title_full_unstemmed A supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas
title_short A supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas
title_sort supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959606/
https://www.ncbi.nlm.nih.gov/pubmed/33816997
http://dx.doi.org/10.7717/peerj-cs.347
work_keys_str_mv AT bhamarebhavanar asupervisedschemeforaspectextractioninsentimentanalysisusingthehybridfeaturesetofworddependencyrelationsandlemmas
AT prabhujeyanthi asupervisedschemeforaspectextractioninsentimentanalysisusingthehybridfeaturesetofworddependencyrelationsandlemmas
AT bhamarebhavanar supervisedschemeforaspectextractioninsentimentanalysisusingthehybridfeaturesetofworddependencyrelationsandlemmas
AT prabhujeyanthi supervisedschemeforaspectextractioninsentimentanalysisusingthehybridfeaturesetofworddependencyrelationsandlemmas