Leveraging aspect phrase embeddings for cross-domain review rating prediction
Online review platforms are a popular way for users to post reviews by expressing their opinions towards a product or service, and they are valuable for other users and companies to find out the overall opinions of customers. These reviews tend to be accompanied by a rating, where the star rating ha...
Main Authors: | Jiang, Aiqi; Zubiaga, Arkaitz |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | PeerJ Inc. 2019 |
Subjects: | Artificial Intelligence |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924723/ https://www.ncbi.nlm.nih.gov/pubmed/33816878 http://dx.doi.org/10.7717/peerj-cs.225 |
_version_ | 1783659149900382208 |
---|---|
author | Jiang, Aiqi Zubiaga, Arkaitz |
collection | PubMed |
description | Online review platforms are a popular way for users to post reviews by expressing their opinions towards a product or service, and they are valuable for other users and companies to find out the overall opinions of customers. These reviews tend to be accompanied by a rating, where the star rating has become the most common approach for users to give their feedback in a quantitative way, generally as a Likert scale of 1–5 stars. In other social media platforms like Facebook or Twitter, an automated review rating prediction system can be useful to determine the rating that a user would have given to the product or service. Existing work on review rating prediction focuses on specific domains, such as restaurants or hotels. This, however, ignores the fact that some review domains which are less frequently rated, such as dentists, lack sufficient data to build a reliable prediction model. In this paper, we experiment on 12 datasets pertaining to 12 different review domains of varying level of popularity to assess the performance of predictions across different domains. We introduce a model that leverages aspect phrase embeddings extracted from the reviews, which enables the development of both in-domain and cross-domain review rating prediction systems. Our experiments show that both of our review rating prediction systems outperform all other baselines. The cross-domain review rating prediction system is particularly significant for the least popular review domains, where leveraging training data from other domains leads to remarkable improvements in performance. The in-domain review rating prediction system is instead more suitable for popular review domains, provided that a model built from training data pertaining to the target domain is more suitable when this data is abundant. |
format | Online Article Text |
id | pubmed-7924723 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-7924723 2021-04-02 Leveraging aspect phrase embeddings for cross-domain review rating prediction Jiang, Aiqi Zubiaga, Arkaitz PeerJ Comput Sci Artificial Intelligence PeerJ Inc. 2019-10-07 /pmc/articles/PMC7924723/ /pubmed/33816878 http://dx.doi.org/10.7717/peerj-cs.225 Text en ©2019 Jiang and Zubiaga https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
title | Leveraging aspect phrase embeddings for cross-domain review rating prediction |
topic | Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924723/ https://www.ncbi.nlm.nih.gov/pubmed/33816878 http://dx.doi.org/10.7717/peerj-cs.225 |