Cargando…

Detecting opinion spams through supervised boosting approach

Product reviews are the individual’s opinions, judgement or belief about a certain product or service provided by certain companies. Such reviews serve as guides for these companies to plan and monitor their business ventures in terms of increasing productivity or enhancing their product/service qua...

Descripción completa

Detalles Bibliográficos
Autores principales: Hazim, Mohamad, Anuar, Nor Badrul, Ab Razak, Mohd Faizal, Abdullah, Nor Aniza
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5995425/
https://www.ncbi.nlm.nih.gov/pubmed/29889897
http://dx.doi.org/10.1371/journal.pone.0198884
_version_ 1783330619315453952
author Hazim, Mohamad
Anuar, Nor Badrul
Ab Razak, Mohd Faizal
Abdullah, Nor Aniza
author_facet Hazim, Mohamad
Anuar, Nor Badrul
Ab Razak, Mohd Faizal
Abdullah, Nor Aniza
author_sort Hazim, Mohamad
collection PubMed
description Product reviews are the individual’s opinions, judgement or belief about a certain product or service provided by certain companies. Such reviews serve as guides for these companies to plan and monitor their business ventures in terms of increasing productivity or enhancing their product/service qualities. Product reviews can also increase business profits by convincing future customers about the products which they have interest in. In the mobile application marketplace such as Google Playstore, reviews and star ratings are used as indicators of the application quality. However, among all these reviews, hereby also known as opinions, spams also exist, to disrupt the online business balance. Previous studies used the time series and neural network approach (which require a lot of computational power) to detect these opinion spams. However, the detection performance can be restricted in terms of accuracy because the approach focusses on basic, discrete and document level features only thereby, projecting little statistical relationships. Aiming to improve the detection of opinion spams in mobile application marketplace, this study proposes using statistical based features that are modelled through the supervised boosting approach such as the Extreme Gradient Boost (XGBoost) and the Generalized Boosted Regression Model (GBM) to evaluate two multilingual datasets (i.e. English and Malay language). From the evaluation done, it was found that the XGBoost is most suitable for detecting opinion spams in the English dataset while the GBM Gaussian is most suitable for the Malay dataset. The comparative analysis also indicates that the implementation of the proposed statistical based features had achieved a detection accuracy rate of 87.43 per cent on the English dataset and 86.13 per cent on the Malay dataset.
format Online
Article
Text
id pubmed-5995425
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-59954252018-06-21 Detecting opinion spams through supervised boosting approach Hazim, Mohamad Anuar, Nor Badrul Ab Razak, Mohd Faizal Abdullah, Nor Aniza PLoS One Research Article Product reviews are the individual’s opinions, judgement or belief about a certain product or service provided by certain companies. Such reviews serve as guides for these companies to plan and monitor their business ventures in terms of increasing productivity or enhancing their product/service qualities. Product reviews can also increase business profits by convincing future customers about the products which they have interest in. In the mobile application marketplace such as Google Playstore, reviews and star ratings are used as indicators of the application quality. However, among all these reviews, hereby also known as opinions, spams also exist, to disrupt the online business balance. Previous studies used the time series and neural network approach (which require a lot of computational power) to detect these opinion spams. However, the detection performance can be restricted in terms of accuracy because the approach focusses on basic, discrete and document level features only thereby, projecting little statistical relationships. Aiming to improve the detection of opinion spams in mobile application marketplace, this study proposes using statistical based features that are modelled through the supervised boosting approach such as the Extreme Gradient Boost (XGBoost) and the Generalized Boosted Regression Model (GBM) to evaluate two multilingual datasets (i.e. English and Malay language). From the evaluation done, it was found that the XGBoost is most suitable for detecting opinion spams in the English dataset while the GBM Gaussian is most suitable for the Malay dataset. The comparative analysis also indicates that the implementation of the proposed statistical based features had achieved a detection accuracy rate of 87.43 per cent on the English dataset and 86.13 per cent on the Malay dataset. Public Library of Science 2018-06-11 /pmc/articles/PMC5995425/ /pubmed/29889897 http://dx.doi.org/10.1371/journal.pone.0198884 Text en © 2018 Hazim et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Hazim, Mohamad
Anuar, Nor Badrul
Ab Razak, Mohd Faizal
Abdullah, Nor Aniza
Detecting opinion spams through supervised boosting approach
title Detecting opinion spams through supervised boosting approach
title_full Detecting opinion spams through supervised boosting approach
title_fullStr Detecting opinion spams through supervised boosting approach
title_full_unstemmed Detecting opinion spams through supervised boosting approach
title_short Detecting opinion spams through supervised boosting approach
title_sort detecting opinion spams through supervised boosting approach
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5995425/
https://www.ncbi.nlm.nih.gov/pubmed/29889897
http://dx.doi.org/10.1371/journal.pone.0198884
work_keys_str_mv AT hazimmohamad detectingopinionspamsthroughsupervisedboostingapproach
AT anuarnorbadrul detectingopinionspamsthroughsupervisedboostingapproach
AT abrazakmohdfaizal detectingopinionspamsthroughsupervisedboostingapproach
AT abdullahnoraniza detectingopinionspamsthroughsupervisedboostingapproach