Cargando…

Detecting fake-review buyers using network structure: Direct evidence from Amazon

Online reviews significantly impact consumers’ decision-making process and firms’ economic outcomes and are widely seen as crucial to the success of online markets. Firms, therefore, have a strong incentive to manipulate ratings using fake reviews. This presents a problem that academic researchers h...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Sherry, Hollenbeck, Brett, Overgoor, Gijs, Proserpio, Davide, Tosyali, Ali
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9704690/
https://www.ncbi.nlm.nih.gov/pubmed/36378645
http://dx.doi.org/10.1073/pnas.2211932119
_version_ 1784840108787105792
author He, Sherry
Hollenbeck, Brett
Overgoor, Gijs
Proserpio, Davide
Tosyali, Ali
author_facet He, Sherry
Hollenbeck, Brett
Overgoor, Gijs
Proserpio, Davide
Tosyali, Ali
author_sort He, Sherry
collection PubMed
description Online reviews significantly impact consumers’ decision-making process and firms’ economic outcomes and are widely seen as crucial to the success of online markets. Firms, therefore, have a strong incentive to manipulate ratings using fake reviews. This presents a problem that academic researchers have tried to solve for over two decades and on which platforms expend a large amount of resources. Nevertheless, the prevalence of fake reviews is arguably higher than ever. To combat this, we collect a dataset of reviews for thousands of Amazon products and develop a general and highly accurate method for detecting fake reviews. A unique difference between previous datasets and ours is that we directly observe which sellers buy fake reviews. Thus, while prior research has trained models using laboratory-generated reviews or proxies for fake reviews, we are able to train a model using actual fake reviews. We show that products that buy fake reviews are highly clustered in the product reviewer network. Therefore, features constructed from this network are highly predictive of which products buy fake reviews. We show that our network-based approach is also successful at detecting fake review buyers even without ground truth data, as unsupervised clustering methods can accurately identify fake review buyers by identifying clusters of products that are closely connected in the network. While text or metadata can be manipulated to evade detection, network-based features are more costly to manipulate because these features result directly from the inherent limitations of buying reviews from online review marketplaces, making our detection approach more robust to manipulation.
format Online
Article
Text
id pubmed-9704690
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-97046902023-05-15 Detecting fake-review buyers using network structure: Direct evidence from Amazon He, Sherry Hollenbeck, Brett Overgoor, Gijs Proserpio, Davide Tosyali, Ali Proc Natl Acad Sci U S A Social Sciences Online reviews significantly impact consumers’ decision-making process and firms’ economic outcomes and are widely seen as crucial to the success of online markets. Firms, therefore, have a strong incentive to manipulate ratings using fake reviews. This presents a problem that academic researchers have tried to solve for over two decades and on which platforms expend a large amount of resources. Nevertheless, the prevalence of fake reviews is arguably higher than ever. To combat this, we collect a dataset of reviews for thousands of Amazon products and develop a general and highly accurate method for detecting fake reviews. A unique difference between previous datasets and ours is that we directly observe which sellers buy fake reviews. Thus, while prior research has trained models using laboratory-generated reviews or proxies for fake reviews, we are able to train a model using actual fake reviews. We show that products that buy fake reviews are highly clustered in the product reviewer network. Therefore, features constructed from this network are highly predictive of which products buy fake reviews. We show that our network-based approach is also successful at detecting fake review buyers even without ground truth data, as unsupervised clustering methods can accurately identify fake review buyers by identifying clusters of products that are closely connected in the network. While text or metadata can be manipulated to evade detection, network-based features are more costly to manipulate because these features result directly from the inherent limitations of buying reviews from online review marketplaces, making our detection approach more robust to manipulation. National Academy of Sciences 2022-11-15 2022-11-22 /pmc/articles/PMC9704690/ /pubmed/36378645 http://dx.doi.org/10.1073/pnas.2211932119 Text en Copyright © 2022 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Social Sciences
He, Sherry
Hollenbeck, Brett
Overgoor, Gijs
Proserpio, Davide
Tosyali, Ali
Detecting fake-review buyers using network structure: Direct evidence from Amazon
title Detecting fake-review buyers using network structure: Direct evidence from Amazon
title_full Detecting fake-review buyers using network structure: Direct evidence from Amazon
title_fullStr Detecting fake-review buyers using network structure: Direct evidence from Amazon
title_full_unstemmed Detecting fake-review buyers using network structure: Direct evidence from Amazon
title_short Detecting fake-review buyers using network structure: Direct evidence from Amazon
title_sort detecting fake-review buyers using network structure: direct evidence from amazon
topic Social Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9704690/
https://www.ncbi.nlm.nih.gov/pubmed/36378645
http://dx.doi.org/10.1073/pnas.2211932119
work_keys_str_mv AT hesherry detectingfakereviewbuyersusingnetworkstructuredirectevidencefromamazon
AT hollenbeckbrett detectingfakereviewbuyersusingnetworkstructuredirectevidencefromamazon
AT overgoorgijs detectingfakereviewbuyersusingnetworkstructuredirectevidencefromamazon
AT proserpiodavide detectingfakereviewbuyersusingnetworkstructuredirectevidencefromamazon
AT tosyaliali detectingfakereviewbuyersusingnetworkstructuredirectevidencefromamazon