Cargando…

Comparison of Machine Learning and Sentiment Analysis in Detection of Suspicious Online Reviewers on Different Type of Data

The article focuses on solving an important problem of detecting suspicious reviewers in online discussions on social networks. We have concentrated on a special type of suspicious authors, on trolls. We have used methods of machine learning for generation of detection models to discriminate a troll...

Descripción completa

Detalles Bibliográficos
Autores principales:	Machova, Kristina, Mach, Marian, Vasilko, Matej
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8747373/ https://www.ncbi.nlm.nih.gov/pubmed/35009698 http://dx.doi.org/10.3390/s22010155

_version_	1784630820088053760
author	Machova, Kristina Mach, Marian Vasilko, Matej
author_facet	Machova, Kristina Mach, Marian Vasilko, Matej
author_sort	Machova, Kristina
collection	PubMed
description	The article focuses on solving an important problem of detecting suspicious reviewers in online discussions on social networks. We have concentrated on a special type of suspicious authors, on trolls. We have used methods of machine learning for generation of detection models to discriminate a troll reviewer from a common reviewer, but also methods of sentiment analysis to recognize the sentiment typical for troll’s comments. The sentiment analysis can be provided also using machine learning or lexicon-based approach. We have used lexicon-based sentiment analysis for its better ability to detect a dictionary typical for troll authors. We have achieved Accuracy = 0.95 and F1 = 0.80 using sentiment analysis. The best results using machine learning methods were achieved by support vector machine, Accuracy = 0.986 and F1 = 0.988, using a dataset with the set of all selected attributes. We can conclude that detection model based on machine learning is more successful than lexicon-based sentiment analysis, but the difference in accuracy is not so large as in F1 measure.
format	Online Article Text
id	pubmed-8747373
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-87473732022-01-11 Comparison of Machine Learning and Sentiment Analysis in Detection of Suspicious Online Reviewers on Different Type of Data Machova, Kristina Mach, Marian Vasilko, Matej Sensors (Basel) Article The article focuses on solving an important problem of detecting suspicious reviewers in online discussions on social networks. We have concentrated on a special type of suspicious authors, on trolls. We have used methods of machine learning for generation of detection models to discriminate a troll reviewer from a common reviewer, but also methods of sentiment analysis to recognize the sentiment typical for troll’s comments. The sentiment analysis can be provided also using machine learning or lexicon-based approach. We have used lexicon-based sentiment analysis for its better ability to detect a dictionary typical for troll authors. We have achieved Accuracy = 0.95 and F1 = 0.80 using sentiment analysis. The best results using machine learning methods were achieved by support vector machine, Accuracy = 0.986 and F1 = 0.988, using a dataset with the set of all selected attributes. We can conclude that detection model based on machine learning is more successful than lexicon-based sentiment analysis, but the difference in accuracy is not so large as in F1 measure. MDPI 2021-12-27 /pmc/articles/PMC8747373/ /pubmed/35009698 http://dx.doi.org/10.3390/s22010155 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Machova, Kristina Mach, Marian Vasilko, Matej Comparison of Machine Learning and Sentiment Analysis in Detection of Suspicious Online Reviewers on Different Type of Data
title	Comparison of Machine Learning and Sentiment Analysis in Detection of Suspicious Online Reviewers on Different Type of Data
title_full	Comparison of Machine Learning and Sentiment Analysis in Detection of Suspicious Online Reviewers on Different Type of Data
title_fullStr	Comparison of Machine Learning and Sentiment Analysis in Detection of Suspicious Online Reviewers on Different Type of Data
title_full_unstemmed	Comparison of Machine Learning and Sentiment Analysis in Detection of Suspicious Online Reviewers on Different Type of Data
title_short	Comparison of Machine Learning and Sentiment Analysis in Detection of Suspicious Online Reviewers on Different Type of Data
title_sort	comparison of machine learning and sentiment analysis in detection of suspicious online reviewers on different type of data
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8747373/ https://www.ncbi.nlm.nih.gov/pubmed/35009698 http://dx.doi.org/10.3390/s22010155
work_keys_str_mv	AT machovakristina comparisonofmachinelearningandsentimentanalysisindetectionofsuspiciousonlinereviewersondifferenttypeofdata AT machmarian comparisonofmachinelearningandsentimentanalysisindetectionofsuspiciousonlinereviewersondifferenttypeofdata AT vasilkomatej comparisonofmachinelearningandsentimentanalysisindetectionofsuspiciousonlinereviewersondifferenttypeofdata

Comparison of Machine Learning and Sentiment Analysis in Detection of Suspicious Online Reviewers on Different Type of Data

Ejemplares similares