Cargando…

DeeProBot: a hybrid deep neural network model for social bot detection based on user profile data

Use of online social networks (OSNs) undoubtedly brings the world closer. OSNs like Twitter provide a space for expressing one’s opinions in a public platform. This great potential is misused by the creation of bot accounts, which spread fake news and manipulate opinions. Hence, distinguishing genui...

Descripción completa

Detalles Bibliográficos
Autores principales:	Hayawi, Kadhim, Mathew, Sujith, Venugopal, Neethu, Masud, Mohammad M., Ho, Pin-Han
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer Vienna 2022
Materias:	Original Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8917378/ https://www.ncbi.nlm.nih.gov/pubmed/35309873 http://dx.doi.org/10.1007/s13278-022-00869-w

_version_	1784668537000820736
author	Hayawi, Kadhim Mathew, Sujith Venugopal, Neethu Masud, Mohammad M. Ho, Pin-Han
author_facet	Hayawi, Kadhim Mathew, Sujith Venugopal, Neethu Masud, Mohammad M. Ho, Pin-Han
author_sort	Hayawi, Kadhim
collection	PubMed
description	Use of online social networks (OSNs) undoubtedly brings the world closer. OSNs like Twitter provide a space for expressing one’s opinions in a public platform. This great potential is misused by the creation of bot accounts, which spread fake news and manipulate opinions. Hence, distinguishing genuine human accounts from bot accounts has become a pressing issue for researchers. In this paper, we propose a framework based on deep learning to classify Twitter accounts as either ‘human’ or ‘bot.’ We use the information from user profile metadata of the Twitter account like description, follower count and tweet count. We name the framework ‘DeeProBot,’ which stands for Deep Profile-based Bot detection framework. The raw text from the description field of the Twitter account is also considered a feature for training the model by embedding the raw text using pre-trained Global Vectors (GLoVe) for word representation. Using only the user profile-based features considerably reduces the feature engineering overhead compared with that of user timeline-based features like user tweets and retweets. DeeProBot handles mixed types of features including numerical, binary, and text data, making the model hybrid. The network is designed with long short-term memory (LSTM) units and dense layers to accept and process the mixed input types. The proposed model is evaluated on a collection of publicly available labeled datasets. We have designed the model to make it generalizable across different datasets. The model is evaluated using two ways: testing on a hold-out set of the same dataset; and training with one dataset and testing with a different dataset. With these experiments, the proposed model achieved AUC as high as 0.97 with a selected set of features.
format	Online Article Text
id	pubmed-8917378
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Springer Vienna
record_format	MEDLINE/PubMed
spelling	pubmed-89173782022-03-14 DeeProBot: a hybrid deep neural network model for social bot detection based on user profile data Hayawi, Kadhim Mathew, Sujith Venugopal, Neethu Masud, Mohammad M. Ho, Pin-Han Soc Netw Anal Min Original Article Use of online social networks (OSNs) undoubtedly brings the world closer. OSNs like Twitter provide a space for expressing one’s opinions in a public platform. This great potential is misused by the creation of bot accounts, which spread fake news and manipulate opinions. Hence, distinguishing genuine human accounts from bot accounts has become a pressing issue for researchers. In this paper, we propose a framework based on deep learning to classify Twitter accounts as either ‘human’ or ‘bot.’ We use the information from user profile metadata of the Twitter account like description, follower count and tweet count. We name the framework ‘DeeProBot,’ which stands for Deep Profile-based Bot detection framework. The raw text from the description field of the Twitter account is also considered a feature for training the model by embedding the raw text using pre-trained Global Vectors (GLoVe) for word representation. Using only the user profile-based features considerably reduces the feature engineering overhead compared with that of user timeline-based features like user tweets and retweets. DeeProBot handles mixed types of features including numerical, binary, and text data, making the model hybrid. The network is designed with long short-term memory (LSTM) units and dense layers to accept and process the mixed input types. The proposed model is evaluated on a collection of publicly available labeled datasets. We have designed the model to make it generalizable across different datasets. The model is evaluated using two ways: testing on a hold-out set of the same dataset; and training with one dataset and testing with a different dataset. With these experiments, the proposed model achieved AUC as high as 0.97 with a selected set of features. Springer Vienna 2022-03-12 2022 /pmc/articles/PMC8917378/ /pubmed/35309873 http://dx.doi.org/10.1007/s13278-022-00869-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Original Article Hayawi, Kadhim Mathew, Sujith Venugopal, Neethu Masud, Mohammad M. Ho, Pin-Han DeeProBot: a hybrid deep neural network model for social bot detection based on user profile data
title	DeeProBot: a hybrid deep neural network model for social bot detection based on user profile data
title_full	DeeProBot: a hybrid deep neural network model for social bot detection based on user profile data
title_fullStr	DeeProBot: a hybrid deep neural network model for social bot detection based on user profile data
title_full_unstemmed	DeeProBot: a hybrid deep neural network model for social bot detection based on user profile data
title_short	DeeProBot: a hybrid deep neural network model for social bot detection based on user profile data
title_sort	deeprobot: a hybrid deep neural network model for social bot detection based on user profile data
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8917378/ https://www.ncbi.nlm.nih.gov/pubmed/35309873 http://dx.doi.org/10.1007/s13278-022-00869-w
work_keys_str_mv	AT hayawikadhim deeprobotahybriddeepneuralnetworkmodelforsocialbotdetectionbasedonuserprofiledata AT mathewsujith deeprobotahybriddeepneuralnetworkmodelforsocialbotdetectionbasedonuserprofiledata AT venugopalneethu deeprobotahybriddeepneuralnetworkmodelforsocialbotdetectionbasedonuserprofiledata AT masudmohammadm deeprobotahybriddeepneuralnetworkmodelforsocialbotdetectionbasedonuserprofiledata AT hopinhan deeprobotahybriddeepneuralnetworkmodelforsocialbotdetectionbasedonuserprofiledata

DeeProBot: a hybrid deep neural network model for social bot detection based on user profile data

Ejemplares similares