Cargando…

Retweet communities reveal the main sources of hate speech

We address a challenging problem of identifying main sources of hate speech on Twitter. On one hand, we carefully annotate a large set of tweets for hate speech, and deploy advanced deep learning to produce high quality hate speech classification models. On the other hand, we create retweet networks...

Descripción completa

Detalles Bibliográficos
Autores principales: Evkoski, Bojan, Pelicon, Andraž, Mozetič, Igor, Ljubešić, Nikola, Kralj Novak, Petra
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8929563/
https://www.ncbi.nlm.nih.gov/pubmed/35298556
http://dx.doi.org/10.1371/journal.pone.0265602
_version_ 1784670883195912192
author Evkoski, Bojan
Pelicon, Andraž
Mozetič, Igor
Ljubešić, Nikola
Kralj Novak, Petra
author_facet Evkoski, Bojan
Pelicon, Andraž
Mozetič, Igor
Ljubešić, Nikola
Kralj Novak, Petra
author_sort Evkoski, Bojan
collection PubMed
description We address a challenging problem of identifying main sources of hate speech on Twitter. On one hand, we carefully annotate a large set of tweets for hate speech, and deploy advanced deep learning to produce high quality hate speech classification models. On the other hand, we create retweet networks, detect communities and monitor their evolution through time. This combined approach is applied to three years of Slovenian Twitter data. We report a number of interesting results. Hate speech is dominated by offensive tweets, related to political and ideological issues. The share of unacceptable tweets is moderately increasing with time, from the initial 20% to 30% by the end of 2020. Unacceptable tweets are retweeted significantly more often than acceptable tweets. About 60% of unacceptable tweets are produced by a single right-wing community of only moderate size. Institutional Twitter accounts and media accounts post significantly less unacceptable tweets than individual accounts. In fact, the main sources of unacceptable tweets are anonymous accounts, and accounts that were suspended or closed during the years 2018–2020.
format Online
Article
Text
id pubmed-8929563
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-89295632022-03-18 Retweet communities reveal the main sources of hate speech Evkoski, Bojan Pelicon, Andraž Mozetič, Igor Ljubešić, Nikola Kralj Novak, Petra PLoS One Research Article We address a challenging problem of identifying main sources of hate speech on Twitter. On one hand, we carefully annotate a large set of tweets for hate speech, and deploy advanced deep learning to produce high quality hate speech classification models. On the other hand, we create retweet networks, detect communities and monitor their evolution through time. This combined approach is applied to three years of Slovenian Twitter data. We report a number of interesting results. Hate speech is dominated by offensive tweets, related to political and ideological issues. The share of unacceptable tweets is moderately increasing with time, from the initial 20% to 30% by the end of 2020. Unacceptable tweets are retweeted significantly more often than acceptable tweets. About 60% of unacceptable tweets are produced by a single right-wing community of only moderate size. Institutional Twitter accounts and media accounts post significantly less unacceptable tweets than individual accounts. In fact, the main sources of unacceptable tweets are anonymous accounts, and accounts that were suspended or closed during the years 2018–2020. Public Library of Science 2022-03-17 /pmc/articles/PMC8929563/ /pubmed/35298556 http://dx.doi.org/10.1371/journal.pone.0265602 Text en © 2022 Evkoski et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Evkoski, Bojan
Pelicon, Andraž
Mozetič, Igor
Ljubešić, Nikola
Kralj Novak, Petra
Retweet communities reveal the main sources of hate speech
title Retweet communities reveal the main sources of hate speech
title_full Retweet communities reveal the main sources of hate speech
title_fullStr Retweet communities reveal the main sources of hate speech
title_full_unstemmed Retweet communities reveal the main sources of hate speech
title_short Retweet communities reveal the main sources of hate speech
title_sort retweet communities reveal the main sources of hate speech
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8929563/
https://www.ncbi.nlm.nih.gov/pubmed/35298556
http://dx.doi.org/10.1371/journal.pone.0265602
work_keys_str_mv AT evkoskibojan retweetcommunitiesrevealthemainsourcesofhatespeech
AT peliconandraz retweetcommunitiesrevealthemainsourcesofhatespeech
AT mozeticigor retweetcommunitiesrevealthemainsourcesofhatespeech
AT ljubesicnikola retweetcommunitiesrevealthemainsourcesofhatespeech
AT kraljnovakpetra retweetcommunitiesrevealthemainsourcesofhatespeech