Cargando…

Detecting and identifying the reasons for deleted tweets before they are posted

Social media platforms empower us in several ways, from information dissemination to consumption. While these platforms are useful in promoting citizen journalism, public awareness, etc., they have misuse potential. Malicious users use them to disseminate hate speech, offensive content, rumor, etc....

Descripción completa

Detalles Bibliográficos
Autores principales: Mubarak, Hamdy, Abdaljalil, Samir, Nassar, Azza, Alam, Firoj
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10570840/
https://www.ncbi.nlm.nih.gov/pubmed/37841232
http://dx.doi.org/10.3389/frai.2023.1219767
_version_ 1785119860557086720
author Mubarak, Hamdy
Abdaljalil, Samir
Nassar, Azza
Alam, Firoj
author_facet Mubarak, Hamdy
Abdaljalil, Samir
Nassar, Azza
Alam, Firoj
author_sort Mubarak, Hamdy
collection PubMed
description Social media platforms empower us in several ways, from information dissemination to consumption. While these platforms are useful in promoting citizen journalism, public awareness, etc., they have misuse potential. Malicious users use them to disseminate hate speech, offensive content, rumor, etc. to promote social and political agendas or to harm individuals, entities, and organizations. Oftentimes, general users unconsciously share information without verifying it or unintentionally post harmful messages. Some of such content often gets deleted either by the platform due to the violation of terms and policies or by users themselves for different reasons, e.g., regret. There is a wide range of studies in characterizing, understanding, and predicting deleted content. However, studies that aim to identify the fine-grained reasons (e.g., posts are offensive, hate speech, or no identifiable reason) behind deleted content are limited. In this study, we address an existing gap by identifying and categorizing deleted tweets, especially within the Arabic context. We label them based on fine-grained disinformation categories. We have curated a dataset of 40K tweets, annotated with both coarse and fine-grained labels. Following this, we designed models to predict the likelihood of tweets being deleted and to identify the potential reasons for their deletion. Our experiments, conducted using a variety of classic and transformer models, indicate that performance surpasses the majority baseline (e.g., 25% absolute improvement for fine-grained labels). We believe that such models can assist in moderating social media posts even before they are published.
format Online
Article
Text
id pubmed-10570840
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-105708402023-10-14 Detecting and identifying the reasons for deleted tweets before they are posted Mubarak, Hamdy Abdaljalil, Samir Nassar, Azza Alam, Firoj Front Artif Intell Artificial Intelligence Social media platforms empower us in several ways, from information dissemination to consumption. While these platforms are useful in promoting citizen journalism, public awareness, etc., they have misuse potential. Malicious users use them to disseminate hate speech, offensive content, rumor, etc. to promote social and political agendas or to harm individuals, entities, and organizations. Oftentimes, general users unconsciously share information without verifying it or unintentionally post harmful messages. Some of such content often gets deleted either by the platform due to the violation of terms and policies or by users themselves for different reasons, e.g., regret. There is a wide range of studies in characterizing, understanding, and predicting deleted content. However, studies that aim to identify the fine-grained reasons (e.g., posts are offensive, hate speech, or no identifiable reason) behind deleted content are limited. In this study, we address an existing gap by identifying and categorizing deleted tweets, especially within the Arabic context. We label them based on fine-grained disinformation categories. We have curated a dataset of 40K tweets, annotated with both coarse and fine-grained labels. Following this, we designed models to predict the likelihood of tweets being deleted and to identify the potential reasons for their deletion. Our experiments, conducted using a variety of classic and transformer models, indicate that performance surpasses the majority baseline (e.g., 25% absolute improvement for fine-grained labels). We believe that such models can assist in moderating social media posts even before they are published. Frontiers Media S.A. 2023-09-29 /pmc/articles/PMC10570840/ /pubmed/37841232 http://dx.doi.org/10.3389/frai.2023.1219767 Text en Copyright © 2023 Mubarak, Abdaljalil, Nassar and Alam. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Artificial Intelligence
Mubarak, Hamdy
Abdaljalil, Samir
Nassar, Azza
Alam, Firoj
Detecting and identifying the reasons for deleted tweets before they are posted
title Detecting and identifying the reasons for deleted tweets before they are posted
title_full Detecting and identifying the reasons for deleted tweets before they are posted
title_fullStr Detecting and identifying the reasons for deleted tweets before they are posted
title_full_unstemmed Detecting and identifying the reasons for deleted tweets before they are posted
title_short Detecting and identifying the reasons for deleted tweets before they are posted
title_sort detecting and identifying the reasons for deleted tweets before they are posted
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10570840/
https://www.ncbi.nlm.nih.gov/pubmed/37841232
http://dx.doi.org/10.3389/frai.2023.1219767
work_keys_str_mv AT mubarakhamdy detectingandidentifyingthereasonsfordeletedtweetsbeforetheyareposted
AT abdaljalilsamir detectingandidentifyingthereasonsfordeletedtweetsbeforetheyareposted
AT nassarazza detectingandidentifyingthereasonsfordeletedtweetsbeforetheyareposted
AT alamfiroj detectingandidentifyingthereasonsfordeletedtweetsbeforetheyareposted