Cargando…
Detecting and identifying the reasons for deleted tweets before they are posted
Social media platforms empower us in several ways, from information dissemination to consumption. While these platforms are useful in promoting citizen journalism, public awareness, etc., they have misuse potential. Malicious users use them to disseminate hate speech, offensive content, rumor, etc....
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10570840/ https://www.ncbi.nlm.nih.gov/pubmed/37841232 http://dx.doi.org/10.3389/frai.2023.1219767 |
_version_ | 1785119860557086720 |
---|---|
author | Mubarak, Hamdy Abdaljalil, Samir Nassar, Azza Alam, Firoj |
author_facet | Mubarak, Hamdy Abdaljalil, Samir Nassar, Azza Alam, Firoj |
author_sort | Mubarak, Hamdy |
collection | PubMed |
description | Social media platforms empower us in several ways, from information dissemination to consumption. While these platforms are useful in promoting citizen journalism, public awareness, etc., they have misuse potential. Malicious users use them to disseminate hate speech, offensive content, rumor, etc. to promote social and political agendas or to harm individuals, entities, and organizations. Oftentimes, general users unconsciously share information without verifying it or unintentionally post harmful messages. Some of such content often gets deleted either by the platform due to the violation of terms and policies or by users themselves for different reasons, e.g., regret. There is a wide range of studies in characterizing, understanding, and predicting deleted content. However, studies that aim to identify the fine-grained reasons (e.g., posts are offensive, hate speech, or no identifiable reason) behind deleted content are limited. In this study, we address an existing gap by identifying and categorizing deleted tweets, especially within the Arabic context. We label them based on fine-grained disinformation categories. We have curated a dataset of 40K tweets, annotated with both coarse and fine-grained labels. Following this, we designed models to predict the likelihood of tweets being deleted and to identify the potential reasons for their deletion. Our experiments, conducted using a variety of classic and transformer models, indicate that performance surpasses the majority baseline (e.g., 25% absolute improvement for fine-grained labels). We believe that such models can assist in moderating social media posts even before they are published. |
format | Online Article Text |
id | pubmed-10570840 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-105708402023-10-14 Detecting and identifying the reasons for deleted tweets before they are posted Mubarak, Hamdy Abdaljalil, Samir Nassar, Azza Alam, Firoj Front Artif Intell Artificial Intelligence Social media platforms empower us in several ways, from information dissemination to consumption. While these platforms are useful in promoting citizen journalism, public awareness, etc., they have misuse potential. Malicious users use them to disseminate hate speech, offensive content, rumor, etc. to promote social and political agendas or to harm individuals, entities, and organizations. Oftentimes, general users unconsciously share information without verifying it or unintentionally post harmful messages. Some of such content often gets deleted either by the platform due to the violation of terms and policies or by users themselves for different reasons, e.g., regret. There is a wide range of studies in characterizing, understanding, and predicting deleted content. However, studies that aim to identify the fine-grained reasons (e.g., posts are offensive, hate speech, or no identifiable reason) behind deleted content are limited. In this study, we address an existing gap by identifying and categorizing deleted tweets, especially within the Arabic context. We label them based on fine-grained disinformation categories. We have curated a dataset of 40K tweets, annotated with both coarse and fine-grained labels. Following this, we designed models to predict the likelihood of tweets being deleted and to identify the potential reasons for their deletion. Our experiments, conducted using a variety of classic and transformer models, indicate that performance surpasses the majority baseline (e.g., 25% absolute improvement for fine-grained labels). We believe that such models can assist in moderating social media posts even before they are published. Frontiers Media S.A. 2023-09-29 /pmc/articles/PMC10570840/ /pubmed/37841232 http://dx.doi.org/10.3389/frai.2023.1219767 Text en Copyright © 2023 Mubarak, Abdaljalil, Nassar and Alam. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Artificial Intelligence Mubarak, Hamdy Abdaljalil, Samir Nassar, Azza Alam, Firoj Detecting and identifying the reasons for deleted tweets before they are posted |
title | Detecting and identifying the reasons for deleted tweets before they are posted |
title_full | Detecting and identifying the reasons for deleted tweets before they are posted |
title_fullStr | Detecting and identifying the reasons for deleted tweets before they are posted |
title_full_unstemmed | Detecting and identifying the reasons for deleted tweets before they are posted |
title_short | Detecting and identifying the reasons for deleted tweets before they are posted |
title_sort | detecting and identifying the reasons for deleted tweets before they are posted |
topic | Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10570840/ https://www.ncbi.nlm.nih.gov/pubmed/37841232 http://dx.doi.org/10.3389/frai.2023.1219767 |
work_keys_str_mv | AT mubarakhamdy detectingandidentifyingthereasonsfordeletedtweetsbeforetheyareposted AT abdaljalilsamir detectingandidentifyingthereasonsfordeletedtweetsbeforetheyareposted AT nassarazza detectingandidentifyingthereasonsfordeletedtweetsbeforetheyareposted AT alamfiroj detectingandidentifyingthereasonsfordeletedtweetsbeforetheyareposted |