Cargando…

Deep learning for religious and continent-based toxic content detection and classification

With time, numerous online communication platforms have emerged that allow people to express themselves, increasing the dissemination of toxic languages, such as racism, sexual harassment, and other negative behaviors that are not accepted in polite society. As a result, toxic language identificatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Abbasi, Ahmed, Javed, Abdul Rehman, Iqbal, Farkhund, Kryvinska, Natalia, Jalil, Zunera
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9581992/
https://www.ncbi.nlm.nih.gov/pubmed/36261675
http://dx.doi.org/10.1038/s41598-022-22523-3
_version_ 1784812748909051904
author Abbasi, Ahmed
Javed, Abdul Rehman
Iqbal, Farkhund
Kryvinska, Natalia
Jalil, Zunera
author_facet Abbasi, Ahmed
Javed, Abdul Rehman
Iqbal, Farkhund
Kryvinska, Natalia
Jalil, Zunera
author_sort Abbasi, Ahmed
collection PubMed
description With time, numerous online communication platforms have emerged that allow people to express themselves, increasing the dissemination of toxic languages, such as racism, sexual harassment, and other negative behaviors that are not accepted in polite society. As a result, toxic language identification in online communication has emerged as a critical application of natural language processing. Numerous academic and industrial researchers have recently researched toxic language identification using machine learning algorithms. However, Nontoxic comments, including particular identification descriptors, such as Muslim, Jewish, White, and Black, were assigned unrealistically high toxicity ratings in several machine learning models. This research analyzes and compares modern deep learning algorithms for multilabel toxic comments classification. We explore two scenarios: the first is a multilabel classification of Religious toxic comments, and the second is a multilabel classification of race or toxic ethnicity comments with various word embeddings (GloVe, Word2vec, and FastText) without word embeddings using an ordinary embedding layer. Experiments show that the CNN model produced the best results for classifying multilabel toxic comments in both scenarios. We compared the outcomes of these modern deep learning model performances in terms of multilabel evaluation metrics.
format Online
Article
Text
id pubmed-9581992
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-95819922022-10-21 Deep learning for religious and continent-based toxic content detection and classification Abbasi, Ahmed Javed, Abdul Rehman Iqbal, Farkhund Kryvinska, Natalia Jalil, Zunera Sci Rep Article With time, numerous online communication platforms have emerged that allow people to express themselves, increasing the dissemination of toxic languages, such as racism, sexual harassment, and other negative behaviors that are not accepted in polite society. As a result, toxic language identification in online communication has emerged as a critical application of natural language processing. Numerous academic and industrial researchers have recently researched toxic language identification using machine learning algorithms. However, Nontoxic comments, including particular identification descriptors, such as Muslim, Jewish, White, and Black, were assigned unrealistically high toxicity ratings in several machine learning models. This research analyzes and compares modern deep learning algorithms for multilabel toxic comments classification. We explore two scenarios: the first is a multilabel classification of Religious toxic comments, and the second is a multilabel classification of race or toxic ethnicity comments with various word embeddings (GloVe, Word2vec, and FastText) without word embeddings using an ordinary embedding layer. Experiments show that the CNN model produced the best results for classifying multilabel toxic comments in both scenarios. We compared the outcomes of these modern deep learning model performances in terms of multilabel evaluation metrics. Nature Publishing Group UK 2022-10-19 /pmc/articles/PMC9581992/ /pubmed/36261675 http://dx.doi.org/10.1038/s41598-022-22523-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Abbasi, Ahmed
Javed, Abdul Rehman
Iqbal, Farkhund
Kryvinska, Natalia
Jalil, Zunera
Deep learning for religious and continent-based toxic content detection and classification
title Deep learning for religious and continent-based toxic content detection and classification
title_full Deep learning for religious and continent-based toxic content detection and classification
title_fullStr Deep learning for religious and continent-based toxic content detection and classification
title_full_unstemmed Deep learning for religious and continent-based toxic content detection and classification
title_short Deep learning for religious and continent-based toxic content detection and classification
title_sort deep learning for religious and continent-based toxic content detection and classification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9581992/
https://www.ncbi.nlm.nih.gov/pubmed/36261675
http://dx.doi.org/10.1038/s41598-022-22523-3
work_keys_str_mv AT abbasiahmed deeplearningforreligiousandcontinentbasedtoxiccontentdetectionandclassification
AT javedabdulrehman deeplearningforreligiousandcontinentbasedtoxiccontentdetectionandclassification
AT iqbalfarkhund deeplearningforreligiousandcontinentbasedtoxiccontentdetectionandclassification
AT kryvinskanatalia deeplearningforreligiousandcontinentbasedtoxiccontentdetectionandclassification
AT jalilzunera deeplearningforreligiousandcontinentbasedtoxiccontentdetectionandclassification