Cargando…
Deep learning for religious and continent-based toxic content detection and classification
With time, numerous online communication platforms have emerged that allow people to express themselves, increasing the dissemination of toxic languages, such as racism, sexual harassment, and other negative behaviors that are not accepted in polite society. As a result, toxic language identificatio...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9581992/ https://www.ncbi.nlm.nih.gov/pubmed/36261675 http://dx.doi.org/10.1038/s41598-022-22523-3 |
_version_ | 1784812748909051904 |
---|---|
author | Abbasi, Ahmed Javed, Abdul Rehman Iqbal, Farkhund Kryvinska, Natalia Jalil, Zunera |
author_facet | Abbasi, Ahmed Javed, Abdul Rehman Iqbal, Farkhund Kryvinska, Natalia Jalil, Zunera |
author_sort | Abbasi, Ahmed |
collection | PubMed |
description | With time, numerous online communication platforms have emerged that allow people to express themselves, increasing the dissemination of toxic languages, such as racism, sexual harassment, and other negative behaviors that are not accepted in polite society. As a result, toxic language identification in online communication has emerged as a critical application of natural language processing. Numerous academic and industrial researchers have recently researched toxic language identification using machine learning algorithms. However, Nontoxic comments, including particular identification descriptors, such as Muslim, Jewish, White, and Black, were assigned unrealistically high toxicity ratings in several machine learning models. This research analyzes and compares modern deep learning algorithms for multilabel toxic comments classification. We explore two scenarios: the first is a multilabel classification of Religious toxic comments, and the second is a multilabel classification of race or toxic ethnicity comments with various word embeddings (GloVe, Word2vec, and FastText) without word embeddings using an ordinary embedding layer. Experiments show that the CNN model produced the best results for classifying multilabel toxic comments in both scenarios. We compared the outcomes of these modern deep learning model performances in terms of multilabel evaluation metrics. |
format | Online Article Text |
id | pubmed-9581992 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-95819922022-10-21 Deep learning for religious and continent-based toxic content detection and classification Abbasi, Ahmed Javed, Abdul Rehman Iqbal, Farkhund Kryvinska, Natalia Jalil, Zunera Sci Rep Article With time, numerous online communication platforms have emerged that allow people to express themselves, increasing the dissemination of toxic languages, such as racism, sexual harassment, and other negative behaviors that are not accepted in polite society. As a result, toxic language identification in online communication has emerged as a critical application of natural language processing. Numerous academic and industrial researchers have recently researched toxic language identification using machine learning algorithms. However, Nontoxic comments, including particular identification descriptors, such as Muslim, Jewish, White, and Black, were assigned unrealistically high toxicity ratings in several machine learning models. This research analyzes and compares modern deep learning algorithms for multilabel toxic comments classification. We explore two scenarios: the first is a multilabel classification of Religious toxic comments, and the second is a multilabel classification of race or toxic ethnicity comments with various word embeddings (GloVe, Word2vec, and FastText) without word embeddings using an ordinary embedding layer. Experiments show that the CNN model produced the best results for classifying multilabel toxic comments in both scenarios. We compared the outcomes of these modern deep learning model performances in terms of multilabel evaluation metrics. Nature Publishing Group UK 2022-10-19 /pmc/articles/PMC9581992/ /pubmed/36261675 http://dx.doi.org/10.1038/s41598-022-22523-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Abbasi, Ahmed Javed, Abdul Rehman Iqbal, Farkhund Kryvinska, Natalia Jalil, Zunera Deep learning for religious and continent-based toxic content detection and classification |
title | Deep learning for religious and continent-based toxic content detection and classification |
title_full | Deep learning for religious and continent-based toxic content detection and classification |
title_fullStr | Deep learning for religious and continent-based toxic content detection and classification |
title_full_unstemmed | Deep learning for religious and continent-based toxic content detection and classification |
title_short | Deep learning for religious and continent-based toxic content detection and classification |
title_sort | deep learning for religious and continent-based toxic content detection and classification |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9581992/ https://www.ncbi.nlm.nih.gov/pubmed/36261675 http://dx.doi.org/10.1038/s41598-022-22523-3 |
work_keys_str_mv | AT abbasiahmed deeplearningforreligiousandcontinentbasedtoxiccontentdetectionandclassification AT javedabdulrehman deeplearningforreligiousandcontinentbasedtoxiccontentdetectionandclassification AT iqbalfarkhund deeplearningforreligiousandcontinentbasedtoxiccontentdetectionandclassification AT kryvinskanatalia deeplearningforreligiousandcontinentbasedtoxiccontentdetectionandclassification AT jalilzunera deeplearningforreligiousandcontinentbasedtoxiccontentdetectionandclassification |