A web framework for information aggregation and management of multilingual hate speech
Main authors: | Kotsakis, Rigas; Vrysis, Lazaros; Vryzas, Nikolaos; Saridou, Theodora; Matsiola, Maria; Veglis, Andreas; Dimoulas, Charalampos |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Elsevier 2023 |
Subjects: | Research Article |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10196859/ https://www.ncbi.nlm.nih.gov/pubmed/37215824 http://dx.doi.org/10.1016/j.heliyon.2023.e16084 |
_version_ | 1785044434457460736 |
---|---|
author | Kotsakis, Rigas Vrysis, Lazaros Vryzas, Nikolaos Saridou, Theodora Matsiola, Maria Veglis, Andreas Dimoulas, Charalampos |
author_sort | Kotsakis, Rigas |
collection | PubMed |
description | Social media platforms have led to the creation of a vast amount of publicly available user-generated information, facilitating participation in the public sphere but also giving certain users the opportunity to publish hateful content. This content mainly involves offensive or discriminatory speech towards social groups or individuals (based on racial, religious, gender, or other characteristics) and could lead to subsequent hate actions or crimes through persistent escalation. Content management and moderation of big data volumes can no longer be supported manually. In the current research, a web framework is presented and evaluated for the collection, analysis, and aggregation of multilingual textual content from various online sources. The framework is designed to address the needs of human users (journalists, academics, and the public) who want to collect and analyze content from social media and the web in Spanish, Italian, Greek, and English, without prior training or a background in Computer Science. The backend functionality provides content collection and monitoring; semantic analysis, including hate speech detection and sentiment analysis using machine learning models and rule-based algorithms; and storing, querying, and retrieving such content along with the relevant metadata in a database. This functionality is assessed through a graphical user interface that is accessed using a web browser. An evaluation was conducted through online questionnaires completed by journalists and students, indicating that non-experts can feasibly use the proposed framework for the defined use-case scenarios. |
format | Online Article Text |
id | pubmed-10196859 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-10196859 2023-05-20 A web framework for information aggregation and management of multilingual hate speech Heliyon Research Article Elsevier 2023-05-09 /pmc/articles/PMC10196859/ /pubmed/37215824 http://dx.doi.org/10.1016/j.heliyon.2023.e16084 Text en © 2023 The Authors. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
title | A web framework for information aggregation and management of multilingual hate speech |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10196859/ https://www.ncbi.nlm.nih.gov/pubmed/37215824 http://dx.doi.org/10.1016/j.heliyon.2023.e16084 |