Cargando…

Progressive domain adaptation for detecting hate speech on social media with small training set and its application to COVID-19 concerned posts

In this world of information and experience era, microblogging sites have been commonly used to express people feelings including fear, panic, hate and abuse. Monitoring and control of abuse on social media, especially during pandemics such as COVID-19, can help in keeping the public sentiment and m...

Descripción completa

Detalles Bibliográficos
Autores principales:	Bashar, Md Abul, Nayak, Richi, Luong, Khanh, Balasubramaniam, Thirunavukarasu
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer Vienna 2021
Materias:	Original Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8319196/ https://www.ncbi.nlm.nih.gov/pubmed/34341673 http://dx.doi.org/10.1007/s13278-021-00780-w

_version_	1783730399073009664
author	Bashar, Md Abul Nayak, Richi Luong, Khanh Balasubramaniam, Thirunavukarasu
author_facet	Bashar, Md Abul Nayak, Richi Luong, Khanh Balasubramaniam, Thirunavukarasu
author_sort	Bashar, Md Abul
collection	PubMed
description	In this world of information and experience era, microblogging sites have been commonly used to express people feelings including fear, panic, hate and abuse. Monitoring and control of abuse on social media, especially during pandemics such as COVID-19, can help in keeping the public sentiment and morale positive. Developing the fear and hate detection methods based on machine learning requires labelled data. However, obtaining the labelled data in suddenly changed circumstances as a pandemic is expensive and acquiring them in a short time is impractical. Related labelled hate data from other domains or previous incidents may be available. However, the predictive accuracy of these hate detection models decreases significantly if the data distribution of the target domain, where the prediction will be applied, is different. To address this problem, we propose a novel concept of unsupervised progressive domain adaptation based on a deep-learning language model generated through multiple text datasets. We showcase the efficacy of the proposed method in hate speech and fear detection on the tweets collection during COVID-19 where the labelled information is unavailable.
format	Online Article Text
id	pubmed-8319196
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Springer Vienna
record_format	MEDLINE/PubMed
spelling	pubmed-83191962021-07-29 Progressive domain adaptation for detecting hate speech on social media with small training set and its application to COVID-19 concerned posts Bashar, Md Abul Nayak, Richi Luong, Khanh Balasubramaniam, Thirunavukarasu Soc Netw Anal Min Original Article In this world of information and experience era, microblogging sites have been commonly used to express people feelings including fear, panic, hate and abuse. Monitoring and control of abuse on social media, especially during pandemics such as COVID-19, can help in keeping the public sentiment and morale positive. Developing the fear and hate detection methods based on machine learning requires labelled data. However, obtaining the labelled data in suddenly changed circumstances as a pandemic is expensive and acquiring them in a short time is impractical. Related labelled hate data from other domains or previous incidents may be available. However, the predictive accuracy of these hate detection models decreases significantly if the data distribution of the target domain, where the prediction will be applied, is different. To address this problem, we propose a novel concept of unsupervised progressive domain adaptation based on a deep-learning language model generated through multiple text datasets. We showcase the efficacy of the proposed method in hate speech and fear detection on the tweets collection during COVID-19 where the labelled information is unavailable. Springer Vienna 2021-07-29 2021 /pmc/articles/PMC8319196/ /pubmed/34341673 http://dx.doi.org/10.1007/s13278-021-00780-w Text en © The Author(s), under exclusive licence to Springer-Verlag GmbH Austria, part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle	Original Article Bashar, Md Abul Nayak, Richi Luong, Khanh Balasubramaniam, Thirunavukarasu Progressive domain adaptation for detecting hate speech on social media with small training set and its application to COVID-19 concerned posts
title	Progressive domain adaptation for detecting hate speech on social media with small training set and its application to COVID-19 concerned posts
title_full	Progressive domain adaptation for detecting hate speech on social media with small training set and its application to COVID-19 concerned posts
title_fullStr	Progressive domain adaptation for detecting hate speech on social media with small training set and its application to COVID-19 concerned posts
title_full_unstemmed	Progressive domain adaptation for detecting hate speech on social media with small training set and its application to COVID-19 concerned posts
title_short	Progressive domain adaptation for detecting hate speech on social media with small training set and its application to COVID-19 concerned posts
title_sort	progressive domain adaptation for detecting hate speech on social media with small training set and its application to covid-19 concerned posts
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8319196/ https://www.ncbi.nlm.nih.gov/pubmed/34341673 http://dx.doi.org/10.1007/s13278-021-00780-w
work_keys_str_mv	AT basharmdabul progressivedomainadaptationfordetectinghatespeechonsocialmediawithsmalltrainingsetanditsapplicationtocovid19concernedposts AT nayakrichi progressivedomainadaptationfordetectinghatespeechonsocialmediawithsmalltrainingsetanditsapplicationtocovid19concernedposts AT luongkhanh progressivedomainadaptationfordetectinghatespeechonsocialmediawithsmalltrainingsetanditsapplicationtocovid19concernedposts AT balasubramaniamthirunavukarasu progressivedomainadaptationfordetectinghatespeechonsocialmediawithsmalltrainingsetanditsapplicationtocovid19concernedposts

Progressive domain adaptation for detecting hate speech on social media with small training set and its application to COVID-19 concerned posts

Ejemplares similares