Cargando…
Detecting twitter hate speech in COVID-19 era using machine learning and ensemble learning techniques
The COVID-19 pandemic has impacted every nation, and social isolation is the major protective method for the coronavirus. People express themselves via Facebook and Twitter. People disseminate disinformation and hate speech on Twitter. This research seeks to detect hate speech using machine learning...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Author(s). Published by Elsevier Ltd.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9481233/ http://dx.doi.org/10.1016/j.jjimei.2022.100120 |
_version_ | 1784791218456100864 |
---|---|
author | Khanday, Akib Mohi Ud Din Rabani, Syed Tanzeel Khan, Qamar Rayees Malik, Showkat Hassan |
author_facet | Khanday, Akib Mohi Ud Din Rabani, Syed Tanzeel Khan, Qamar Rayees Malik, Showkat Hassan |
author_sort | Khanday, Akib Mohi Ud Din |
collection | PubMed |
description | The COVID-19 pandemic has impacted every nation, and social isolation is the major protective method for the coronavirus. People express themselves via Facebook and Twitter. People disseminate disinformation and hate speech on Twitter. This research seeks to detect hate speech using machine learning and ensemble learning techniques during COVID-19. Twitter data was extracted from using its API with the help of trending hashtags during the COVID-19 pandemic. Tweets were manually annotated into two categories based on different factors. Features are extracted using TF/IDF, Bag of Words and Tweet Length. The study found the Decision Tree classifier to be effective. Compared to other typical ML classifiers, it has 98% precision, 97% recall, 97% F1-Score, and 97% accuracy. The Stochastic Gradient Boosting classifier outperforms all others with 99 percent precision, 97 percent recall, 98 percent F1-Score, and 98.04 percent accuracy. |
format | Online Article Text |
id | pubmed-9481233 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | The Author(s). Published by Elsevier Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-94812332022-09-19 Detecting twitter hate speech in COVID-19 era using machine learning and ensemble learning techniques Khanday, Akib Mohi Ud Din Rabani, Syed Tanzeel Khan, Qamar Rayees Malik, Showkat Hassan International Journal of Information Management Data Insights Article The COVID-19 pandemic has impacted every nation, and social isolation is the major protective method for the coronavirus. People express themselves via Facebook and Twitter. People disseminate disinformation and hate speech on Twitter. This research seeks to detect hate speech using machine learning and ensemble learning techniques during COVID-19. Twitter data was extracted from using its API with the help of trending hashtags during the COVID-19 pandemic. Tweets were manually annotated into two categories based on different factors. Features are extracted using TF/IDF, Bag of Words and Tweet Length. The study found the Decision Tree classifier to be effective. Compared to other typical ML classifiers, it has 98% precision, 97% recall, 97% F1-Score, and 97% accuracy. The Stochastic Gradient Boosting classifier outperforms all others with 99 percent precision, 97 percent recall, 98 percent F1-Score, and 98.04 percent accuracy. The Author(s). Published by Elsevier Ltd. 2022-11 2022-09-16 /pmc/articles/PMC9481233/ http://dx.doi.org/10.1016/j.jjimei.2022.100120 Text en © 2022 The Author(s) Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Khanday, Akib Mohi Ud Din Rabani, Syed Tanzeel Khan, Qamar Rayees Malik, Showkat Hassan Detecting twitter hate speech in COVID-19 era using machine learning and ensemble learning techniques |
title | Detecting twitter hate speech in COVID-19 era using machine learning and ensemble learning techniques |
title_full | Detecting twitter hate speech in COVID-19 era using machine learning and ensemble learning techniques |
title_fullStr | Detecting twitter hate speech in COVID-19 era using machine learning and ensemble learning techniques |
title_full_unstemmed | Detecting twitter hate speech in COVID-19 era using machine learning and ensemble learning techniques |
title_short | Detecting twitter hate speech in COVID-19 era using machine learning and ensemble learning techniques |
title_sort | detecting twitter hate speech in covid-19 era using machine learning and ensemble learning techniques |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9481233/ http://dx.doi.org/10.1016/j.jjimei.2022.100120 |
work_keys_str_mv | AT khandayakibmohiuddin detectingtwitterhatespeechincovid19erausingmachinelearningandensemblelearningtechniques AT rabanisyedtanzeel detectingtwitterhatespeechincovid19erausingmachinelearningandensemblelearningtechniques AT khanqamarrayees detectingtwitterhatespeechincovid19erausingmachinelearningandensemblelearningtechniques AT malikshowkathassan detectingtwitterhatespeechincovid19erausingmachinelearningandensemblelearningtechniques |