Cargando…
A Natural Language Processing (NLP) Evaluation on COVID-19 Rumour Dataset Using Deep Learning Techniques
Context and Background: Since December 2019, the coronavirus (COVID-19) epidemic has sparked considerable alarm among the general community and significantly affected societal attitudes and perceptions. Apart from the disease itself, many people suffer from anxiety and depression due to the disease...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9492356/ https://www.ncbi.nlm.nih.gov/pubmed/36156967 http://dx.doi.org/10.1155/2022/6561622 |
_version_ | 1784793460682784768 |
---|---|
author | Fatima, Rubia Samad Shaikh, Naila Riaz, Adnan Ahmad, Sadique El-Affendi, Mohammed A. Alyamani, Khaled A. Z. Nabeel, Muhammad Ali Khan, Javed Yasin, Affan Latif, Rana M. Amir |
author_facet | Fatima, Rubia Samad Shaikh, Naila Riaz, Adnan Ahmad, Sadique El-Affendi, Mohammed A. Alyamani, Khaled A. Z. Nabeel, Muhammad Ali Khan, Javed Yasin, Affan Latif, Rana M. Amir |
author_sort | Fatima, Rubia |
collection | PubMed |
description | Context and Background: Since December 2019, the coronavirus (COVID-19) epidemic has sparked considerable alarm among the general community and significantly affected societal attitudes and perceptions. Apart from the disease itself, many people suffer from anxiety and depression due to the disease and the present threat of an outbreak. Due to the fast propagation of the virus and misleading/fake information, the issues of public discourse alter, resulting in significant confusion in certain places. Rumours are unproven facts or stories that propagate and promote sentiments of prejudice, hatred, and fear. Objective. The study's objective is to propose a novel solution to detect fake news using state-of-the-art machines and deep learning models. Furthermore, to analyse which models outperformed in detecting the fake news. Method. In the research study, we adapted a COVID-19 rumours dataset, which incorporates rumours from news websites and tweets, together with information about the rumours. It is important to analyse data utilizing Natural Language Processing (NLP) and Deep Learning (DL) approaches. Based on the accuracy, precision, recall, and the f1 score, we can assess the effectiveness of the ML and DL algorithms. Results. The data adopted from the source (mentioned in the paper) have collected 9200 comments from Google and 34,779 Twitter postings filtered for phrases connected with COVID-19-related fake news. Experiment 1. The dataset was assessed using the following three criteria: veracity, stance, and sentiment. In these terms, we have different labels, and we have applied the DL algorithms separately to each term. We have used different models in the experiment such as (i) LSTM and (ii) Temporal Convolution Networks (TCN). The TCN model has more performance on each measurement parameter in the evaluated results. So, we have used the TCN model for the practical implication for better findings. Experiment 2. In the second experiment, we have used different state-of-the-art deep learning models and algorithms such as (i) Simple RNN; (ii) LSTM + Word Embedding; (iii) Bidirectional + Word Embedding; (iv) LSTM + CNN-1D; and (v) BERT. Furthermore, we have evaluated the performance of these models on all three datasets, e.g., veracity, stance, and sentiment. Based on our second experimental evaluation, the BERT has a superior performance over the other models compared. |
format | Online Article Text |
id | pubmed-9492356 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-94923562022-09-22 A Natural Language Processing (NLP) Evaluation on COVID-19 Rumour Dataset Using Deep Learning Techniques Fatima, Rubia Samad Shaikh, Naila Riaz, Adnan Ahmad, Sadique El-Affendi, Mohammed A. Alyamani, Khaled A. Z. Nabeel, Muhammad Ali Khan, Javed Yasin, Affan Latif, Rana M. Amir Comput Intell Neurosci Research Article Context and Background: Since December 2019, the coronavirus (COVID-19) epidemic has sparked considerable alarm among the general community and significantly affected societal attitudes and perceptions. Apart from the disease itself, many people suffer from anxiety and depression due to the disease and the present threat of an outbreak. Due to the fast propagation of the virus and misleading/fake information, the issues of public discourse alter, resulting in significant confusion in certain places. Rumours are unproven facts or stories that propagate and promote sentiments of prejudice, hatred, and fear. Objective. The study's objective is to propose a novel solution to detect fake news using state-of-the-art machines and deep learning models. Furthermore, to analyse which models outperformed in detecting the fake news. Method. In the research study, we adapted a COVID-19 rumours dataset, which incorporates rumours from news websites and tweets, together with information about the rumours. It is important to analyse data utilizing Natural Language Processing (NLP) and Deep Learning (DL) approaches. Based on the accuracy, precision, recall, and the f1 score, we can assess the effectiveness of the ML and DL algorithms. Results. The data adopted from the source (mentioned in the paper) have collected 9200 comments from Google and 34,779 Twitter postings filtered for phrases connected with COVID-19-related fake news. Experiment 1. The dataset was assessed using the following three criteria: veracity, stance, and sentiment. In these terms, we have different labels, and we have applied the DL algorithms separately to each term. We have used different models in the experiment such as (i) LSTM and (ii) Temporal Convolution Networks (TCN). The TCN model has more performance on each measurement parameter in the evaluated results. So, we have used the TCN model for the practical implication for better findings. Experiment 2. In the second experiment, we have used different state-of-the-art deep learning models and algorithms such as (i) Simple RNN; (ii) LSTM + Word Embedding; (iii) Bidirectional + Word Embedding; (iv) LSTM + CNN-1D; and (v) BERT. Furthermore, we have evaluated the performance of these models on all three datasets, e.g., veracity, stance, and sentiment. Based on our second experimental evaluation, the BERT has a superior performance over the other models compared. Hindawi 2022-09-14 /pmc/articles/PMC9492356/ /pubmed/36156967 http://dx.doi.org/10.1155/2022/6561622 Text en Copyright © 2022 Rubia Fatima et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Fatima, Rubia Samad Shaikh, Naila Riaz, Adnan Ahmad, Sadique El-Affendi, Mohammed A. Alyamani, Khaled A. Z. Nabeel, Muhammad Ali Khan, Javed Yasin, Affan Latif, Rana M. Amir A Natural Language Processing (NLP) Evaluation on COVID-19 Rumour Dataset Using Deep Learning Techniques |
title | A Natural Language Processing (NLP) Evaluation on COVID-19 Rumour Dataset Using Deep Learning Techniques |
title_full | A Natural Language Processing (NLP) Evaluation on COVID-19 Rumour Dataset Using Deep Learning Techniques |
title_fullStr | A Natural Language Processing (NLP) Evaluation on COVID-19 Rumour Dataset Using Deep Learning Techniques |
title_full_unstemmed | A Natural Language Processing (NLP) Evaluation on COVID-19 Rumour Dataset Using Deep Learning Techniques |
title_short | A Natural Language Processing (NLP) Evaluation on COVID-19 Rumour Dataset Using Deep Learning Techniques |
title_sort | natural language processing (nlp) evaluation on covid-19 rumour dataset using deep learning techniques |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9492356/ https://www.ncbi.nlm.nih.gov/pubmed/36156967 http://dx.doi.org/10.1155/2022/6561622 |
work_keys_str_mv | AT fatimarubia anaturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques AT samadshaikhnaila anaturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques AT riazadnan anaturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques AT ahmadsadique anaturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques AT elaffendimohammeda anaturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques AT alyamanikhaledaz anaturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques AT nabeelmuhammad anaturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques AT alikhanjaved anaturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques AT yasinaffan anaturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques AT latifranamamir anaturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques AT fatimarubia naturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques AT samadshaikhnaila naturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques AT riazadnan naturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques AT ahmadsadique naturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques AT elaffendimohammeda naturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques AT alyamanikhaledaz naturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques AT nabeelmuhammad naturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques AT alikhanjaved naturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques AT yasinaffan naturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques AT latifranamamir naturallanguageprocessingnlpevaluationoncovid19rumourdatasetusingdeeplearningtechniques |