Cargando…

Search queries related to COVID-19 based on keyword extraction

BACKGROUND: Pandemic COVID-19 caused an infodemic – massive spread of true and fake information about novel coronavirus. This study aims to present the possibility of using Keyword Extraction as a tool to obtain the most trending search queries related to COVID-19 and analyze the possibility of incl...

Descripción completa

Detalles Bibliográficos
Autores principales: Kelebercová, Lívia, Munk, Michal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Author(s). Published by Elsevier B.V. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9578928/
https://www.ncbi.nlm.nih.gov/pubmed/36275392
http://dx.doi.org/10.1016/j.procs.2022.09.320
_version_ 1784812070650249216
author Kelebercová, Lívia
Munk, Michal
author_facet Kelebercová, Lívia
Munk, Michal
author_sort Kelebercová, Lívia
collection PubMed
description BACKGROUND: Pandemic COVID-19 caused an infodemic – massive spread of true and fake information about novel coronavirus. This study aims to present the possibility of using Keyword Extraction as a tool to obtain the most trending search queries related to COVID-19 and analyze the possibility of including their search volume in models for the prediction of fake news. METHODS: The study used Python implementation of the machine learning-based technique KeyBERT to extract keywords from true and fake news. These keywords were used in the next step to obtain related search queries with Google Trends API. RESULTS: Non-parametric Spearman Rank Order Correlation was identified as a statistically positive correlation (p < 0.001) between the occurrence of false news and top query / rising query metrics provided by Google Trends of queries related to extracted keywords pandemic, HIV, lockdown, plague, Michigan, and protest, which proves that search volume can identify fake news. CONCLUSIONS: Experiments done in this research proved that Keyword Extraction from false news is useful for obtaining related search queries and the top query and rising query metrics can be used to increase the accuracy of fake news prediction models.
format Online
Article
Text
id pubmed-9578928
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher The Author(s). Published by Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-95789282022-10-19 Search queries related to COVID-19 based on keyword extraction Kelebercová, Lívia Munk, Michal Procedia Comput Sci Article BACKGROUND: Pandemic COVID-19 caused an infodemic – massive spread of true and fake information about novel coronavirus. This study aims to present the possibility of using Keyword Extraction as a tool to obtain the most trending search queries related to COVID-19 and analyze the possibility of including their search volume in models for the prediction of fake news. METHODS: The study used Python implementation of the machine learning-based technique KeyBERT to extract keywords from true and fake news. These keywords were used in the next step to obtain related search queries with Google Trends API. RESULTS: Non-parametric Spearman Rank Order Correlation was identified as a statistically positive correlation (p < 0.001) between the occurrence of false news and top query / rising query metrics provided by Google Trends of queries related to extracted keywords pandemic, HIV, lockdown, plague, Michigan, and protest, which proves that search volume can identify fake news. CONCLUSIONS: Experiments done in this research proved that Keyword Extraction from false news is useful for obtaining related search queries and the top query and rising query metrics can be used to increase the accuracy of fake news prediction models. The Author(s). Published by Elsevier B.V. 2022 2022-10-19 /pmc/articles/PMC9578928/ /pubmed/36275392 http://dx.doi.org/10.1016/j.procs.2022.09.320 Text en © 2022 The Author(s). Published by Elsevier B.V. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Kelebercová, Lívia
Munk, Michal
Search queries related to COVID-19 based on keyword extraction
title Search queries related to COVID-19 based on keyword extraction
title_full Search queries related to COVID-19 based on keyword extraction
title_fullStr Search queries related to COVID-19 based on keyword extraction
title_full_unstemmed Search queries related to COVID-19 based on keyword extraction
title_short Search queries related to COVID-19 based on keyword extraction
title_sort search queries related to covid-19 based on keyword extraction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9578928/
https://www.ncbi.nlm.nih.gov/pubmed/36275392
http://dx.doi.org/10.1016/j.procs.2022.09.320
work_keys_str_mv AT kelebercovalivia searchqueriesrelatedtocovid19basedonkeywordextraction
AT munkmichal searchqueriesrelatedtocovid19basedonkeywordextraction