Cargando…
Search queries related to COVID-19 based on keyword extraction
BACKGROUND: Pandemic COVID-19 caused an infodemic – massive spread of true and fake information about novel coronavirus. This study aims to present the possibility of using Keyword Extraction as a tool to obtain the most trending search queries related to COVID-19 and analyze the possibility of incl...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Author(s). Published by Elsevier B.V.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9578928/ https://www.ncbi.nlm.nih.gov/pubmed/36275392 http://dx.doi.org/10.1016/j.procs.2022.09.320 |
_version_ | 1784812070650249216 |
---|---|
author | Kelebercová, Lívia Munk, Michal |
author_facet | Kelebercová, Lívia Munk, Michal |
author_sort | Kelebercová, Lívia |
collection | PubMed |
description | BACKGROUND: Pandemic COVID-19 caused an infodemic – massive spread of true and fake information about novel coronavirus. This study aims to present the possibility of using Keyword Extraction as a tool to obtain the most trending search queries related to COVID-19 and analyze the possibility of including their search volume in models for the prediction of fake news. METHODS: The study used Python implementation of the machine learning-based technique KeyBERT to extract keywords from true and fake news. These keywords were used in the next step to obtain related search queries with Google Trends API. RESULTS: Non-parametric Spearman Rank Order Correlation was identified as a statistically positive correlation (p < 0.001) between the occurrence of false news and top query / rising query metrics provided by Google Trends of queries related to extracted keywords pandemic, HIV, lockdown, plague, Michigan, and protest, which proves that search volume can identify fake news. CONCLUSIONS: Experiments done in this research proved that Keyword Extraction from false news is useful for obtaining related search queries and the top query and rising query metrics can be used to increase the accuracy of fake news prediction models. |
format | Online Article Text |
id | pubmed-9578928 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | The Author(s). Published by Elsevier B.V. |
record_format | MEDLINE/PubMed |
spelling | pubmed-95789282022-10-19 Search queries related to COVID-19 based on keyword extraction Kelebercová, Lívia Munk, Michal Procedia Comput Sci Article BACKGROUND: Pandemic COVID-19 caused an infodemic – massive spread of true and fake information about novel coronavirus. This study aims to present the possibility of using Keyword Extraction as a tool to obtain the most trending search queries related to COVID-19 and analyze the possibility of including their search volume in models for the prediction of fake news. METHODS: The study used Python implementation of the machine learning-based technique KeyBERT to extract keywords from true and fake news. These keywords were used in the next step to obtain related search queries with Google Trends API. RESULTS: Non-parametric Spearman Rank Order Correlation was identified as a statistically positive correlation (p < 0.001) between the occurrence of false news and top query / rising query metrics provided by Google Trends of queries related to extracted keywords pandemic, HIV, lockdown, plague, Michigan, and protest, which proves that search volume can identify fake news. CONCLUSIONS: Experiments done in this research proved that Keyword Extraction from false news is useful for obtaining related search queries and the top query and rising query metrics can be used to increase the accuracy of fake news prediction models. The Author(s). Published by Elsevier B.V. 2022 2022-10-19 /pmc/articles/PMC9578928/ /pubmed/36275392 http://dx.doi.org/10.1016/j.procs.2022.09.320 Text en © 2022 The Author(s). Published by Elsevier B.V. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Kelebercová, Lívia Munk, Michal Search queries related to COVID-19 based on keyword extraction |
title | Search queries related to COVID-19 based on keyword extraction |
title_full | Search queries related to COVID-19 based on keyword extraction |
title_fullStr | Search queries related to COVID-19 based on keyword extraction |
title_full_unstemmed | Search queries related to COVID-19 based on keyword extraction |
title_short | Search queries related to COVID-19 based on keyword extraction |
title_sort | search queries related to covid-19 based on keyword extraction |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9578928/ https://www.ncbi.nlm.nih.gov/pubmed/36275392 http://dx.doi.org/10.1016/j.procs.2022.09.320 |
work_keys_str_mv | AT kelebercovalivia searchqueriesrelatedtocovid19basedonkeywordextraction AT munkmichal searchqueriesrelatedtocovid19basedonkeywordextraction |