Cargando…

CHECKED: Chinese COVID-19 fake news dataset

COVID-19 has impacted all lives. To maintain social distancing and avoiding exposure, works and lives have gradually moved online. Under this trend, social media usage to obtain COVID-19 news has increased. Also, misinformation on COVID-19 is frequently spread on social media. In this work, we devel...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Chen, Zhou, Xinyi, Zafarani, Reza
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Vienna 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8217979/
https://www.ncbi.nlm.nih.gov/pubmed/34178179
http://dx.doi.org/10.1007/s13278-021-00766-8
_version_ 1783710699980062720
author Yang, Chen
Zhou, Xinyi
Zafarani, Reza
author_facet Yang, Chen
Zhou, Xinyi
Zafarani, Reza
author_sort Yang, Chen
collection PubMed
description COVID-19 has impacted all lives. To maintain social distancing and avoiding exposure, works and lives have gradually moved online. Under this trend, social media usage to obtain COVID-19 news has increased. Also, misinformation on COVID-19 is frequently spread on social media. In this work, we develop CHECKED, the first Chinese dataset on COVID-19 misinformation. CHECKED provides a total 2,104 verified microblogs related to COVID-19 from December 2019 to August 2020, identified by using a specific list of keywords. Correspondingly, CHECKED includes 1,868,175 reposts, 1,185,702 comments, and 56,852,736 likes that reveal how these verified microblogs are spread and reacted on Weibo. The dataset contains a rich set of multimedia information for each microblog including ground-truth label, textual, visual, temporal, and network information. Extensive experiments have been conducted to analyze CHECKED data and to provide benchmark results for well-established methods when predicting fake news using CHECKED. We hope that CHECKED can facilitate studies that target misinformation on coronavirus. The dataset is available at https://github.com/cyang03/CHECKED.
format Online
Article
Text
id pubmed-8217979
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer Vienna
record_format MEDLINE/PubMed
spelling pubmed-82179792021-06-23 CHECKED: Chinese COVID-19 fake news dataset Yang, Chen Zhou, Xinyi Zafarani, Reza Soc Netw Anal Min Original Article COVID-19 has impacted all lives. To maintain social distancing and avoiding exposure, works and lives have gradually moved online. Under this trend, social media usage to obtain COVID-19 news has increased. Also, misinformation on COVID-19 is frequently spread on social media. In this work, we develop CHECKED, the first Chinese dataset on COVID-19 misinformation. CHECKED provides a total 2,104 verified microblogs related to COVID-19 from December 2019 to August 2020, identified by using a specific list of keywords. Correspondingly, CHECKED includes 1,868,175 reposts, 1,185,702 comments, and 56,852,736 likes that reveal how these verified microblogs are spread and reacted on Weibo. The dataset contains a rich set of multimedia information for each microblog including ground-truth label, textual, visual, temporal, and network information. Extensive experiments have been conducted to analyze CHECKED data and to provide benchmark results for well-established methods when predicting fake news using CHECKED. We hope that CHECKED can facilitate studies that target misinformation on coronavirus. The dataset is available at https://github.com/cyang03/CHECKED. Springer Vienna 2021-06-22 2021 /pmc/articles/PMC8217979/ /pubmed/34178179 http://dx.doi.org/10.1007/s13278-021-00766-8 Text en © The Author(s), under exclusive licence to Springer-Verlag GmbH Austria, part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Article
Yang, Chen
Zhou, Xinyi
Zafarani, Reza
CHECKED: Chinese COVID-19 fake news dataset
title CHECKED: Chinese COVID-19 fake news dataset
title_full CHECKED: Chinese COVID-19 fake news dataset
title_fullStr CHECKED: Chinese COVID-19 fake news dataset
title_full_unstemmed CHECKED: Chinese COVID-19 fake news dataset
title_short CHECKED: Chinese COVID-19 fake news dataset
title_sort checked: chinese covid-19 fake news dataset
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8217979/
https://www.ncbi.nlm.nih.gov/pubmed/34178179
http://dx.doi.org/10.1007/s13278-021-00766-8
work_keys_str_mv AT yangchen checkedchinesecovid19fakenewsdataset
AT zhouxinyi checkedchinesecovid19fakenewsdataset
AT zafaranireza checkedchinesecovid19fakenewsdataset