Cargando…

A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration

As the COVID-19 pandemic continues to spread worldwide, an unprecedented amount of open data is being generated for medical, genetics, and epidemiological research. The unparalleled rate at which many research groups around the world are releasing data and publications on the ongoing pandemic is all...

Descripción completa

Detalles Bibliográficos
Autores principales: Banda, Juan M., Tekumalla, Ramya, Wang, Guanyu, Yu, Jingyuan, Liu, Tuo, Ding, Yuning, Artemova, Ekaterina, Tutubalina, Elena, Chowell, Gerardo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9620940/
https://www.ncbi.nlm.nih.gov/pubmed/36417228
http://dx.doi.org/10.3390/epidemiologia2030024
_version_ 1784821429845360640
author Banda, Juan M.
Tekumalla, Ramya
Wang, Guanyu
Yu, Jingyuan
Liu, Tuo
Ding, Yuning
Artemova, Ekaterina
Tutubalina, Elena
Chowell, Gerardo
author_facet Banda, Juan M.
Tekumalla, Ramya
Wang, Guanyu
Yu, Jingyuan
Liu, Tuo
Ding, Yuning
Artemova, Ekaterina
Tutubalina, Elena
Chowell, Gerardo
author_sort Banda, Juan M.
collection PubMed
description As the COVID-19 pandemic continues to spread worldwide, an unprecedented amount of open data is being generated for medical, genetics, and epidemiological research. The unparalleled rate at which many research groups around the world are releasing data and publications on the ongoing pandemic is allowing other scientists to learn from local experiences and data generated on the front lines of the COVID-19 pandemic. However, there is a need to integrate additional data sources that map and measure the role of social dynamics of such a unique worldwide event in biomedical, biological, and epidemiological analyses. For this purpose, we present a large-scale curated dataset of over 1.12 billion tweets, growing daily, related to COVID-19 chatter generated from 1 January 2020 to 27 June 2021 at the time of writing. This data source provides a freely available additional data source for researchers worldwide to conduct a wide and diverse number of research projects, such as epidemiological analyses, emotional and mental responses to social distancing measures, the identification of sources of misinformation, stratified measurement of sentiment towards the pandemic in near real time, among many others.
format Online
Article
Text
id pubmed-9620940
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96209402022-11-18 A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration Banda, Juan M. Tekumalla, Ramya Wang, Guanyu Yu, Jingyuan Liu, Tuo Ding, Yuning Artemova, Ekaterina Tutubalina, Elena Chowell, Gerardo Epidemiologia (Basel) Data Descriptor As the COVID-19 pandemic continues to spread worldwide, an unprecedented amount of open data is being generated for medical, genetics, and epidemiological research. The unparalleled rate at which many research groups around the world are releasing data and publications on the ongoing pandemic is allowing other scientists to learn from local experiences and data generated on the front lines of the COVID-19 pandemic. However, there is a need to integrate additional data sources that map and measure the role of social dynamics of such a unique worldwide event in biomedical, biological, and epidemiological analyses. For this purpose, we present a large-scale curated dataset of over 1.12 billion tweets, growing daily, related to COVID-19 chatter generated from 1 January 2020 to 27 June 2021 at the time of writing. This data source provides a freely available additional data source for researchers worldwide to conduct a wide and diverse number of research projects, such as epidemiological analyses, emotional and mental responses to social distancing measures, the identification of sources of misinformation, stratified measurement of sentiment towards the pandemic in near real time, among many others. MDPI 2021-08-05 /pmc/articles/PMC9620940/ /pubmed/36417228 http://dx.doi.org/10.3390/epidemiologia2030024 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Data Descriptor
Banda, Juan M.
Tekumalla, Ramya
Wang, Guanyu
Yu, Jingyuan
Liu, Tuo
Ding, Yuning
Artemova, Ekaterina
Tutubalina, Elena
Chowell, Gerardo
A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration
title A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration
title_full A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration
title_fullStr A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration
title_full_unstemmed A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration
title_short A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration
title_sort large-scale covid-19 twitter chatter dataset for open scientific research—an international collaboration
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9620940/
https://www.ncbi.nlm.nih.gov/pubmed/36417228
http://dx.doi.org/10.3390/epidemiologia2030024
work_keys_str_mv AT bandajuanm alargescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration
AT tekumallaramya alargescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration
AT wangguanyu alargescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration
AT yujingyuan alargescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration
AT liutuo alargescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration
AT dingyuning alargescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration
AT artemovaekaterina alargescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration
AT tutubalinaelena alargescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration
AT chowellgerardo alargescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration
AT bandajuanm largescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration
AT tekumallaramya largescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration
AT wangguanyu largescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration
AT yujingyuan largescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration
AT liutuo largescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration
AT dingyuning largescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration
AT artemovaekaterina largescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration
AT tutubalinaelena largescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration
AT chowellgerardo largescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration