Cargando…
A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration
As the COVID-19 pandemic continues to spread worldwide, an unprecedented amount of open data is being generated for medical, genetics, and epidemiological research. The unparalleled rate at which many research groups around the world are releasing data and publications on the ongoing pandemic is all...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9620940/ https://www.ncbi.nlm.nih.gov/pubmed/36417228 http://dx.doi.org/10.3390/epidemiologia2030024 |
_version_ | 1784821429845360640 |
---|---|
author | Banda, Juan M. Tekumalla, Ramya Wang, Guanyu Yu, Jingyuan Liu, Tuo Ding, Yuning Artemova, Ekaterina Tutubalina, Elena Chowell, Gerardo |
author_facet | Banda, Juan M. Tekumalla, Ramya Wang, Guanyu Yu, Jingyuan Liu, Tuo Ding, Yuning Artemova, Ekaterina Tutubalina, Elena Chowell, Gerardo |
author_sort | Banda, Juan M. |
collection | PubMed |
description | As the COVID-19 pandemic continues to spread worldwide, an unprecedented amount of open data is being generated for medical, genetics, and epidemiological research. The unparalleled rate at which many research groups around the world are releasing data and publications on the ongoing pandemic is allowing other scientists to learn from local experiences and data generated on the front lines of the COVID-19 pandemic. However, there is a need to integrate additional data sources that map and measure the role of social dynamics of such a unique worldwide event in biomedical, biological, and epidemiological analyses. For this purpose, we present a large-scale curated dataset of over 1.12 billion tweets, growing daily, related to COVID-19 chatter generated from 1 January 2020 to 27 June 2021 at the time of writing. This data source provides a freely available additional data source for researchers worldwide to conduct a wide and diverse number of research projects, such as epidemiological analyses, emotional and mental responses to social distancing measures, the identification of sources of misinformation, stratified measurement of sentiment towards the pandemic in near real time, among many others. |
format | Online Article Text |
id | pubmed-9620940 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-96209402022-11-18 A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration Banda, Juan M. Tekumalla, Ramya Wang, Guanyu Yu, Jingyuan Liu, Tuo Ding, Yuning Artemova, Ekaterina Tutubalina, Elena Chowell, Gerardo Epidemiologia (Basel) Data Descriptor As the COVID-19 pandemic continues to spread worldwide, an unprecedented amount of open data is being generated for medical, genetics, and epidemiological research. The unparalleled rate at which many research groups around the world are releasing data and publications on the ongoing pandemic is allowing other scientists to learn from local experiences and data generated on the front lines of the COVID-19 pandemic. However, there is a need to integrate additional data sources that map and measure the role of social dynamics of such a unique worldwide event in biomedical, biological, and epidemiological analyses. For this purpose, we present a large-scale curated dataset of over 1.12 billion tweets, growing daily, related to COVID-19 chatter generated from 1 January 2020 to 27 June 2021 at the time of writing. This data source provides a freely available additional data source for researchers worldwide to conduct a wide and diverse number of research projects, such as epidemiological analyses, emotional and mental responses to social distancing measures, the identification of sources of misinformation, stratified measurement of sentiment towards the pandemic in near real time, among many others. MDPI 2021-08-05 /pmc/articles/PMC9620940/ /pubmed/36417228 http://dx.doi.org/10.3390/epidemiologia2030024 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Data Descriptor Banda, Juan M. Tekumalla, Ramya Wang, Guanyu Yu, Jingyuan Liu, Tuo Ding, Yuning Artemova, Ekaterina Tutubalina, Elena Chowell, Gerardo A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration |
title | A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration |
title_full | A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration |
title_fullStr | A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration |
title_full_unstemmed | A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration |
title_short | A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration |
title_sort | large-scale covid-19 twitter chatter dataset for open scientific research—an international collaboration |
topic | Data Descriptor |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9620940/ https://www.ncbi.nlm.nih.gov/pubmed/36417228 http://dx.doi.org/10.3390/epidemiologia2030024 |
work_keys_str_mv | AT bandajuanm alargescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration AT tekumallaramya alargescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration AT wangguanyu alargescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration AT yujingyuan alargescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration AT liutuo alargescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration AT dingyuning alargescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration AT artemovaekaterina alargescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration AT tutubalinaelena alargescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration AT chowellgerardo alargescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration AT bandajuanm largescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration AT tekumallaramya largescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration AT wangguanyu largescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration AT yujingyuan largescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration AT liutuo largescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration AT dingyuning largescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration AT artemovaekaterina largescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration AT tutubalinaelena largescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration AT chowellgerardo largescalecovid19twitterchatterdatasetforopenscientificresearchaninternationalcollaboration |