Cargando…

A first public dataset from Brazilian twitter and news on COVID-19 in Portuguese

In this data article, we provide a collection of 3,925,366 tweets and 18,413 online news around the online discussion about COVID-19 in Brazil. The data from Twitter were collected through Twitterscraper Python library and we considered a set of keywords in Portuguese regarding to COVID-19. In order...

Descripción completa

Detalles Bibliográficos
Autores principales: de Melo, Tiago, Figueiredo, Carlos M.S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7434436/
https://www.ncbi.nlm.nih.gov/pubmed/32844106
http://dx.doi.org/10.1016/j.dib.2020.106179
_version_ 1783572147044614144
author de Melo, Tiago
Figueiredo, Carlos M.S.
author_facet de Melo, Tiago
Figueiredo, Carlos M.S.
author_sort de Melo, Tiago
collection PubMed
description In this data article, we provide a collection of 3,925,366 tweets and 18,413 online news around the online discussion about COVID-19 in Brazil. The data from Twitter were collected through Twitterscraper Python library and we considered a set of keywords in Portuguese regarding to COVID-19. In order to facilitate the identification of tweets that have hashtags, media and retweets for researchers or data enthusiasts, we created three specific datasets for each of these categories. The news on COVID-19 was collected from the UOL portal, the most popular Brazilian website. All the data were gathered from January to May, 2020. These datasets can attract the attention from communities such as data science, social science, natural language processing, tourism, infodemiology, and public health.
format Online
Article
Text
id pubmed-7434436
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-74344362020-08-19 A first public dataset from Brazilian twitter and news on COVID-19 in Portuguese de Melo, Tiago Figueiredo, Carlos M.S. Data Brief Social Science In this data article, we provide a collection of 3,925,366 tweets and 18,413 online news around the online discussion about COVID-19 in Brazil. The data from Twitter were collected through Twitterscraper Python library and we considered a set of keywords in Portuguese regarding to COVID-19. In order to facilitate the identification of tweets that have hashtags, media and retweets for researchers or data enthusiasts, we created three specific datasets for each of these categories. The news on COVID-19 was collected from the UOL portal, the most popular Brazilian website. All the data were gathered from January to May, 2020. These datasets can attract the attention from communities such as data science, social science, natural language processing, tourism, infodemiology, and public health. Elsevier 2020-08-18 /pmc/articles/PMC7434436/ /pubmed/32844106 http://dx.doi.org/10.1016/j.dib.2020.106179 Text en © 2020 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Social Science
de Melo, Tiago
Figueiredo, Carlos M.S.
A first public dataset from Brazilian twitter and news on COVID-19 in Portuguese
title A first public dataset from Brazilian twitter and news on COVID-19 in Portuguese
title_full A first public dataset from Brazilian twitter and news on COVID-19 in Portuguese
title_fullStr A first public dataset from Brazilian twitter and news on COVID-19 in Portuguese
title_full_unstemmed A first public dataset from Brazilian twitter and news on COVID-19 in Portuguese
title_short A first public dataset from Brazilian twitter and news on COVID-19 in Portuguese
title_sort first public dataset from brazilian twitter and news on covid-19 in portuguese
topic Social Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7434436/
https://www.ncbi.nlm.nih.gov/pubmed/32844106
http://dx.doi.org/10.1016/j.dib.2020.106179
work_keys_str_mv AT demelotiago afirstpublicdatasetfrombraziliantwitterandnewsoncovid19inportuguese
AT figueiredocarlosms afirstpublicdatasetfrombraziliantwitterandnewsoncovid19inportuguese
AT demelotiago firstpublicdatasetfrombraziliantwitterandnewsoncovid19inportuguese
AT figueiredocarlosms firstpublicdatasetfrombraziliantwitterandnewsoncovid19inportuguese