Cargando…

The Twitter of Babel: Mapping World Languages through Microblogging Platforms

Large scale analysis and statistics of socio-technical systems that just a few short years ago would have required the use of consistent economic and human resources can nowadays be conveniently performed by mining the enormous amount of digital data produced by human activities. Although a characte...

Descripción completa

Detalles Bibliográficos
Autores principales: Mocanu, Delia, Baronchelli, Andrea, Perra, Nicola, Gonçalves, Bruno, Zhang, Qian, Vespignani, Alessandro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3630228/
https://www.ncbi.nlm.nih.gov/pubmed/23637940
http://dx.doi.org/10.1371/journal.pone.0061981
_version_ 1782266682667958272
author Mocanu, Delia
Baronchelli, Andrea
Perra, Nicola
Gonçalves, Bruno
Zhang, Qian
Vespignani, Alessandro
author_facet Mocanu, Delia
Baronchelli, Andrea
Perra, Nicola
Gonçalves, Bruno
Zhang, Qian
Vespignani, Alessandro
author_sort Mocanu, Delia
collection PubMed
description Large scale analysis and statistics of socio-technical systems that just a few short years ago would have required the use of consistent economic and human resources can nowadays be conveniently performed by mining the enormous amount of digital data produced by human activities. Although a characterization of several aspects of our societies is emerging from the data revolution, a number of questions concerning the reliability and the biases inherent to the big data “proxies” of social life are still open. Here, we survey worldwide linguistic indicators and trends through the analysis of a large-scale dataset of microblogging posts. We show that available data allow for the study of language geography at scales ranging from country-level aggregation to specific city neighborhoods. The high resolution and coverage of the data allows us to investigate different indicators such as the linguistic homogeneity of different countries, the touristic seasonal patterns within countries and the geographical distribution of different languages in multilingual regions. This work highlights the potential of geolocalized studies of open data sources to improve current analysis and develop indicators for major social phenomena in specific communities.
format Online
Article
Text
id pubmed-3630228
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-36302282013-05-01 The Twitter of Babel: Mapping World Languages through Microblogging Platforms Mocanu, Delia Baronchelli, Andrea Perra, Nicola Gonçalves, Bruno Zhang, Qian Vespignani, Alessandro PLoS One Research Article Large scale analysis and statistics of socio-technical systems that just a few short years ago would have required the use of consistent economic and human resources can nowadays be conveniently performed by mining the enormous amount of digital data produced by human activities. Although a characterization of several aspects of our societies is emerging from the data revolution, a number of questions concerning the reliability and the biases inherent to the big data “proxies” of social life are still open. Here, we survey worldwide linguistic indicators and trends through the analysis of a large-scale dataset of microblogging posts. We show that available data allow for the study of language geography at scales ranging from country-level aggregation to specific city neighborhoods. The high resolution and coverage of the data allows us to investigate different indicators such as the linguistic homogeneity of different countries, the touristic seasonal patterns within countries and the geographical distribution of different languages in multilingual regions. This work highlights the potential of geolocalized studies of open data sources to improve current analysis and develop indicators for major social phenomena in specific communities. Public Library of Science 2013-04-18 /pmc/articles/PMC3630228/ /pubmed/23637940 http://dx.doi.org/10.1371/journal.pone.0061981 Text en © 2013 Mocanu et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Mocanu, Delia
Baronchelli, Andrea
Perra, Nicola
Gonçalves, Bruno
Zhang, Qian
Vespignani, Alessandro
The Twitter of Babel: Mapping World Languages through Microblogging Platforms
title The Twitter of Babel: Mapping World Languages through Microblogging Platforms
title_full The Twitter of Babel: Mapping World Languages through Microblogging Platforms
title_fullStr The Twitter of Babel: Mapping World Languages through Microblogging Platforms
title_full_unstemmed The Twitter of Babel: Mapping World Languages through Microblogging Platforms
title_short The Twitter of Babel: Mapping World Languages through Microblogging Platforms
title_sort twitter of babel: mapping world languages through microblogging platforms
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3630228/
https://www.ncbi.nlm.nih.gov/pubmed/23637940
http://dx.doi.org/10.1371/journal.pone.0061981
work_keys_str_mv AT mocanudelia thetwitterofbabelmappingworldlanguagesthroughmicrobloggingplatforms
AT baronchelliandrea thetwitterofbabelmappingworldlanguagesthroughmicrobloggingplatforms
AT perranicola thetwitterofbabelmappingworldlanguagesthroughmicrobloggingplatforms
AT goncalvesbruno thetwitterofbabelmappingworldlanguagesthroughmicrobloggingplatforms
AT zhangqian thetwitterofbabelmappingworldlanguagesthroughmicrobloggingplatforms
AT vespignanialessandro thetwitterofbabelmappingworldlanguagesthroughmicrobloggingplatforms
AT mocanudelia twitterofbabelmappingworldlanguagesthroughmicrobloggingplatforms
AT baronchelliandrea twitterofbabelmappingworldlanguagesthroughmicrobloggingplatforms
AT perranicola twitterofbabelmappingworldlanguagesthroughmicrobloggingplatforms
AT goncalvesbruno twitterofbabelmappingworldlanguagesthroughmicrobloggingplatforms
AT zhangqian twitterofbabelmappingworldlanguagesthroughmicrobloggingplatforms
AT vespignanialessandro twitterofbabelmappingworldlanguagesthroughmicrobloggingplatforms