Cargando…

Geolocation of multiple sociolinguistic markers in Buenos Aires

Analysis of language geography is increasingly being used for studying spatial patterns of social dynamics. This trend is fueled by social media platforms such as Twitter which provide access to large amounts of natural language data combined with geolocation and user metadata enabling reconstructio...

Descripción completa

Detalles Bibliográficos
Autores principales: Kellert, Olga, Matlis, Nicholas H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9462814/
https://www.ncbi.nlm.nih.gov/pubmed/36084118
http://dx.doi.org/10.1371/journal.pone.0274114
_version_ 1784787274382180352
author Kellert, Olga
Matlis, Nicholas H.
author_facet Kellert, Olga
Matlis, Nicholas H.
author_sort Kellert, Olga
collection PubMed
description Analysis of language geography is increasingly being used for studying spatial patterns of social dynamics. This trend is fueled by social media platforms such as Twitter which provide access to large amounts of natural language data combined with geolocation and user metadata enabling reconstruction of detailed spatial patterns of language use. Most studies are performed on large spatial scales associated with countries and regions, where language dynamics are often dominated by the effects of geographic and administrative borders. Extending to smaller, urban scales, however, allows visualization of spatial patterns of language use determined by social dynamics within the city, providing valuable information for a range of social topics from demographic studies to urban planning. So far, few studies have been made in this domain, due, in part, to the challenges in developing algorithms that accurately classify linguistic features. Here we extend urban-scale geographical analysis of language use beyond lexical meaning to include other sociolinguistic markers that identify language style, dialect and social groups. Some features, which have not been explored with social-media data on the urban scale, can be used to target a range of social phenomena. Our study focuses on Twitter use in Buenos Aires and our approach classifies tweets based on contrasting sets of tokens manually selected to target precise linguistic features. We perform statistical analyses of eleven categories of language use to quantify the presence of spatial patterns and the extent to which they are socially driven. We then perform the first comparative analysis assessing how the patterns and strength of social drivers vary with category. Finally, we derive plausible explanations for the patterns by comparing them with independently generated maps of geosocial context. Identifying these connections is a key aspect of the social-dynamics analysis which has so far received insufficient attention.
format Online
Article
Text
id pubmed-9462814
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-94628142022-09-10 Geolocation of multiple sociolinguistic markers in Buenos Aires Kellert, Olga Matlis, Nicholas H. PLoS One Research Article Analysis of language geography is increasingly being used for studying spatial patterns of social dynamics. This trend is fueled by social media platforms such as Twitter which provide access to large amounts of natural language data combined with geolocation and user metadata enabling reconstruction of detailed spatial patterns of language use. Most studies are performed on large spatial scales associated with countries and regions, where language dynamics are often dominated by the effects of geographic and administrative borders. Extending to smaller, urban scales, however, allows visualization of spatial patterns of language use determined by social dynamics within the city, providing valuable information for a range of social topics from demographic studies to urban planning. So far, few studies have been made in this domain, due, in part, to the challenges in developing algorithms that accurately classify linguistic features. Here we extend urban-scale geographical analysis of language use beyond lexical meaning to include other sociolinguistic markers that identify language style, dialect and social groups. Some features, which have not been explored with social-media data on the urban scale, can be used to target a range of social phenomena. Our study focuses on Twitter use in Buenos Aires and our approach classifies tweets based on contrasting sets of tokens manually selected to target precise linguistic features. We perform statistical analyses of eleven categories of language use to quantify the presence of spatial patterns and the extent to which they are socially driven. We then perform the first comparative analysis assessing how the patterns and strength of social drivers vary with category. Finally, we derive plausible explanations for the patterns by comparing them with independently generated maps of geosocial context. Identifying these connections is a key aspect of the social-dynamics analysis which has so far received insufficient attention. Public Library of Science 2022-09-09 /pmc/articles/PMC9462814/ /pubmed/36084118 http://dx.doi.org/10.1371/journal.pone.0274114 Text en © 2022 Kellert, Matlis https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Kellert, Olga
Matlis, Nicholas H.
Geolocation of multiple sociolinguistic markers in Buenos Aires
title Geolocation of multiple sociolinguistic markers in Buenos Aires
title_full Geolocation of multiple sociolinguistic markers in Buenos Aires
title_fullStr Geolocation of multiple sociolinguistic markers in Buenos Aires
title_full_unstemmed Geolocation of multiple sociolinguistic markers in Buenos Aires
title_short Geolocation of multiple sociolinguistic markers in Buenos Aires
title_sort geolocation of multiple sociolinguistic markers in buenos aires
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9462814/
https://www.ncbi.nlm.nih.gov/pubmed/36084118
http://dx.doi.org/10.1371/journal.pone.0274114
work_keys_str_mv AT kellertolga geolocationofmultiplesociolinguisticmarkersinbuenosaires
AT matlisnicholash geolocationofmultiplesociolinguisticmarkersinbuenosaires