Cargando…

Contact-tracing in cultural evolution: a Bayesian mixture model to detect geographic areas of language contact

When speakers of different languages interact, they are likely to influence each other: contact leaves traces in the linguistic record, which in turn can reveal geographical areas of past human interaction and migration. However, other factors may contribute to similarities between languages. Inheri...

Descripción completa

Detalles Bibliográficos
Autores principales: Ranacher, Peter, Neureiter, Nico, van Gijn, Rik, Sonnenhauser, Barbara, Escher, Anastasia, Weibel, Robert, Muysken, Pieter, Bickel, Balthasar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8355670/
https://www.ncbi.nlm.nih.gov/pubmed/34376092
http://dx.doi.org/10.1098/rsif.2020.1031
_version_ 1783736806840205312
author Ranacher, Peter
Neureiter, Nico
van Gijn, Rik
Sonnenhauser, Barbara
Escher, Anastasia
Weibel, Robert
Muysken, Pieter
Bickel, Balthasar
author_facet Ranacher, Peter
Neureiter, Nico
van Gijn, Rik
Sonnenhauser, Barbara
Escher, Anastasia
Weibel, Robert
Muysken, Pieter
Bickel, Balthasar
author_sort Ranacher, Peter
collection PubMed
description When speakers of different languages interact, they are likely to influence each other: contact leaves traces in the linguistic record, which in turn can reveal geographical areas of past human interaction and migration. However, other factors may contribute to similarities between languages. Inheritance from a shared ancestral language and universal preference for a linguistic property may both overshadow contact signals. How can we find geographical contact areas in language data, while accounting for the confounding effects of inheritance and universal preference? We present sBayes, an algorithm for Bayesian clustering in the presence of confounding effects. The algorithm learns which similarities are better explained by confounders, and which are due to contact effects. Contact areas are free to take any shape or size, but an explicit geographical prior ensures their spatial coherence. We test sBayes on simulated data and apply it in two case studies to reveal language contact in South America and the Balkans. Our results are supported by findings from previous studies. While we focus on detecting language contact, the method can also be used to uncover other traces of shared history in cultural evolution, and more generally, to reveal latent spatial clusters in the presence of confounders.
format Online
Article
Text
id pubmed-8355670
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher The Royal Society
record_format MEDLINE/PubMed
spelling pubmed-83556702021-08-11 Contact-tracing in cultural evolution: a Bayesian mixture model to detect geographic areas of language contact Ranacher, Peter Neureiter, Nico van Gijn, Rik Sonnenhauser, Barbara Escher, Anastasia Weibel, Robert Muysken, Pieter Bickel, Balthasar J R Soc Interface Life Sciences–Mathematics interface When speakers of different languages interact, they are likely to influence each other: contact leaves traces in the linguistic record, which in turn can reveal geographical areas of past human interaction and migration. However, other factors may contribute to similarities between languages. Inheritance from a shared ancestral language and universal preference for a linguistic property may both overshadow contact signals. How can we find geographical contact areas in language data, while accounting for the confounding effects of inheritance and universal preference? We present sBayes, an algorithm for Bayesian clustering in the presence of confounding effects. The algorithm learns which similarities are better explained by confounders, and which are due to contact effects. Contact areas are free to take any shape or size, but an explicit geographical prior ensures their spatial coherence. We test sBayes on simulated data and apply it in two case studies to reveal language contact in South America and the Balkans. Our results are supported by findings from previous studies. While we focus on detecting language contact, the method can also be used to uncover other traces of shared history in cultural evolution, and more generally, to reveal latent spatial clusters in the presence of confounders. The Royal Society 2021-08-11 /pmc/articles/PMC8355670/ /pubmed/34376092 http://dx.doi.org/10.1098/rsif.2020.1031 Text en © 2021 The Authors. https://creativecommons.org/licenses/by/4.0/Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, provided the original author and source are credited.
spellingShingle Life Sciences–Mathematics interface
Ranacher, Peter
Neureiter, Nico
van Gijn, Rik
Sonnenhauser, Barbara
Escher, Anastasia
Weibel, Robert
Muysken, Pieter
Bickel, Balthasar
Contact-tracing in cultural evolution: a Bayesian mixture model to detect geographic areas of language contact
title Contact-tracing in cultural evolution: a Bayesian mixture model to detect geographic areas of language contact
title_full Contact-tracing in cultural evolution: a Bayesian mixture model to detect geographic areas of language contact
title_fullStr Contact-tracing in cultural evolution: a Bayesian mixture model to detect geographic areas of language contact
title_full_unstemmed Contact-tracing in cultural evolution: a Bayesian mixture model to detect geographic areas of language contact
title_short Contact-tracing in cultural evolution: a Bayesian mixture model to detect geographic areas of language contact
title_sort contact-tracing in cultural evolution: a bayesian mixture model to detect geographic areas of language contact
topic Life Sciences–Mathematics interface
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8355670/
https://www.ncbi.nlm.nih.gov/pubmed/34376092
http://dx.doi.org/10.1098/rsif.2020.1031
work_keys_str_mv AT ranacherpeter contacttracinginculturalevolutionabayesianmixturemodeltodetectgeographicareasoflanguagecontact
AT neureiternico contacttracinginculturalevolutionabayesianmixturemodeltodetectgeographicareasoflanguagecontact
AT vangijnrik contacttracinginculturalevolutionabayesianmixturemodeltodetectgeographicareasoflanguagecontact
AT sonnenhauserbarbara contacttracinginculturalevolutionabayesianmixturemodeltodetectgeographicareasoflanguagecontact
AT escheranastasia contacttracinginculturalevolutionabayesianmixturemodeltodetectgeographicareasoflanguagecontact
AT weibelrobert contacttracinginculturalevolutionabayesianmixturemodeltodetectgeographicareasoflanguagecontact
AT muyskenpieter contacttracinginculturalevolutionabayesianmixturemodeltodetectgeographicareasoflanguagecontact
AT bickelbalthasar contacttracinginculturalevolutionabayesianmixturemodeltodetectgeographicareasoflanguagecontact