Contact-tracing in cultural evolution: a Bayesian mixture model to detect geographic areas of language contact
When speakers of different languages interact, they are likely to influence each other: contact leaves traces in the linguistic record, which in turn can reveal geographical areas of past human interaction and migration. However, other factors may contribute to similarities between languages. Inheri...
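Main authors: | Ranacher, Peter; Neureiter, Nico; van Gijn, Rik; Sonnenhauser, Barbara; Escher, Anastasia; Weibel, Robert; Muysken, Pieter; Bickel, Balthasar |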
---|---|
Format: | Online Article Text |
Language: | English |
Published: | The Royal Society, 2021 |
Subjects: | Life Sciences–Mathematics interface |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8355670/ https://www.ncbi.nlm.nih.gov/pubmed/34376092 http://dx.doi.org/10.1098/rsif.2020.1031 |
_version_ | 1783736806840205312 |
---|---|
author | Ranacher, Peter; Neureiter, Nico; van Gijn, Rik; Sonnenhauser, Barbara; Escher, Anastasia; Weibel, Robert; Muysken, Pieter; Bickel, Balthasar |
collection | PubMed |
description | When speakers of different languages interact, they are likely to influence each other: contact leaves traces in the linguistic record, which in turn can reveal geographical areas of past human interaction and migration. However, other factors may contribute to similarities between languages. Inheritance from a shared ancestral language and universal preference for a linguistic property may both overshadow contact signals. How can we find geographical contact areas in language data, while accounting for the confounding effects of inheritance and universal preference? We present sBayes, an algorithm for Bayesian clustering in the presence of confounding effects. The algorithm learns which similarities are better explained by confounders, and which are due to contact effects. Contact areas are free to take any shape or size, but an explicit geographical prior ensures their spatial coherence. We test sBayes on simulated data and apply it in two case studies to reveal language contact in South America and the Balkans. Our results are supported by findings from previous studies. While we focus on detecting language contact, the method can also be used to uncover other traces of shared history in cultural evolution, and more generally, to reveal latent spatial clusters in the presence of confounders. |
format | Online Article Text |
id | pubmed-8355670 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | The Royal Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-8355670; 2021-08-11; J R Soc Interface; Life Sciences–Mathematics interface; The Royal Society; http://dx.doi.org/10.1098/rsif.2020.1031; © 2021 The Authors. Published by the Royal Society under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, provided the original author and source are credited. |
title | Contact-tracing in cultural evolution: a Bayesian mixture model to detect geographic areas of language contact |
topic | Life Sciences–Mathematics interface |
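
The description in this record frames each similarity between languages as explained by universal preference, by inheritance, or by contact. The record does not give the model's equations, so the following is only a minimal sketch of a generic three-component mixture for a single binary feature, not the sBayes implementation or its API. All function names, parameter names, and numeric values are hypothetical, and the weights are fixed by hand here rather than learned.

```python
# Illustrative three-component mixture for one binary feature.
# Hypothetical names and values; this is not the sBayes code base.

COMPONENTS = ("universal", "inheritance", "contact")


def component_likelihoods(present, probs):
    """Likelihood of the observation under each component on its own.

    probs[k] is the assumed probability that the feature is present
    according to component k.
    """
    return {k: (probs[k] if present else 1.0 - probs[k]) for k in COMPONENTS}


def mixture_likelihood(present, probs, weights):
    """Weighted sum of the component likelihoods."""
    if abs(sum(weights[k] for k in COMPONENTS) - 1.0) > 1e-9:
        raise ValueError("mixture weights must sum to 1")
    liks = component_likelihoods(present, probs)
    return sum(weights[k] * liks[k] for k in COMPONENTS)


def responsibilities(present, probs, weights):
    """Posterior share of each component in explaining the observation."""
    total = mixture_likelihood(present, probs, weights)
    if total == 0.0:
        raise ValueError("observation has zero likelihood under the mixture")
    liks = component_likelihoods(present, probs)
    return {k: weights[k] * liks[k] / total for k in COMPONENTS}


# Example: a feature that is rare in the language's family but common
# in the hypothesised contact area (made-up numbers).
example_probs = {"universal": 0.5, "inheritance": 0.2, "contact": 0.9}
example_weights = {"universal": 0.4, "inheritance": 0.3, "contact": 0.3}
example_shares = responsibilities(True, example_probs, example_weights)
```

In this toy setting, a component receives a larger share when it both carries weight and predicts the observed value well. That is the sense in which a similarity can be "better explained" by a confounder or by contact.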