Cargando…
The statistical trade-off between word order and word structure – Large-scale evidence for the principle of least effort
Languages employ different strategies to transmit structural and grammatical information. While, for example, grammatical dependency relationships in sentences are mainly conveyed by the ordering of the words for languages like Mandarin Chinese, or Vietnamese, the word ordering is much less restrict...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5345836/ https://www.ncbi.nlm.nih.gov/pubmed/28282435 http://dx.doi.org/10.1371/journal.pone.0173614 |
_version_ | 1782513790688952320 |
---|---|
author | Koplenig, Alexander Meyer, Peter Wolfer, Sascha Müller-Spitzer, Carolin |
author_facet | Koplenig, Alexander Meyer, Peter Wolfer, Sascha Müller-Spitzer, Carolin |
author_sort | Koplenig, Alexander |
collection | PubMed |
description | Languages employ different strategies to transmit structural and grammatical information. While, for example, grammatical dependency relationships in sentences are mainly conveyed by the ordering of the words for languages like Mandarin Chinese, or Vietnamese, the word ordering is much less restricted for languages such as Inupiatun or Quechua, as these languages (also) use the internal structure of words (e.g. inflectional morphology) to mark grammatical relationships in a sentence. Based on a quantitative analysis of more than 1,500 unique translations of different books of the Bible in almost 1,200 different languages that are spoken as a native language by approximately 6 billion people (more than 80% of the world population), we present large-scale evidence for a statistical trade-off between the amount of information conveyed by the ordering of words and the amount of information conveyed by internal word structure: languages that rely more strongly on word order information tend to rely less on word structure information and vice versa. Or put differently, if less information is carried within the word, more information has to be spread among words in order to communicate successfully. In addition, we find that–despite differences in the way information is expressed–there is also evidence for a trade-off between different books of the biblical canon that recurs with little variation across languages: the more informative the word order of the book, the less informative its word structure and vice versa. We argue that this might suggest that, on the one hand, languages encode information in very different (but efficient) ways. On the other hand, content-related and stylistic features are statistically encoded in very similar ways. |
format | Online Article Text |
id | pubmed-5345836 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-53458362017-03-30 The statistical trade-off between word order and word structure – Large-scale evidence for the principle of least effort Koplenig, Alexander Meyer, Peter Wolfer, Sascha Müller-Spitzer, Carolin PLoS One Research Article Languages employ different strategies to transmit structural and grammatical information. While, for example, grammatical dependency relationships in sentences are mainly conveyed by the ordering of the words for languages like Mandarin Chinese, or Vietnamese, the word ordering is much less restricted for languages such as Inupiatun or Quechua, as these languages (also) use the internal structure of words (e.g. inflectional morphology) to mark grammatical relationships in a sentence. Based on a quantitative analysis of more than 1,500 unique translations of different books of the Bible in almost 1,200 different languages that are spoken as a native language by approximately 6 billion people (more than 80% of the world population), we present large-scale evidence for a statistical trade-off between the amount of information conveyed by the ordering of words and the amount of information conveyed by internal word structure: languages that rely more strongly on word order information tend to rely less on word structure information and vice versa. Or put differently, if less information is carried within the word, more information has to be spread among words in order to communicate successfully. In addition, we find that–despite differences in the way information is expressed–there is also evidence for a trade-off between different books of the biblical canon that recurs with little variation across languages: the more informative the word order of the book, the less informative its word structure and vice versa. We argue that this might suggest that, on the one hand, languages encode information in very different (but efficient) ways. On the other hand, content-related and stylistic features are statistically encoded in very similar ways. Public Library of Science 2017-03-10 /pmc/articles/PMC5345836/ /pubmed/28282435 http://dx.doi.org/10.1371/journal.pone.0173614 Text en © 2017 Koplenig et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Koplenig, Alexander Meyer, Peter Wolfer, Sascha Müller-Spitzer, Carolin The statistical trade-off between word order and word structure – Large-scale evidence for the principle of least effort |
title | The statistical trade-off between word order and word structure – Large-scale evidence for the principle of least effort |
title_full | The statistical trade-off between word order and word structure – Large-scale evidence for the principle of least effort |
title_fullStr | The statistical trade-off between word order and word structure – Large-scale evidence for the principle of least effort |
title_full_unstemmed | The statistical trade-off between word order and word structure – Large-scale evidence for the principle of least effort |
title_short | The statistical trade-off between word order and word structure – Large-scale evidence for the principle of least effort |
title_sort | statistical trade-off between word order and word structure – large-scale evidence for the principle of least effort |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5345836/ https://www.ncbi.nlm.nih.gov/pubmed/28282435 http://dx.doi.org/10.1371/journal.pone.0173614 |
work_keys_str_mv | AT koplenigalexander thestatisticaltradeoffbetweenwordorderandwordstructurelargescaleevidencefortheprincipleofleasteffort AT meyerpeter thestatisticaltradeoffbetweenwordorderandwordstructurelargescaleevidencefortheprincipleofleasteffort AT wolfersascha thestatisticaltradeoffbetweenwordorderandwordstructurelargescaleevidencefortheprincipleofleasteffort AT mullerspitzercarolin thestatisticaltradeoffbetweenwordorderandwordstructurelargescaleevidencefortheprincipleofleasteffort AT koplenigalexander statisticaltradeoffbetweenwordorderandwordstructurelargescaleevidencefortheprincipleofleasteffort AT meyerpeter statisticaltradeoffbetweenwordorderandwordstructurelargescaleevidencefortheprincipleofleasteffort AT wolfersascha statisticaltradeoffbetweenwordorderandwordstructurelargescaleevidencefortheprincipleofleasteffort AT mullerspitzercarolin statisticaltradeoffbetweenwordorderandwordstructurelargescaleevidencefortheprincipleofleasteffort |