Cargando…

Can Menzerath’s law be a criterion of complexity in communication?

Menzerath’s law is a quantitative linguistic law which states that, on average, the longer is a linguistic construct, the shorter are its constituents. In contrast, Menzerath-Altmann’s law (MAL) is a precise mathematical power-law-exponential formula which expresses the expected length of the lingui...

Descripción completa

Detalles Bibliográficos
Autores principales: G. Torre, Iván, Dębowski, Łukasz, Hernández-Fernández, Antoni
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8378695/
https://www.ncbi.nlm.nih.gov/pubmed/34415939
http://dx.doi.org/10.1371/journal.pone.0256133
_version_ 1783740860628729856
author G. Torre, Iván
Dębowski, Łukasz
Hernández-Fernández, Antoni
author_facet G. Torre, Iván
Dębowski, Łukasz
Hernández-Fernández, Antoni
author_sort G. Torre, Iván
collection PubMed
description Menzerath’s law is a quantitative linguistic law which states that, on average, the longer is a linguistic construct, the shorter are its constituents. In contrast, Menzerath-Altmann’s law (MAL) is a precise mathematical power-law-exponential formula which expresses the expected length of the linguistic construct conditioned on the number of its constituents. In this paper, we investigate the anatomy of MAL for constructs being word tokens and constituents being syllables, measuring its length in graphemes. First, we derive the exact form of MAL for texts generated by the memoryless source with three emitted symbols, which can be interpreted as a monkey typing model or a null model. We show that this null model complies with Menzerath’s law, revealing that Menzerath’s law itself can hardly be a criterion of complexity in communication. This observation does not apply to the more precise Menzerath-Altmann’s law, which predicts an inverted regime for sufficiently range constructs, i.e., the longer is a word, the longer are its syllables. To support this claim, we analyze MAL on data from 21 languages, consisting of texts from the Standardized Project Gutenberg. We show the presence of the inverted regime, not exhibited by the null model, and we demonstrate robustness of our results. We also report the complicated distribution of syllable sizes with respect to their position in the word, which might be related with the emerging MAL. Altogether, our results indicate that Menzerath’s law—in terms of correlations—is a spurious observation, while complex patterns and efficiency dynamics should be rather attributed to specific forms of Menzerath-Altmann’s law.
format Online
Article
Text
id pubmed-8378695
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-83786952021-08-21 Can Menzerath’s law be a criterion of complexity in communication? G. Torre, Iván Dębowski, Łukasz Hernández-Fernández, Antoni PLoS One Research Article Menzerath’s law is a quantitative linguistic law which states that, on average, the longer is a linguistic construct, the shorter are its constituents. In contrast, Menzerath-Altmann’s law (MAL) is a precise mathematical power-law-exponential formula which expresses the expected length of the linguistic construct conditioned on the number of its constituents. In this paper, we investigate the anatomy of MAL for constructs being word tokens and constituents being syllables, measuring its length in graphemes. First, we derive the exact form of MAL for texts generated by the memoryless source with three emitted symbols, which can be interpreted as a monkey typing model or a null model. We show that this null model complies with Menzerath’s law, revealing that Menzerath’s law itself can hardly be a criterion of complexity in communication. This observation does not apply to the more precise Menzerath-Altmann’s law, which predicts an inverted regime for sufficiently range constructs, i.e., the longer is a word, the longer are its syllables. To support this claim, we analyze MAL on data from 21 languages, consisting of texts from the Standardized Project Gutenberg. We show the presence of the inverted regime, not exhibited by the null model, and we demonstrate robustness of our results. We also report the complicated distribution of syllable sizes with respect to their position in the word, which might be related with the emerging MAL. Altogether, our results indicate that Menzerath’s law—in terms of correlations—is a spurious observation, while complex patterns and efficiency dynamics should be rather attributed to specific forms of Menzerath-Altmann’s law. Public Library of Science 2021-08-20 /pmc/articles/PMC8378695/ /pubmed/34415939 http://dx.doi.org/10.1371/journal.pone.0256133 Text en © 2021 G. Torre et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
G. Torre, Iván
Dębowski, Łukasz
Hernández-Fernández, Antoni
Can Menzerath’s law be a criterion of complexity in communication?
title Can Menzerath’s law be a criterion of complexity in communication?
title_full Can Menzerath’s law be a criterion of complexity in communication?
title_fullStr Can Menzerath’s law be a criterion of complexity in communication?
title_full_unstemmed Can Menzerath’s law be a criterion of complexity in communication?
title_short Can Menzerath’s law be a criterion of complexity in communication?
title_sort can menzerath’s law be a criterion of complexity in communication?
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8378695/
https://www.ncbi.nlm.nih.gov/pubmed/34415939
http://dx.doi.org/10.1371/journal.pone.0256133
work_keys_str_mv AT gtorreivan canmenzerathslawbeacriterionofcomplexityincommunication
AT debowskiłukasz canmenzerathslawbeacriterionofcomplexityincommunication
AT hernandezfernandezantoni canmenzerathslawbeacriterionofcomplexityincommunication