Cargando…

Universal Entropy of Word Ordering Across Linguistic Families

BACKGROUND: The language faculty is probably the most distinctive feature of our species, and endows us with a unique ability to exchange highly structured information. In written language, information is encoded by the concatenation of basic symbols under grammatical and semantic constraints. As is...

Descripción completa

Detalles Bibliográficos
Autores principales: Montemurro, Marcelo A., Zanette, Damián H.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3094390/
https://www.ncbi.nlm.nih.gov/pubmed/21603637
http://dx.doi.org/10.1371/journal.pone.0019875
_version_ 1782203559308165120
author Montemurro, Marcelo A.
Zanette, Damián H.
author_facet Montemurro, Marcelo A.
Zanette, Damián H.
author_sort Montemurro, Marcelo A.
collection PubMed
description BACKGROUND: The language faculty is probably the most distinctive feature of our species, and endows us with a unique ability to exchange highly structured information. In written language, information is encoded by the concatenation of basic symbols under grammatical and semantic constraints. As is also the case in other natural information carriers, the resulting symbolic sequences show a delicate balance between order and disorder. That balance is determined by the interplay between the diversity of symbols and by their specific ordering in the sequences. Here we used entropy to quantify the contribution of different organizational levels to the overall statistical structure of language. METHODOLOGY/PRINCIPAL FINDINGS: We computed a relative entropy measure to quantify the degree of ordering in word sequences from languages belonging to several linguistic families. While a direct estimation of the overall entropy of language yielded values that varied for the different families considered, the relative entropy quantifying word ordering presented an almost constant value for all those families. CONCLUSIONS/SIGNIFICANCE: Our results indicate that despite the differences in the structure and vocabulary of the languages analyzed, the impact of word ordering in the structure of language is a statistical linguistic universal.
format Text
id pubmed-3094390
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-30943902011-05-19 Universal Entropy of Word Ordering Across Linguistic Families Montemurro, Marcelo A. Zanette, Damián H. PLoS One Research Article BACKGROUND: The language faculty is probably the most distinctive feature of our species, and endows us with a unique ability to exchange highly structured information. In written language, information is encoded by the concatenation of basic symbols under grammatical and semantic constraints. As is also the case in other natural information carriers, the resulting symbolic sequences show a delicate balance between order and disorder. That balance is determined by the interplay between the diversity of symbols and by their specific ordering in the sequences. Here we used entropy to quantify the contribution of different organizational levels to the overall statistical structure of language. METHODOLOGY/PRINCIPAL FINDINGS: We computed a relative entropy measure to quantify the degree of ordering in word sequences from languages belonging to several linguistic families. While a direct estimation of the overall entropy of language yielded values that varied for the different families considered, the relative entropy quantifying word ordering presented an almost constant value for all those families. CONCLUSIONS/SIGNIFICANCE: Our results indicate that despite the differences in the structure and vocabulary of the languages analyzed, the impact of word ordering in the structure of language is a statistical linguistic universal. Public Library of Science 2011-05-13 /pmc/articles/PMC3094390/ /pubmed/21603637 http://dx.doi.org/10.1371/journal.pone.0019875 Text en Montemurro, Zanette. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Montemurro, Marcelo A.
Zanette, Damián H.
Universal Entropy of Word Ordering Across Linguistic Families
title Universal Entropy of Word Ordering Across Linguistic Families
title_full Universal Entropy of Word Ordering Across Linguistic Families
title_fullStr Universal Entropy of Word Ordering Across Linguistic Families
title_full_unstemmed Universal Entropy of Word Ordering Across Linguistic Families
title_short Universal Entropy of Word Ordering Across Linguistic Families
title_sort universal entropy of word ordering across linguistic families
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3094390/
https://www.ncbi.nlm.nih.gov/pubmed/21603637
http://dx.doi.org/10.1371/journal.pone.0019875
work_keys_str_mv AT montemurromarceloa universalentropyofwordorderingacrosslinguisticfamilies
AT zanettedamianh universalentropyofwordorderingacrosslinguisticfamilies