Cargando…
Unzipping Zipf’s law
In spite of decades of theorizing, the origins of Zipf’s law remain elusive. I propose that a Zipfian distribution straightforwardly follows from the interaction of syntax (word classes differing in class size) and semantics (words having to be sufficiently specific to be distinctive and sufficientl...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5549924/ https://www.ncbi.nlm.nih.gov/pubmed/28792963 http://dx.doi.org/10.1371/journal.pone.0181987 |
_version_ | 1783256047357526016 |
---|---|
author | Lestrade, Sander |
author_facet | Lestrade, Sander |
author_sort | Lestrade, Sander |
collection | PubMed |
description | In spite of decades of theorizing, the origins of Zipf’s law remain elusive. I propose that a Zipfian distribution straightforwardly follows from the interaction of syntax (word classes differing in class size) and semantics (words having to be sufficiently specific to be distinctive and sufficiently general to be reusable). These factors are independently motivated and well-established ingredients of a natural-language system. Using a computational model, it is shown that neither of these ingredients suffices to produce a Zipfian distribution on its own and that the results deviate from the Zipfian ideal only in the same way as natural language itself does. |
format | Online Article Text |
id | pubmed-5549924 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-55499242017-08-15 Unzipping Zipf’s law Lestrade, Sander PLoS One Research Article In spite of decades of theorizing, the origins of Zipf’s law remain elusive. I propose that a Zipfian distribution straightforwardly follows from the interaction of syntax (word classes differing in class size) and semantics (words having to be sufficiently specific to be distinctive and sufficiently general to be reusable). These factors are independently motivated and well-established ingredients of a natural-language system. Using a computational model, it is shown that neither of these ingredients suffices to produce a Zipfian distribution on its own and that the results deviate from the Zipfian ideal only in the same way as natural language itself does. Public Library of Science 2017-08-09 /pmc/articles/PMC5549924/ /pubmed/28792963 http://dx.doi.org/10.1371/journal.pone.0181987 Text en © 2017 Sander Lestrade http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Lestrade, Sander Unzipping Zipf’s law |
title | Unzipping Zipf’s law |
title_full | Unzipping Zipf’s law |
title_fullStr | Unzipping Zipf’s law |
title_full_unstemmed | Unzipping Zipf’s law |
title_short | Unzipping Zipf’s law |
title_sort | unzipping zipf’s law |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5549924/ https://www.ncbi.nlm.nih.gov/pubmed/28792963 http://dx.doi.org/10.1371/journal.pone.0181987 |
work_keys_str_mv | AT lestradesander unzippingzipfslaw |