The common patterns of abundance: the log series and Zipf's law

In a language corpus, the probability that a word occurs n times is often proportional to 1/n^2. Assigning rank, s, to words according to their abundance, log s vs log n typically has a slope of minus one. That simple Zipf's law pattern also arises in the population sizes of cities, the size...
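
As a rough numerical check of the slope of minus one stated above, the following Python sketch assumes that p(n) is exactly proportional to 1/n^2 and that the rank s of an abundance n is proportional to the probability of abundance n or greater. The function names, the cutoff n_max, and the parameter theta are illustrative assumptions, not notation taken from the article. The log series named in the title is written in its standard form, p(n) proportional to theta^n / n.

```python
import math

def zipf_rank_slope(n_lo=10, n_hi=1000, n_max=10**6):
    """Estimate d(log s)/d(log n) when p(n) is proportional to 1/n^2.

    Assumption: the rank s of an abundance n is proportional to the
    tail sum of 1/k^2 over k >= n (truncated at n_max).
    """
    tail = 0.0
    tail_at = {}
    # Accumulate the tail sum from the largest abundance downward.
    for k in range(n_max, n_lo - 1, -1):
        tail += 1.0 / (k * k)
        if k in (n_lo, n_hi):
            tail_at[k] = tail
    rise = math.log(tail_at[n_hi]) - math.log(tail_at[n_lo])
    run = math.log(n_hi) - math.log(n_lo)
    return rise / run

def log_series_pmf(n, theta):
    """Log series probability of abundance n, for 0 < theta < 1."""
    return -(theta ** n) / (n * math.log(1.0 - theta))

if __name__ == "__main__":
    print("Zipf log-log slope:", zipf_rank_slope())
    print("Log series p(1), p(10):", log_series_pmf(1, 0.9), log_series_pmf(10, 0.9))
```

The slope comes out close to minus one rather than exactly so, because the tail sum only approximates 1/n at finite n.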


Bibliographic Details
Main author: Frank, Steven A.
Format: Online Article Text
Language: English
Published: F1000 Research Limited 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6480937/
https://www.ncbi.nlm.nih.gov/pubmed/31069071
http://dx.doi.org/10.12688/f1000research.18681.1
author Frank, Steven A.
collection PubMed
description In a language corpus, the probability that a word occurs n times is often proportional to 1/n^2. Assigning rank, s, to words according to their abundance, log s vs log n typically has a slope of minus one. That simple Zipf's law pattern also arises in the population sizes of cities, the sizes of corporations, and other patterns of abundance. By contrast, for the abundances of different biological species, the probability of a population of size n is typically proportional to 1/n, declining exponentially for larger n, the log series pattern. This article shows that the differing patterns of Zipf's law and the log series arise as the opposing endpoints of a more general theory. The general theory follows from the generic form of all probability patterns as a consequence of conserved average values and the associated invariances of scale. To understand the common patterns of abundance, the generic form of probability distributions plus the conserved average abundance is sufficient. The general theory includes cases that are between the Zipf and log series endpoints, providing a broad framework for analyzing widely observed abundance patterns.
format Online
Article
Text
id pubmed-6480937
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-6480937 2019-05-07 The common patterns of abundance: the log series and Zipf's law Frank, Steven A. F1000Res Research Article F1000 Research Limited 2019-03-25 /pmc/articles/PMC6480937/ /pubmed/31069071 http://dx.doi.org/10.12688/f1000research.18681.1 Text en Copyright: © 2019 Frank SA http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title The common patterns of abundance: the log series and Zipf's law
topic Research Article