Cargando…
Seeing through noise in power laws
Despite widespread claims of power laws across the natural and social sciences, evidence in data is often equivocal. Modern data and statistical methods reject even classic power laws such as Pareto’s law of wealth and the Gutenberg–Richter law for earthquake magnitudes. We show that the maximum-lik...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10465205/ https://www.ncbi.nlm.nih.gov/pubmed/37643642 http://dx.doi.org/10.1098/rsif.2023.0310 |
_version_ | 1785098617551323136 |
---|---|
author | Lin, Qianying Newberry, Mitchell |
author_facet | Lin, Qianying Newberry, Mitchell |
author_sort | Lin, Qianying |
collection | PubMed |
description | Despite widespread claims of power laws across the natural and social sciences, evidence in data is often equivocal. Modern data and statistical methods reject even classic power laws such as Pareto’s law of wealth and the Gutenberg–Richter law for earthquake magnitudes. We show that the maximum-likelihood estimators and Kolmogorov–Smirnov (K-S) statistics in widespread use are unexpectedly sensitive to ubiquitous errors in data such as measurement noise, quantization noise, heaping and censorship of small values. This sensitivity causes spurious rejection of power laws and biases parameter estimates even in arbitrarily large samples, which explains inconsistencies between theory and data. We show that logarithmic binning by powers of λ > 1 attenuates these errors in a manner analogous to noise averaging in normal statistics and that λ thereby tunes a trade-off between accuracy and precision in estimation. Binning also removes potentially misleading within-scale information while preserving information about the shape of a distribution over powers of λ, and we show that some amount of binning can improve sensitivity and specificity of K-S tests without any cost, while more extreme binning tunes a trade-off between sensitivity and specificity. We therefore advocate logarithmic binning as a simple essential step in power-law inference. |
format | Online Article Text |
id | pubmed-10465205 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | The Royal Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-104652052023-08-30 Seeing through noise in power laws Lin, Qianying Newberry, Mitchell J R Soc Interface Life Sciences–Mathematics interface Despite widespread claims of power laws across the natural and social sciences, evidence in data is often equivocal. Modern data and statistical methods reject even classic power laws such as Pareto’s law of wealth and the Gutenberg–Richter law for earthquake magnitudes. We show that the maximum-likelihood estimators and Kolmogorov–Smirnov (K-S) statistics in widespread use are unexpectedly sensitive to ubiquitous errors in data such as measurement noise, quantization noise, heaping and censorship of small values. This sensitivity causes spurious rejection of power laws and biases parameter estimates even in arbitrarily large samples, which explains inconsistencies between theory and data. We show that logarithmic binning by powers of λ > 1 attenuates these errors in a manner analogous to noise averaging in normal statistics and that λ thereby tunes a trade-off between accuracy and precision in estimation. Binning also removes potentially misleading within-scale information while preserving information about the shape of a distribution over powers of λ, and we show that some amount of binning can improve sensitivity and specificity of K-S tests without any cost, while more extreme binning tunes a trade-off between sensitivity and specificity. We therefore advocate logarithmic binning as a simple essential step in power-law inference. The Royal Society 2023-08-30 /pmc/articles/PMC10465205/ /pubmed/37643642 http://dx.doi.org/10.1098/rsif.2023.0310 Text en © 2023 The Authors. https://creativecommons.org/licenses/by/4.0/Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, provided the original author and source are credited. |
spellingShingle | Life Sciences–Mathematics interface Lin, Qianying Newberry, Mitchell Seeing through noise in power laws |
title | Seeing through noise in power laws |
title_full | Seeing through noise in power laws |
title_fullStr | Seeing through noise in power laws |
title_full_unstemmed | Seeing through noise in power laws |
title_short | Seeing through noise in power laws |
title_sort | seeing through noise in power laws |
topic | Life Sciences–Mathematics interface |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10465205/ https://www.ncbi.nlm.nih.gov/pubmed/37643642 http://dx.doi.org/10.1098/rsif.2023.0310 |
work_keys_str_mv | AT linqianying seeingthroughnoiseinpowerlaws AT newberrymitchell seeingthroughnoiseinpowerlaws |