Cargando…

Seeing through noise in power laws

Despite widespread claims of power laws across the natural and social sciences, evidence in data is often equivocal. Modern data and statistical methods reject even classic power laws such as Pareto’s law of wealth and the Gutenberg–Richter law for earthquake magnitudes. We show that the maximum-lik...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Qianying, Newberry, Mitchell
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10465205/
https://www.ncbi.nlm.nih.gov/pubmed/37643642
http://dx.doi.org/10.1098/rsif.2023.0310
_version_ 1785098617551323136
author Lin, Qianying
Newberry, Mitchell
author_facet Lin, Qianying
Newberry, Mitchell
author_sort Lin, Qianying
collection PubMed
description Despite widespread claims of power laws across the natural and social sciences, evidence in data is often equivocal. Modern data and statistical methods reject even classic power laws such as Pareto’s law of wealth and the Gutenberg–Richter law for earthquake magnitudes. We show that the maximum-likelihood estimators and Kolmogorov–Smirnov (K-S) statistics in widespread use are unexpectedly sensitive to ubiquitous errors in data such as measurement noise, quantization noise, heaping and censorship of small values. This sensitivity causes spurious rejection of power laws and biases parameter estimates even in arbitrarily large samples, which explains inconsistencies between theory and data. We show that logarithmic binning by powers of λ > 1 attenuates these errors in a manner analogous to noise averaging in normal statistics and that λ thereby tunes a trade-off between accuracy and precision in estimation. Binning also removes potentially misleading within-scale information while preserving information about the shape of a distribution over powers of λ, and we show that some amount of binning can improve sensitivity and specificity of K-S tests without any cost, while more extreme binning tunes a trade-off between sensitivity and specificity. We therefore advocate logarithmic binning as a simple essential step in power-law inference.
format Online
Article
Text
id pubmed-10465205
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher The Royal Society
record_format MEDLINE/PubMed
spelling pubmed-104652052023-08-30 Seeing through noise in power laws Lin, Qianying Newberry, Mitchell J R Soc Interface Life Sciences–Mathematics interface Despite widespread claims of power laws across the natural and social sciences, evidence in data is often equivocal. Modern data and statistical methods reject even classic power laws such as Pareto’s law of wealth and the Gutenberg–Richter law for earthquake magnitudes. We show that the maximum-likelihood estimators and Kolmogorov–Smirnov (K-S) statistics in widespread use are unexpectedly sensitive to ubiquitous errors in data such as measurement noise, quantization noise, heaping and censorship of small values. This sensitivity causes spurious rejection of power laws and biases parameter estimates even in arbitrarily large samples, which explains inconsistencies between theory and data. We show that logarithmic binning by powers of λ > 1 attenuates these errors in a manner analogous to noise averaging in normal statistics and that λ thereby tunes a trade-off between accuracy and precision in estimation. Binning also removes potentially misleading within-scale information while preserving information about the shape of a distribution over powers of λ, and we show that some amount of binning can improve sensitivity and specificity of K-S tests without any cost, while more extreme binning tunes a trade-off between sensitivity and specificity. We therefore advocate logarithmic binning as a simple essential step in power-law inference. The Royal Society 2023-08-30 /pmc/articles/PMC10465205/ /pubmed/37643642 http://dx.doi.org/10.1098/rsif.2023.0310 Text en © 2023 The Authors. https://creativecommons.org/licenses/by/4.0/Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, provided the original author and source are credited.
spellingShingle Life Sciences–Mathematics interface
Lin, Qianying
Newberry, Mitchell
Seeing through noise in power laws
title Seeing through noise in power laws
title_full Seeing through noise in power laws
title_fullStr Seeing through noise in power laws
title_full_unstemmed Seeing through noise in power laws
title_short Seeing through noise in power laws
title_sort seeing through noise in power laws
topic Life Sciences–Mathematics interface
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10465205/
https://www.ncbi.nlm.nih.gov/pubmed/37643642
http://dx.doi.org/10.1098/rsif.2023.0310
work_keys_str_mv AT linqianying seeingthroughnoiseinpowerlaws
AT newberrymitchell seeingthroughnoiseinpowerlaws