Cargando…

Strong evidence of an information-theoretical conservation principle linking all discrete systems

Diverse discrete systems share common global properties that lack a unifying theoretical explanation. However, constraining the simplest measure of total information (Hartley–Shannon) in a statistical mechanics framework reveals a principle, the conservation of Hartley–Shannon information (CoHSI) th...

Descripción completa

Detalles Bibliográficos
Autores principales: Hatton, Les, Warr, Gregory
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6837184/
https://www.ncbi.nlm.nih.gov/pubmed/31824716
http://dx.doi.org/10.1098/rsos.191101
_version_ 1783467039586779136
author Hatton, Les
Warr, Gregory
author_facet Hatton, Les
Warr, Gregory
author_sort Hatton, Les
collection PubMed
description Diverse discrete systems share common global properties that lack a unifying theoretical explanation. However, constraining the simplest measure of total information (Hartley–Shannon) in a statistical mechanics framework reveals a principle, the conservation of Hartley–Shannon information (CoHSI) that directly predicts both known and unsuspected common properties of discrete systems, as borne out in the diverse systems of computer software, proteins and music. Discrete systems fall into two categories distinguished by their structure: heterogeneous systems in which there is a distinguishable order of assembly of the system’s components from an alphabet of unique tokens (e.g. proteins assembled from an alphabet of amino acids), and homogeneous systems in which unique tokens are simply binned, counted and rank ordered. Heterogeneous systems are characterized by an implicit distribution of component lengths, with sharp unimodal peak (containing the majority of components) and a power-law tail, whereas homogeneous systems reduce naturally to Zipf’s Law but with a drooping tail in the distribution. We also confirm predictions that very long components are inevitable for heterogeneous systems; that discrete systems can exhibit simultaneously both heterogeneous and homogeneous behaviour; and that in systems with more than one consistent token alphabet (e.g. digital music), the alphabets themselves show a power-law relationship.
format Online
Article
Text
id pubmed-6837184
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher The Royal Society
record_format MEDLINE/PubMed
spelling pubmed-68371842019-12-10 Strong evidence of an information-theoretical conservation principle linking all discrete systems Hatton, Les Warr, Gregory R Soc Open Sci Computer Science Diverse discrete systems share common global properties that lack a unifying theoretical explanation. However, constraining the simplest measure of total information (Hartley–Shannon) in a statistical mechanics framework reveals a principle, the conservation of Hartley–Shannon information (CoHSI) that directly predicts both known and unsuspected common properties of discrete systems, as borne out in the diverse systems of computer software, proteins and music. Discrete systems fall into two categories distinguished by their structure: heterogeneous systems in which there is a distinguishable order of assembly of the system’s components from an alphabet of unique tokens (e.g. proteins assembled from an alphabet of amino acids), and homogeneous systems in which unique tokens are simply binned, counted and rank ordered. Heterogeneous systems are characterized by an implicit distribution of component lengths, with sharp unimodal peak (containing the majority of components) and a power-law tail, whereas homogeneous systems reduce naturally to Zipf’s Law but with a drooping tail in the distribution. We also confirm predictions that very long components are inevitable for heterogeneous systems; that discrete systems can exhibit simultaneously both heterogeneous and homogeneous behaviour; and that in systems with more than one consistent token alphabet (e.g. digital music), the alphabets themselves show a power-law relationship. The Royal Society 2019-10-23 /pmc/articles/PMC6837184/ /pubmed/31824716 http://dx.doi.org/10.1098/rsos.191101 Text en © 2019 The Authors. http://creativecommons.org/licenses/by/4.0/ Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited.
spellingShingle Computer Science
Hatton, Les
Warr, Gregory
Strong evidence of an information-theoretical conservation principle linking all discrete systems
title Strong evidence of an information-theoretical conservation principle linking all discrete systems
title_full Strong evidence of an information-theoretical conservation principle linking all discrete systems
title_fullStr Strong evidence of an information-theoretical conservation principle linking all discrete systems
title_full_unstemmed Strong evidence of an information-theoretical conservation principle linking all discrete systems
title_short Strong evidence of an information-theoretical conservation principle linking all discrete systems
title_sort strong evidence of an information-theoretical conservation principle linking all discrete systems
topic Computer Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6837184/
https://www.ncbi.nlm.nih.gov/pubmed/31824716
http://dx.doi.org/10.1098/rsos.191101
work_keys_str_mv AT hattonles strongevidenceofaninformationtheoreticalconservationprinciplelinkingalldiscretesystems
AT warrgregory strongevidenceofaninformationtheoreticalconservationprinciplelinkingalldiscretesystems