Cargando…

Expected Shannon Entropy and Shannon Differentiation between Subpopulations for Neutral Genes under the Finite Island Model

Shannon entropy H and related measures are increasingly used in molecular ecology and population genetics because (1) unlike measures based on heterozygosity or allele number, these measures weigh alleles in proportion to their population fraction, thus capturing a previously-ignored aspect of allel...

Descripción completa

Detalles Bibliográficos
Autores principales: Chao, Anne, Jost, Lou, Hsieh, T. C., Ma, K. H., Sherwin, William B., Rollins, Lee Ann
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4465833/
https://www.ncbi.nlm.nih.gov/pubmed/26067448
http://dx.doi.org/10.1371/journal.pone.0125471
_version_ 1782376138815832064
author Chao, Anne
Jost, Lou
Hsieh, T. C.
Ma, K. H.
Sherwin, William B.
Rollins, Lee Ann
author_facet Chao, Anne
Jost, Lou
Hsieh, T. C.
Ma, K. H.
Sherwin, William B.
Rollins, Lee Ann
author_sort Chao, Anne
collection PubMed
description Shannon entropy H and related measures are increasingly used in molecular ecology and population genetics because (1) unlike measures based on heterozygosity or allele number, these measures weigh alleles in proportion to their population fraction, thus capturing a previously-ignored aspect of allele frequency distributions that may be important in many applications; (2) these measures connect directly to the rich predictive mathematics of information theory; (3) Shannon entropy is completely additive and has an explicitly hierarchical nature; and (4) Shannon entropy-based differentiation measures obey strong monotonicity properties that heterozygosity-based measures lack. We derive simple new expressions for the expected values of the Shannon entropy of the equilibrium allele distribution at a neutral locus in a single isolated population under two models of mutation: the infinite allele model and the stepwise mutation model. Surprisingly, this complex stochastic system for each model has an entropy expressable as a simple combination of well-known mathematical functions. Moreover, entropy- and heterozygosity-based measures for each model are linked by simple relationships that are shown by simulations to be approximately valid even far from equilibrium. We also identify a bridge between the two models of mutation. We apply our approach to subdivided populations which follow the finite island model, obtaining the Shannon entropy of the equilibrium allele distributions of the subpopulations and of the total population. We also derive the expected mutual information and normalized mutual information (“Shannon differentiation”) between subpopulations at equilibrium, and identify the model parameters that determine them. We apply our measures to data from the common starling (Sturnus vulgaris) in Australia. Our measures provide a test for neutrality that is robust to violations of equilibrium assumptions, as verified on real world data from starlings.
format Online
Article
Text
id pubmed-4465833
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-44658332015-06-25 Expected Shannon Entropy and Shannon Differentiation between Subpopulations for Neutral Genes under the Finite Island Model Chao, Anne Jost, Lou Hsieh, T. C. Ma, K. H. Sherwin, William B. Rollins, Lee Ann PLoS One Research Article Shannon entropy H and related measures are increasingly used in molecular ecology and population genetics because (1) unlike measures based on heterozygosity or allele number, these measures weigh alleles in proportion to their population fraction, thus capturing a previously-ignored aspect of allele frequency distributions that may be important in many applications; (2) these measures connect directly to the rich predictive mathematics of information theory; (3) Shannon entropy is completely additive and has an explicitly hierarchical nature; and (4) Shannon entropy-based differentiation measures obey strong monotonicity properties that heterozygosity-based measures lack. We derive simple new expressions for the expected values of the Shannon entropy of the equilibrium allele distribution at a neutral locus in a single isolated population under two models of mutation: the infinite allele model and the stepwise mutation model. Surprisingly, this complex stochastic system for each model has an entropy expressable as a simple combination of well-known mathematical functions. Moreover, entropy- and heterozygosity-based measures for each model are linked by simple relationships that are shown by simulations to be approximately valid even far from equilibrium. We also identify a bridge between the two models of mutation. We apply our approach to subdivided populations which follow the finite island model, obtaining the Shannon entropy of the equilibrium allele distributions of the subpopulations and of the total population. We also derive the expected mutual information and normalized mutual information (“Shannon differentiation”) between subpopulations at equilibrium, and identify the model parameters that determine them. We apply our measures to data from the common starling (Sturnus vulgaris) in Australia. Our measures provide a test for neutrality that is robust to violations of equilibrium assumptions, as verified on real world data from starlings. Public Library of Science 2015-06-11 /pmc/articles/PMC4465833/ /pubmed/26067448 http://dx.doi.org/10.1371/journal.pone.0125471 Text en © 2015 Chao et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Chao, Anne
Jost, Lou
Hsieh, T. C.
Ma, K. H.
Sherwin, William B.
Rollins, Lee Ann
Expected Shannon Entropy and Shannon Differentiation between Subpopulations for Neutral Genes under the Finite Island Model
title Expected Shannon Entropy and Shannon Differentiation between Subpopulations for Neutral Genes under the Finite Island Model
title_full Expected Shannon Entropy and Shannon Differentiation between Subpopulations for Neutral Genes under the Finite Island Model
title_fullStr Expected Shannon Entropy and Shannon Differentiation between Subpopulations for Neutral Genes under the Finite Island Model
title_full_unstemmed Expected Shannon Entropy and Shannon Differentiation between Subpopulations for Neutral Genes under the Finite Island Model
title_short Expected Shannon Entropy and Shannon Differentiation between Subpopulations for Neutral Genes under the Finite Island Model
title_sort expected shannon entropy and shannon differentiation between subpopulations for neutral genes under the finite island model
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4465833/
https://www.ncbi.nlm.nih.gov/pubmed/26067448
http://dx.doi.org/10.1371/journal.pone.0125471
work_keys_str_mv AT chaoanne expectedshannonentropyandshannondifferentiationbetweensubpopulationsforneutralgenesunderthefiniteislandmodel
AT jostlou expectedshannonentropyandshannondifferentiationbetweensubpopulationsforneutralgenesunderthefiniteislandmodel
AT hsiehtc expectedshannonentropyandshannondifferentiationbetweensubpopulationsforneutralgenesunderthefiniteislandmodel
AT makh expectedshannonentropyandshannondifferentiationbetweensubpopulationsforneutralgenesunderthefiniteislandmodel
AT sherwinwilliamb expectedshannonentropyandshannondifferentiationbetweensubpopulationsforneutralgenesunderthefiniteislandmodel
AT rollinsleeann expectedshannonentropyandshannondifferentiationbetweensubpopulationsforneutralgenesunderthefiniteislandmodel