Cargando…

Confidence Intervals for Population Allele Frequencies: The General Case of Sampling from a Finite Diploid Population of Any Size

The estimation of population allele frequencies using sample data forms a central component of studies in population genetics. These estimates can be used to test hypotheses on the evolutionary processes governing changes in genetic variation among populations. However, existing studies frequently d...

Descripción completa

Detalles Bibliográficos
Autores principales: Fung, Tak, Keenan, Kevin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3897575/
https://www.ncbi.nlm.nih.gov/pubmed/24465792
http://dx.doi.org/10.1371/journal.pone.0085925
_version_ 1782300258459451392
author Fung, Tak
Keenan, Kevin
author_facet Fung, Tak
Keenan, Kevin
author_sort Fung, Tak
collection PubMed
description The estimation of population allele frequencies using sample data forms a central component of studies in population genetics. These estimates can be used to test hypotheses on the evolutionary processes governing changes in genetic variation among populations. However, existing studies frequently do not account for sampling uncertainty in these estimates, thus compromising their utility. Incorporation of this uncertainty has been hindered by the lack of a method for constructing confidence intervals containing the population allele frequencies, for the general case of sampling from a finite diploid population of any size. In this study, we address this important knowledge gap by presenting a rigorous mathematical method to construct such confidence intervals. For a range of scenarios, the method is used to demonstrate that for a particular allele, in order to obtain accurate estimates within 0.05 of the population allele frequency with high probability ([Image: see text]%), a sample size of [Image: see text] is often required. This analysis is augmented by an application of the method to empirical sample allele frequency data for two populations of the checkerspot butterfly (Melitaea cinxia L.), occupying meadows in Finland. For each population, the method is used to derive [Image: see text]% confidence intervals for the population frequencies of three alleles. These intervals are then used to construct two joint [Image: see text]% confidence regions, one for the set of three frequencies for each population. These regions are then used to derive a [Image: see text]% confidence interval for Jost's D, a measure of genetic differentiation between the two populations. Overall, the results demonstrate the practical utility of the method with respect to informing sampling design and accounting for sampling uncertainty in studies of population genetics, important for scientific hypothesis-testing and also for risk-based natural resource management.
format Online
Article
Text
id pubmed-3897575
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-38975752014-01-24 Confidence Intervals for Population Allele Frequencies: The General Case of Sampling from a Finite Diploid Population of Any Size Fung, Tak Keenan, Kevin PLoS One Research Article The estimation of population allele frequencies using sample data forms a central component of studies in population genetics. These estimates can be used to test hypotheses on the evolutionary processes governing changes in genetic variation among populations. However, existing studies frequently do not account for sampling uncertainty in these estimates, thus compromising their utility. Incorporation of this uncertainty has been hindered by the lack of a method for constructing confidence intervals containing the population allele frequencies, for the general case of sampling from a finite diploid population of any size. In this study, we address this important knowledge gap by presenting a rigorous mathematical method to construct such confidence intervals. For a range of scenarios, the method is used to demonstrate that for a particular allele, in order to obtain accurate estimates within 0.05 of the population allele frequency with high probability ([Image: see text]%), a sample size of [Image: see text] is often required. This analysis is augmented by an application of the method to empirical sample allele frequency data for two populations of the checkerspot butterfly (Melitaea cinxia L.), occupying meadows in Finland. For each population, the method is used to derive [Image: see text]% confidence intervals for the population frequencies of three alleles. These intervals are then used to construct two joint [Image: see text]% confidence regions, one for the set of three frequencies for each population. These regions are then used to derive a [Image: see text]% confidence interval for Jost's D, a measure of genetic differentiation between the two populations. Overall, the results demonstrate the practical utility of the method with respect to informing sampling design and accounting for sampling uncertainty in studies of population genetics, important for scientific hypothesis-testing and also for risk-based natural resource management. Public Library of Science 2014-01-21 /pmc/articles/PMC3897575/ /pubmed/24465792 http://dx.doi.org/10.1371/journal.pone.0085925 Text en © 2014 Fung, Keenan http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Fung, Tak
Keenan, Kevin
Confidence Intervals for Population Allele Frequencies: The General Case of Sampling from a Finite Diploid Population of Any Size
title Confidence Intervals for Population Allele Frequencies: The General Case of Sampling from a Finite Diploid Population of Any Size
title_full Confidence Intervals for Population Allele Frequencies: The General Case of Sampling from a Finite Diploid Population of Any Size
title_fullStr Confidence Intervals for Population Allele Frequencies: The General Case of Sampling from a Finite Diploid Population of Any Size
title_full_unstemmed Confidence Intervals for Population Allele Frequencies: The General Case of Sampling from a Finite Diploid Population of Any Size
title_short Confidence Intervals for Population Allele Frequencies: The General Case of Sampling from a Finite Diploid Population of Any Size
title_sort confidence intervals for population allele frequencies: the general case of sampling from a finite diploid population of any size
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3897575/
https://www.ncbi.nlm.nih.gov/pubmed/24465792
http://dx.doi.org/10.1371/journal.pone.0085925
work_keys_str_mv AT fungtak confidenceintervalsforpopulationallelefrequenciesthegeneralcaseofsamplingfromafinitediploidpopulationofanysize
AT keenankevin confidenceintervalsforpopulationallelefrequenciesthegeneralcaseofsamplingfromafinitediploidpopulationofanysize