Cargando…

Gene Sampling Strategies for Multi-Locus Population Estimates of Genetic Diversity (θ)

BACKGROUND: Theoretical work suggests that data from multiple nuclear loci provide better estimates of population genetic parameters than do single loci, but just how many loci are needed and how much sequence is required from each has been little explored. METHODOLOGY/PRINCIPLE FINDINGS: To investi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Carling, Matthew D., Brumfield, Robb T.
Formato:	Texto
Lenguaje:	English
Publicado:	Public Library of Science 2007
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1764684/ https://www.ncbi.nlm.nih.gov/pubmed/17225863 http://dx.doi.org/10.1371/journal.pone.0000160

_version_	1782131626649583616
author	Carling, Matthew D. Brumfield, Robb T.
author_facet	Carling, Matthew D. Brumfield, Robb T.
author_sort	Carling, Matthew D.
collection	PubMed
description	BACKGROUND: Theoretical work suggests that data from multiple nuclear loci provide better estimates of population genetic parameters than do single loci, but just how many loci are needed and how much sequence is required from each has been little explored. METHODOLOGY/PRINCIPLE FINDINGS: To investigate how much data is required to estimate the population genetic parameter θ (4N(e)μ) accurately under ideal circumstances, we simulated datasets of DNA sequences under three values of θ per site (0.1, 0.01, 0.001), varying in both the total number of base pairs sequenced per individual and the number of equal-length loci. From these datasets we estimated θ using the maximum likelihood coalescent framework implemented in the computer program Migrate. Our results corroborated the theoretical expectation that increasing the number of loci impacted the accuracy of the estimate more than increasing the sequence length at single loci. However, when the value of θ was low (0.001), the per-locus sequence length was also important for estimating θ accurately, something that has not been emphasized in previous work. CONCLUSIONS/SIGNIFICANCE: Accurate estimation of θ required data from at least 25 independently evolving loci. Beyond this, there was little added benefit in terms of decreasing the squared coefficient of variation of the coalescent estimates relative to the extra effort required to sample more loci.
format	Text
id	pubmed-1764684
institution	National Center for Biotechnology Information
language	English
publishDate	2007
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-17646842007-01-17 Gene Sampling Strategies for Multi-Locus Population Estimates of Genetic Diversity (θ) Carling, Matthew D. Brumfield, Robb T. PLoS One Research Article BACKGROUND: Theoretical work suggests that data from multiple nuclear loci provide better estimates of population genetic parameters than do single loci, but just how many loci are needed and how much sequence is required from each has been little explored. METHODOLOGY/PRINCIPLE FINDINGS: To investigate how much data is required to estimate the population genetic parameter θ (4N(e)μ) accurately under ideal circumstances, we simulated datasets of DNA sequences under three values of θ per site (0.1, 0.01, 0.001), varying in both the total number of base pairs sequenced per individual and the number of equal-length loci. From these datasets we estimated θ using the maximum likelihood coalescent framework implemented in the computer program Migrate. Our results corroborated the theoretical expectation that increasing the number of loci impacted the accuracy of the estimate more than increasing the sequence length at single loci. However, when the value of θ was low (0.001), the per-locus sequence length was also important for estimating θ accurately, something that has not been emphasized in previous work. CONCLUSIONS/SIGNIFICANCE: Accurate estimation of θ required data from at least 25 independently evolving loci. Beyond this, there was little added benefit in terms of decreasing the squared coefficient of variation of the coalescent estimates relative to the extra effort required to sample more loci. Public Library of Science 2007-01-17 /pmc/articles/PMC1764684/ /pubmed/17225863 http://dx.doi.org/10.1371/journal.pone.0000160 Text en Carling, Brumfield. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle	Research Article Carling, Matthew D. Brumfield, Robb T. Gene Sampling Strategies for Multi-Locus Population Estimates of Genetic Diversity (θ)
title	Gene Sampling Strategies for Multi-Locus Population Estimates of Genetic Diversity (θ)
title_full	Gene Sampling Strategies for Multi-Locus Population Estimates of Genetic Diversity (θ)
title_fullStr	Gene Sampling Strategies for Multi-Locus Population Estimates of Genetic Diversity (θ)
title_full_unstemmed	Gene Sampling Strategies for Multi-Locus Population Estimates of Genetic Diversity (θ)
title_short	Gene Sampling Strategies for Multi-Locus Population Estimates of Genetic Diversity (θ)
title_sort	gene sampling strategies for multi-locus population estimates of genetic diversity (θ)
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1764684/ https://www.ncbi.nlm.nih.gov/pubmed/17225863 http://dx.doi.org/10.1371/journal.pone.0000160
work_keys_str_mv	AT carlingmatthewd genesamplingstrategiesformultilocuspopulationestimatesofgeneticdiversityth AT brumfieldrobbt genesamplingstrategiesformultilocuspopulationestimatesofgeneticdiversityth

Gene Sampling Strategies for Multi-Locus Population Estimates of Genetic Diversity (θ)

Ejemplares similares