Cargando…

A simple method for estimating genetic diversity in large populations from finite sample sizes

BACKGROUND: Sample size is one of the critical factors affecting the accuracy of the estimation of population genetic diversity parameters. Small sample sizes often lead to significant errors in determining the allelic richness, which is one of the most important and commonly used estimators of gene...

Descripción completa

Detalles Bibliográficos
Autores principales: Bashalkhanov, Stanislav, Pandey, Madhav, Rajora, Om P
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2800116/
https://www.ncbi.nlm.nih.gov/pubmed/20003542
http://dx.doi.org/10.1186/1471-2156-10-84
_version_ 1782175840779370496
author Bashalkhanov, Stanislav
Pandey, Madhav
Rajora, Om P
author_facet Bashalkhanov, Stanislav
Pandey, Madhav
Rajora, Om P
author_sort Bashalkhanov, Stanislav
collection PubMed
description BACKGROUND: Sample size is one of the critical factors affecting the accuracy of the estimation of population genetic diversity parameters. Small sample sizes often lead to significant errors in determining the allelic richness, which is one of the most important and commonly used estimators of genetic diversity in populations. Correct estimation of allelic richness in natural populations is challenging since they often do not conform to model assumptions. Here, we introduce a simple and robust approach to estimate the genetic diversity in large natural populations based on the empirical data for finite sample sizes. RESULTS: We developed a non-linear regression model to infer genetic diversity estimates in large natural populations from finite sample sizes. The allelic richness values predicted by our model were in good agreement with those observed in the simulated data sets and the true allelic richness observed in the source populations. The model has been validated using simulated population genetic data sets with different evolutionary scenarios implied in the simulated populations, as well as large microsatellite and allozyme experimental data sets for four conifer species with contrasting patterns of inherent genetic diversity and mating systems. Our model was a better predictor for allelic richness in natural populations than the widely-used Ewens sampling formula, coalescent approach, and rarefaction algorithm. CONCLUSIONS: Our regression model was capable of accurately estimating allelic richness in natural populations regardless of the species and marker system. This regression modeling approach is free from assumptions and can be widely used for population genetic and conservation applications.
format Text
id pubmed-2800116
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28001162009-12-31 A simple method for estimating genetic diversity in large populations from finite sample sizes Bashalkhanov, Stanislav Pandey, Madhav Rajora, Om P BMC Genet Research article BACKGROUND: Sample size is one of the critical factors affecting the accuracy of the estimation of population genetic diversity parameters. Small sample sizes often lead to significant errors in determining the allelic richness, which is one of the most important and commonly used estimators of genetic diversity in populations. Correct estimation of allelic richness in natural populations is challenging since they often do not conform to model assumptions. Here, we introduce a simple and robust approach to estimate the genetic diversity in large natural populations based on the empirical data for finite sample sizes. RESULTS: We developed a non-linear regression model to infer genetic diversity estimates in large natural populations from finite sample sizes. The allelic richness values predicted by our model were in good agreement with those observed in the simulated data sets and the true allelic richness observed in the source populations. The model has been validated using simulated population genetic data sets with different evolutionary scenarios implied in the simulated populations, as well as large microsatellite and allozyme experimental data sets for four conifer species with contrasting patterns of inherent genetic diversity and mating systems. Our model was a better predictor for allelic richness in natural populations than the widely-used Ewens sampling formula, coalescent approach, and rarefaction algorithm. CONCLUSIONS: Our regression model was capable of accurately estimating allelic richness in natural populations regardless of the species and marker system. This regression modeling approach is free from assumptions and can be widely used for population genetic and conservation applications. BioMed Central 2009-12-16 /pmc/articles/PMC2800116/ /pubmed/20003542 http://dx.doi.org/10.1186/1471-2156-10-84 Text en Copyright ©2009 Bashalkhanov et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Bashalkhanov, Stanislav
Pandey, Madhav
Rajora, Om P
A simple method for estimating genetic diversity in large populations from finite sample sizes
title A simple method for estimating genetic diversity in large populations from finite sample sizes
title_full A simple method for estimating genetic diversity in large populations from finite sample sizes
title_fullStr A simple method for estimating genetic diversity in large populations from finite sample sizes
title_full_unstemmed A simple method for estimating genetic diversity in large populations from finite sample sizes
title_short A simple method for estimating genetic diversity in large populations from finite sample sizes
title_sort simple method for estimating genetic diversity in large populations from finite sample sizes
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2800116/
https://www.ncbi.nlm.nih.gov/pubmed/20003542
http://dx.doi.org/10.1186/1471-2156-10-84
work_keys_str_mv AT bashalkhanovstanislav asimplemethodforestimatinggeneticdiversityinlargepopulationsfromfinitesamplesizes
AT pandeymadhav asimplemethodforestimatinggeneticdiversityinlargepopulationsfromfinitesamplesizes
AT rajoraomp asimplemethodforestimatinggeneticdiversityinlargepopulationsfromfinitesamplesizes
AT bashalkhanovstanislav simplemethodforestimatinggeneticdiversityinlargepopulationsfromfinitesamplesizes
AT pandeymadhav simplemethodforestimatinggeneticdiversityinlargepopulationsfromfinitesamplesizes
AT rajoraomp simplemethodforestimatinggeneticdiversityinlargepopulationsfromfinitesamplesizes