Cargando…
A simple method for estimating genetic diversity in large populations from finite sample sizes
BACKGROUND: Sample size is one of the critical factors affecting the accuracy of the estimation of population genetic diversity parameters. Small sample sizes often lead to significant errors in determining the allelic richness, which is one of the most important and commonly used estimators of gene...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2800116/ https://www.ncbi.nlm.nih.gov/pubmed/20003542 http://dx.doi.org/10.1186/1471-2156-10-84 |
_version_ | 1782175840779370496 |
---|---|
author | Bashalkhanov, Stanislav Pandey, Madhav Rajora, Om P |
author_facet | Bashalkhanov, Stanislav Pandey, Madhav Rajora, Om P |
author_sort | Bashalkhanov, Stanislav |
collection | PubMed |
description | BACKGROUND: Sample size is one of the critical factors affecting the accuracy of the estimation of population genetic diversity parameters. Small sample sizes often lead to significant errors in determining the allelic richness, which is one of the most important and commonly used estimators of genetic diversity in populations. Correct estimation of allelic richness in natural populations is challenging since they often do not conform to model assumptions. Here, we introduce a simple and robust approach to estimate the genetic diversity in large natural populations based on the empirical data for finite sample sizes. RESULTS: We developed a non-linear regression model to infer genetic diversity estimates in large natural populations from finite sample sizes. The allelic richness values predicted by our model were in good agreement with those observed in the simulated data sets and the true allelic richness observed in the source populations. The model has been validated using simulated population genetic data sets with different evolutionary scenarios implied in the simulated populations, as well as large microsatellite and allozyme experimental data sets for four conifer species with contrasting patterns of inherent genetic diversity and mating systems. Our model was a better predictor for allelic richness in natural populations than the widely-used Ewens sampling formula, coalescent approach, and rarefaction algorithm. CONCLUSIONS: Our regression model was capable of accurately estimating allelic richness in natural populations regardless of the species and marker system. This regression modeling approach is free from assumptions and can be widely used for population genetic and conservation applications. |
format | Text |
id | pubmed-2800116 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-28001162009-12-31 A simple method for estimating genetic diversity in large populations from finite sample sizes Bashalkhanov, Stanislav Pandey, Madhav Rajora, Om P BMC Genet Research article BACKGROUND: Sample size is one of the critical factors affecting the accuracy of the estimation of population genetic diversity parameters. Small sample sizes often lead to significant errors in determining the allelic richness, which is one of the most important and commonly used estimators of genetic diversity in populations. Correct estimation of allelic richness in natural populations is challenging since they often do not conform to model assumptions. Here, we introduce a simple and robust approach to estimate the genetic diversity in large natural populations based on the empirical data for finite sample sizes. RESULTS: We developed a non-linear regression model to infer genetic diversity estimates in large natural populations from finite sample sizes. The allelic richness values predicted by our model were in good agreement with those observed in the simulated data sets and the true allelic richness observed in the source populations. The model has been validated using simulated population genetic data sets with different evolutionary scenarios implied in the simulated populations, as well as large microsatellite and allozyme experimental data sets for four conifer species with contrasting patterns of inherent genetic diversity and mating systems. Our model was a better predictor for allelic richness in natural populations than the widely-used Ewens sampling formula, coalescent approach, and rarefaction algorithm. CONCLUSIONS: Our regression model was capable of accurately estimating allelic richness in natural populations regardless of the species and marker system. This regression modeling approach is free from assumptions and can be widely used for population genetic and conservation applications. BioMed Central 2009-12-16 /pmc/articles/PMC2800116/ /pubmed/20003542 http://dx.doi.org/10.1186/1471-2156-10-84 Text en Copyright ©2009 Bashalkhanov et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research article Bashalkhanov, Stanislav Pandey, Madhav Rajora, Om P A simple method for estimating genetic diversity in large populations from finite sample sizes |
title | A simple method for estimating genetic diversity in large populations from finite sample sizes |
title_full | A simple method for estimating genetic diversity in large populations from finite sample sizes |
title_fullStr | A simple method for estimating genetic diversity in large populations from finite sample sizes |
title_full_unstemmed | A simple method for estimating genetic diversity in large populations from finite sample sizes |
title_short | A simple method for estimating genetic diversity in large populations from finite sample sizes |
title_sort | simple method for estimating genetic diversity in large populations from finite sample sizes |
topic | Research article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2800116/ https://www.ncbi.nlm.nih.gov/pubmed/20003542 http://dx.doi.org/10.1186/1471-2156-10-84 |
work_keys_str_mv | AT bashalkhanovstanislav asimplemethodforestimatinggeneticdiversityinlargepopulationsfromfinitesamplesizes AT pandeymadhav asimplemethodforestimatinggeneticdiversityinlargepopulationsfromfinitesamplesizes AT rajoraomp asimplemethodforestimatinggeneticdiversityinlargepopulationsfromfinitesamplesizes AT bashalkhanovstanislav simplemethodforestimatinggeneticdiversityinlargepopulationsfromfinitesamplesizes AT pandeymadhav simplemethodforestimatinggeneticdiversityinlargepopulationsfromfinitesamplesizes AT rajoraomp simplemethodforestimatinggeneticdiversityinlargepopulationsfromfinitesamplesizes |