Cargando…

Empirical Distributions of F (ST) from Large-Scale Human Polymorphism Data

Studies of the apportionment of human genetic variation have long established that most human variation is within population groups and that the additional variation between population groups is small but greatest when comparing different continental populations. These studies often used Wright’s F...

Descripción completa

Detalles Bibliográficos
Autor principal: Elhaik, Eran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3504095/
https://www.ncbi.nlm.nih.gov/pubmed/23185452
http://dx.doi.org/10.1371/journal.pone.0049837
_version_ 1782250573153697792
author Elhaik, Eran
author_facet Elhaik, Eran
author_sort Elhaik, Eran
collection PubMed
description Studies of the apportionment of human genetic variation have long established that most human variation is within population groups and that the additional variation between population groups is small but greatest when comparing different continental populations. These studies often used Wright’s F (ST) that apportions the standardized variance in allele frequencies within and between population groups. Because local adaptations increase population differentiation, high-F (ST) may be found at closely linked loci under selection and used to identify genes undergoing directional or heterotic selection. We re-examined these processes using HapMap data. We analyzed 3 million SNPs on 602 samples from eight worldwide populations and a consensus subset of 1 million SNPs found in all populations. We identified four major features of the data: First, a hierarchically F (ST) analysis showed that only a paucity (12%) of the total genetic variation is distributed between continental populations and even a lesser genetic variation (1%) is found between intra-continental populations. Second, the global F (ST) distribution closely follows an exponential distribution. Third, although the overall F (ST) distribution is similarly shaped (inverse J), F (ST) distributions varies markedly by allele frequency when divided into non-overlapping groups by allele frequency range. Because the mean allele frequency is a crude indicator of allele age, these distributions mark the time-dependent change in genetic differentiation. Finally, the change in mean-F (ST) of these groups is linear in allele frequency. These results suggest that investigating the extremes of the F (ST) distribution for each allele frequency group is more efficient for detecting selection. Consequently, we demonstrate that such extreme SNPs are more clustered along the chromosomes than expected from linkage disequilibrium for each allele frequency group. These genomic regions are therefore likely candidates for natural selection.
format Online
Article
Text
id pubmed-3504095
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-35040952012-11-26 Empirical Distributions of F (ST) from Large-Scale Human Polymorphism Data Elhaik, Eran PLoS One Research Article Studies of the apportionment of human genetic variation have long established that most human variation is within population groups and that the additional variation between population groups is small but greatest when comparing different continental populations. These studies often used Wright’s F (ST) that apportions the standardized variance in allele frequencies within and between population groups. Because local adaptations increase population differentiation, high-F (ST) may be found at closely linked loci under selection and used to identify genes undergoing directional or heterotic selection. We re-examined these processes using HapMap data. We analyzed 3 million SNPs on 602 samples from eight worldwide populations and a consensus subset of 1 million SNPs found in all populations. We identified four major features of the data: First, a hierarchically F (ST) analysis showed that only a paucity (12%) of the total genetic variation is distributed between continental populations and even a lesser genetic variation (1%) is found between intra-continental populations. Second, the global F (ST) distribution closely follows an exponential distribution. Third, although the overall F (ST) distribution is similarly shaped (inverse J), F (ST) distributions varies markedly by allele frequency when divided into non-overlapping groups by allele frequency range. Because the mean allele frequency is a crude indicator of allele age, these distributions mark the time-dependent change in genetic differentiation. Finally, the change in mean-F (ST) of these groups is linear in allele frequency. These results suggest that investigating the extremes of the F (ST) distribution for each allele frequency group is more efficient for detecting selection. Consequently, we demonstrate that such extreme SNPs are more clustered along the chromosomes than expected from linkage disequilibrium for each allele frequency group. These genomic regions are therefore likely candidates for natural selection. Public Library of Science 2012-11-21 /pmc/articles/PMC3504095/ /pubmed/23185452 http://dx.doi.org/10.1371/journal.pone.0049837 Text en © 2012 Eran Elhaik http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Elhaik, Eran
Empirical Distributions of F (ST) from Large-Scale Human Polymorphism Data
title Empirical Distributions of F (ST) from Large-Scale Human Polymorphism Data
title_full Empirical Distributions of F (ST) from Large-Scale Human Polymorphism Data
title_fullStr Empirical Distributions of F (ST) from Large-Scale Human Polymorphism Data
title_full_unstemmed Empirical Distributions of F (ST) from Large-Scale Human Polymorphism Data
title_short Empirical Distributions of F (ST) from Large-Scale Human Polymorphism Data
title_sort empirical distributions of f (st) from large-scale human polymorphism data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3504095/
https://www.ncbi.nlm.nih.gov/pubmed/23185452
http://dx.doi.org/10.1371/journal.pone.0049837
work_keys_str_mv AT elhaikeran empiricaldistributionsoffstfromlargescalehumanpolymorphismdata