Cargando…

Inference of Super-exponential Human Population Growth via Efficient Computation of the Site Frequency Spectrum for Generalized Models

The site frequency spectrum (SFS) and other genetic summary statistics are at the heart of many population genetic studies. Previous studies have shown that human populations have undergone a recent epoch of fast growth in effective population size. These studies assumed that growth is exponential,...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Feng, Keinan, Alon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4701087/
https://www.ncbi.nlm.nih.gov/pubmed/26450922
http://dx.doi.org/10.1534/genetics.115.180570
_version_ 1782408425573974016
author Gao, Feng
Keinan, Alon
author_facet Gao, Feng
Keinan, Alon
author_sort Gao, Feng
collection PubMed
description The site frequency spectrum (SFS) and other genetic summary statistics are at the heart of many population genetic studies. Previous studies have shown that human populations have undergone a recent epoch of fast growth in effective population size. These studies assumed that growth is exponential, and the ensuing models leave an excess amount of extremely rare variants. This suggests that human populations might have experienced a recent growth with speed faster than exponential. Recent studies have introduced a generalized growth model where the growth speed can be faster or slower than exponential. However, only simulation approaches were available for obtaining summary statistics under such generalized models. In this study, we provide expressions to accurately and efficiently evaluate the SFS and other summary statistics under generalized models, which we further implement in a publicly available software. Investigating the power to infer deviation of growth from being exponential, we observed that adequate sample sizes facilitate accurate inference; e.g., a sample of 3000 individuals with the amount of data expected from exome sequencing allows observing and accurately estimating growth with speed deviating by ≥10% from that of exponential. Applying our inference framework to data from the NHLBI Exome Sequencing Project, we found that a model with a generalized growth epoch fits the observed SFS significantly better than the equivalent model with exponential growth (P-value [Formula: see text]). The estimated growth speed significantly deviates from exponential (P-value [Formula: see text]), with the best-fit estimate being of growth speed 12% faster than exponential.
format Online
Article
Text
id pubmed-4701087
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-47010872016-01-06 Inference of Super-exponential Human Population Growth via Efficient Computation of the Site Frequency Spectrum for Generalized Models Gao, Feng Keinan, Alon Genetics Investigations The site frequency spectrum (SFS) and other genetic summary statistics are at the heart of many population genetic studies. Previous studies have shown that human populations have undergone a recent epoch of fast growth in effective population size. These studies assumed that growth is exponential, and the ensuing models leave an excess amount of extremely rare variants. This suggests that human populations might have experienced a recent growth with speed faster than exponential. Recent studies have introduced a generalized growth model where the growth speed can be faster or slower than exponential. However, only simulation approaches were available for obtaining summary statistics under such generalized models. In this study, we provide expressions to accurately and efficiently evaluate the SFS and other summary statistics under generalized models, which we further implement in a publicly available software. Investigating the power to infer deviation of growth from being exponential, we observed that adequate sample sizes facilitate accurate inference; e.g., a sample of 3000 individuals with the amount of data expected from exome sequencing allows observing and accurately estimating growth with speed deviating by ≥10% from that of exponential. Applying our inference framework to data from the NHLBI Exome Sequencing Project, we found that a model with a generalized growth epoch fits the observed SFS significantly better than the equivalent model with exponential growth (P-value [Formula: see text]). The estimated growth speed significantly deviates from exponential (P-value [Formula: see text]), with the best-fit estimate being of growth speed 12% faster than exponential. Genetics Society of America 2016-01 2015-10-08 /pmc/articles/PMC4701087/ /pubmed/26450922 http://dx.doi.org/10.1534/genetics.115.180570 Text en Copyright © 2016 by the Genetics Society of America Available freely online through the author-supported open access option.
spellingShingle Investigations
Gao, Feng
Keinan, Alon
Inference of Super-exponential Human Population Growth via Efficient Computation of the Site Frequency Spectrum for Generalized Models
title Inference of Super-exponential Human Population Growth via Efficient Computation of the Site Frequency Spectrum for Generalized Models
title_full Inference of Super-exponential Human Population Growth via Efficient Computation of the Site Frequency Spectrum for Generalized Models
title_fullStr Inference of Super-exponential Human Population Growth via Efficient Computation of the Site Frequency Spectrum for Generalized Models
title_full_unstemmed Inference of Super-exponential Human Population Growth via Efficient Computation of the Site Frequency Spectrum for Generalized Models
title_short Inference of Super-exponential Human Population Growth via Efficient Computation of the Site Frequency Spectrum for Generalized Models
title_sort inference of super-exponential human population growth via efficient computation of the site frequency spectrum for generalized models
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4701087/
https://www.ncbi.nlm.nih.gov/pubmed/26450922
http://dx.doi.org/10.1534/genetics.115.180570
work_keys_str_mv AT gaofeng inferenceofsuperexponentialhumanpopulationgrowthviaefficientcomputationofthesitefrequencyspectrumforgeneralizedmodels
AT keinanalon inferenceofsuperexponentialhumanpopulationgrowthviaefficientcomputationofthesitefrequencyspectrumforgeneralizedmodels