Cargando…

GADMA: Genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data

BACKGROUND: The demographic history of any population is imprinted in the genomes of the individuals that make up the population. One of the most popular and convenient representations of genetic information is the allele frequency spectrum (AFS), the distribution of allele frequencies in population...

Descripción completa

Detalles Bibliográficos
Autores principales: Noskova, Ekaterina, Ulyantsev, Vladimir, Koepfli, Klaus-Peter, O’Brien, Stephen J, Dobrynin, Pavel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7049072/
https://www.ncbi.nlm.nih.gov/pubmed/32112099
http://dx.doi.org/10.1093/gigascience/giaa005
_version_ 1783502375419379712
author Noskova, Ekaterina
Ulyantsev, Vladimir
Koepfli, Klaus-Peter
O’Brien, Stephen J
Dobrynin, Pavel
author_facet Noskova, Ekaterina
Ulyantsev, Vladimir
Koepfli, Klaus-Peter
O’Brien, Stephen J
Dobrynin, Pavel
author_sort Noskova, Ekaterina
collection PubMed
description BACKGROUND: The demographic history of any population is imprinted in the genomes of the individuals that make up the population. One of the most popular and convenient representations of genetic information is the allele frequency spectrum (AFS), the distribution of allele frequencies in populations. The joint AFS is commonly used to reconstruct the demographic history of multiple populations, and several methods based on diffusion approximation (e.g., ∂a∂i) and ordinary differential equations (e.g., moments) have been developed and applied for demographic inference. These methods provide an opportunity to simulate AFS under a variety of researcher-specified demographic models and to estimate the best model and associated parameters using likelihood-based local optimizations. However, there are no known algorithms to perform global searches of demographic models with a given AFS. RESULTS: Here, we introduce a new method that implements a global search using a genetic algorithm for the automatic and unsupervised inference of demographic history from joint AFS data. Our method is implemented in the software GADMA (Genetic Algorithm for Demographic Model Analysis, https://github.com/ctlab/GADMA). CONCLUSIONS: We demonstrate the performance of GADMA by applying it to sequence data from humans and non-model organisms and show that it is able to automatically infer a demographic model close to or even better than the one that was previously obtained manually. Moreover, GADMA is able to infer multiple demographic models at different local optima close to the global one, providing a larger set of possible scenarios to further explore demographic history.
format Online
Article
Text
id pubmed-7049072
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-70490722020-03-03 GADMA: Genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data Noskova, Ekaterina Ulyantsev, Vladimir Koepfli, Klaus-Peter O’Brien, Stephen J Dobrynin, Pavel Gigascience Technical Note BACKGROUND: The demographic history of any population is imprinted in the genomes of the individuals that make up the population. One of the most popular and convenient representations of genetic information is the allele frequency spectrum (AFS), the distribution of allele frequencies in populations. The joint AFS is commonly used to reconstruct the demographic history of multiple populations, and several methods based on diffusion approximation (e.g., ∂a∂i) and ordinary differential equations (e.g., moments) have been developed and applied for demographic inference. These methods provide an opportunity to simulate AFS under a variety of researcher-specified demographic models and to estimate the best model and associated parameters using likelihood-based local optimizations. However, there are no known algorithms to perform global searches of demographic models with a given AFS. RESULTS: Here, we introduce a new method that implements a global search using a genetic algorithm for the automatic and unsupervised inference of demographic history from joint AFS data. Our method is implemented in the software GADMA (Genetic Algorithm for Demographic Model Analysis, https://github.com/ctlab/GADMA). CONCLUSIONS: We demonstrate the performance of GADMA by applying it to sequence data from humans and non-model organisms and show that it is able to automatically infer a demographic model close to or even better than the one that was previously obtained manually. Moreover, GADMA is able to infer multiple demographic models at different local optima close to the global one, providing a larger set of possible scenarios to further explore demographic history. Oxford University Press 2020-02-29 /pmc/articles/PMC7049072/ /pubmed/32112099 http://dx.doi.org/10.1093/gigascience/giaa005 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Noskova, Ekaterina
Ulyantsev, Vladimir
Koepfli, Klaus-Peter
O’Brien, Stephen J
Dobrynin, Pavel
GADMA: Genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data
title GADMA: Genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data
title_full GADMA: Genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data
title_fullStr GADMA: Genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data
title_full_unstemmed GADMA: Genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data
title_short GADMA: Genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data
title_sort gadma: genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7049072/
https://www.ncbi.nlm.nih.gov/pubmed/32112099
http://dx.doi.org/10.1093/gigascience/giaa005
work_keys_str_mv AT noskovaekaterina gadmageneticalgorithmforinferringdemographichistoryofmultiplepopulationsfromallelefrequencyspectrumdata
AT ulyantsevvladimir gadmageneticalgorithmforinferringdemographichistoryofmultiplepopulationsfromallelefrequencyspectrumdata
AT koepfliklauspeter gadmageneticalgorithmforinferringdemographichistoryofmultiplepopulationsfromallelefrequencyspectrumdata
AT obrienstephenj gadmageneticalgorithmforinferringdemographichistoryofmultiplepopulationsfromallelefrequencyspectrumdata
AT dobryninpavel gadmageneticalgorithmforinferringdemographichistoryofmultiplepopulationsfromallelefrequencyspectrumdata