Cargando…
GADMA: Genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data
BACKGROUND: The demographic history of any population is imprinted in the genomes of the individuals that make up the population. One of the most popular and convenient representations of genetic information is the allele frequency spectrum (AFS), the distribution of allele frequencies in population...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7049072/ https://www.ncbi.nlm.nih.gov/pubmed/32112099 http://dx.doi.org/10.1093/gigascience/giaa005 |
_version_ | 1783502375419379712 |
---|---|
author | Noskova, Ekaterina Ulyantsev, Vladimir Koepfli, Klaus-Peter O’Brien, Stephen J Dobrynin, Pavel |
author_facet | Noskova, Ekaterina Ulyantsev, Vladimir Koepfli, Klaus-Peter O’Brien, Stephen J Dobrynin, Pavel |
author_sort | Noskova, Ekaterina |
collection | PubMed |
description | BACKGROUND: The demographic history of any population is imprinted in the genomes of the individuals that make up the population. One of the most popular and convenient representations of genetic information is the allele frequency spectrum (AFS), the distribution of allele frequencies in populations. The joint AFS is commonly used to reconstruct the demographic history of multiple populations, and several methods based on diffusion approximation (e.g., ∂a∂i) and ordinary differential equations (e.g., moments) have been developed and applied for demographic inference. These methods provide an opportunity to simulate AFS under a variety of researcher-specified demographic models and to estimate the best model and associated parameters using likelihood-based local optimizations. However, there are no known algorithms to perform global searches of demographic models with a given AFS. RESULTS: Here, we introduce a new method that implements a global search using a genetic algorithm for the automatic and unsupervised inference of demographic history from joint AFS data. Our method is implemented in the software GADMA (Genetic Algorithm for Demographic Model Analysis, https://github.com/ctlab/GADMA). CONCLUSIONS: We demonstrate the performance of GADMA by applying it to sequence data from humans and non-model organisms and show that it is able to automatically infer a demographic model close to or even better than the one that was previously obtained manually. Moreover, GADMA is able to infer multiple demographic models at different local optima close to the global one, providing a larger set of possible scenarios to further explore demographic history. |
format | Online Article Text |
id | pubmed-7049072 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-70490722020-03-03 GADMA: Genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data Noskova, Ekaterina Ulyantsev, Vladimir Koepfli, Klaus-Peter O’Brien, Stephen J Dobrynin, Pavel Gigascience Technical Note BACKGROUND: The demographic history of any population is imprinted in the genomes of the individuals that make up the population. One of the most popular and convenient representations of genetic information is the allele frequency spectrum (AFS), the distribution of allele frequencies in populations. The joint AFS is commonly used to reconstruct the demographic history of multiple populations, and several methods based on diffusion approximation (e.g., ∂a∂i) and ordinary differential equations (e.g., moments) have been developed and applied for demographic inference. These methods provide an opportunity to simulate AFS under a variety of researcher-specified demographic models and to estimate the best model and associated parameters using likelihood-based local optimizations. However, there are no known algorithms to perform global searches of demographic models with a given AFS. RESULTS: Here, we introduce a new method that implements a global search using a genetic algorithm for the automatic and unsupervised inference of demographic history from joint AFS data. Our method is implemented in the software GADMA (Genetic Algorithm for Demographic Model Analysis, https://github.com/ctlab/GADMA). CONCLUSIONS: We demonstrate the performance of GADMA by applying it to sequence data from humans and non-model organisms and show that it is able to automatically infer a demographic model close to or even better than the one that was previously obtained manually. Moreover, GADMA is able to infer multiple demographic models at different local optima close to the global one, providing a larger set of possible scenarios to further explore demographic history. Oxford University Press 2020-02-29 /pmc/articles/PMC7049072/ /pubmed/32112099 http://dx.doi.org/10.1093/gigascience/giaa005 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Technical Note Noskova, Ekaterina Ulyantsev, Vladimir Koepfli, Klaus-Peter O’Brien, Stephen J Dobrynin, Pavel GADMA: Genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data |
title | GADMA: Genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data |
title_full | GADMA: Genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data |
title_fullStr | GADMA: Genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data |
title_full_unstemmed | GADMA: Genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data |
title_short | GADMA: Genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data |
title_sort | gadma: genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data |
topic | Technical Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7049072/ https://www.ncbi.nlm.nih.gov/pubmed/32112099 http://dx.doi.org/10.1093/gigascience/giaa005 |
work_keys_str_mv | AT noskovaekaterina gadmageneticalgorithmforinferringdemographichistoryofmultiplepopulationsfromallelefrequencyspectrumdata AT ulyantsevvladimir gadmageneticalgorithmforinferringdemographichistoryofmultiplepopulationsfromallelefrequencyspectrumdata AT koepfliklauspeter gadmageneticalgorithmforinferringdemographichistoryofmultiplepopulationsfromallelefrequencyspectrumdata AT obrienstephenj gadmageneticalgorithmforinferringdemographichistoryofmultiplepopulationsfromallelefrequencyspectrumdata AT dobryninpavel gadmageneticalgorithmforinferringdemographichistoryofmultiplepopulationsfromallelefrequencyspectrumdata |