Cargando…
Complex genetic admixture histories reconstructed with Approximate Bayesian Computation
Admixture is a fundamental evolutionary process that has influenced genetic patterns in numerous species. Maximum‐likelihood approaches based on allele frequencies and linkage‐disequilibrium have been extensively used to infer admixture processes from genome‐wide data sets, mostly in human populatio...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8247995/ https://www.ncbi.nlm.nih.gov/pubmed/33452723 http://dx.doi.org/10.1111/1755-0998.13325 |
_version_ | 1783716630993305600 |
---|---|
author | Fortes‐Lima, Cesar A. Laurent, Romain Thouzeau, Valentin Toupance, Bruno Verdu, Paul |
author_facet | Fortes‐Lima, Cesar A. Laurent, Romain Thouzeau, Valentin Toupance, Bruno Verdu, Paul |
author_sort | Fortes‐Lima, Cesar A. |
collection | PubMed |
description | Admixture is a fundamental evolutionary process that has influenced genetic patterns in numerous species. Maximum‐likelihood approaches based on allele frequencies and linkage‐disequilibrium have been extensively used to infer admixture processes from genome‐wide data sets, mostly in human populations. Nevertheless, complex admixture histories, beyond one or two pulses of admixture, remain methodologically challenging to reconstruct. We developed an Approximate Bayesian Computation (ABC) framework to reconstruct highly complex admixture histories from independent genetic markers. We built the software package methis to simulate independent SNPs or microsatellites in a two‐way admixed population for scenarios with multiple admixture pulses, monotonically decreasing or increasing recurring admixture, or combinations of these scenarios. methis allows users to draw model‐parameter values from prior distributions set by the user, and, for each simulation, methis can calculate numerous summary statistics describing genetic diversity patterns and moments of the distribution of individual admixture fractions. We coupled methis with existing machine‐learning ABC algorithms and investigated the admixture history of admixed populations. Results showed that random forest ABC scenario‐choice could accurately distinguish among most complex admixture scenarios, and errors were mainly found in regions of the parameter space where scenarios were highly nested, and, thus, biologically similar. We focused on African American and Barbadian populations as two study‐cases. We found that neural network ABC posterior parameter estimation was accurate and reasonably conservative under complex admixture scenarios. For both admixed populations, we found that monotonically decreasing contributions over time, from Europe and Africa, explained the observed data more accurately than multiple admixture pulses. This approach will allow for reconstructing detailed admixture histories when maximum‐likelihood methods are intractable. |
format | Online Article Text |
id | pubmed-8247995 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-82479952021-07-02 Complex genetic admixture histories reconstructed with Approximate Bayesian Computation Fortes‐Lima, Cesar A. Laurent, Romain Thouzeau, Valentin Toupance, Bruno Verdu, Paul Mol Ecol Resour RESOURCE ARTICLES Admixture is a fundamental evolutionary process that has influenced genetic patterns in numerous species. Maximum‐likelihood approaches based on allele frequencies and linkage‐disequilibrium have been extensively used to infer admixture processes from genome‐wide data sets, mostly in human populations. Nevertheless, complex admixture histories, beyond one or two pulses of admixture, remain methodologically challenging to reconstruct. We developed an Approximate Bayesian Computation (ABC) framework to reconstruct highly complex admixture histories from independent genetic markers. We built the software package methis to simulate independent SNPs or microsatellites in a two‐way admixed population for scenarios with multiple admixture pulses, monotonically decreasing or increasing recurring admixture, or combinations of these scenarios. methis allows users to draw model‐parameter values from prior distributions set by the user, and, for each simulation, methis can calculate numerous summary statistics describing genetic diversity patterns and moments of the distribution of individual admixture fractions. We coupled methis with existing machine‐learning ABC algorithms and investigated the admixture history of admixed populations. Results showed that random forest ABC scenario‐choice could accurately distinguish among most complex admixture scenarios, and errors were mainly found in regions of the parameter space where scenarios were highly nested, and, thus, biologically similar. We focused on African American and Barbadian populations as two study‐cases. We found that neural network ABC posterior parameter estimation was accurate and reasonably conservative under complex admixture scenarios. For both admixed populations, we found that monotonically decreasing contributions over time, from Europe and Africa, explained the observed data more accurately than multiple admixture pulses. This approach will allow for reconstructing detailed admixture histories when maximum‐likelihood methods are intractable. John Wiley and Sons Inc. 2021-02-26 2021-05 /pmc/articles/PMC8247995/ /pubmed/33452723 http://dx.doi.org/10.1111/1755-0998.13325 Text en © 2021 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made. |
spellingShingle | RESOURCE ARTICLES Fortes‐Lima, Cesar A. Laurent, Romain Thouzeau, Valentin Toupance, Bruno Verdu, Paul Complex genetic admixture histories reconstructed with Approximate Bayesian Computation |
title | Complex genetic admixture histories reconstructed with Approximate Bayesian Computation |
title_full | Complex genetic admixture histories reconstructed with Approximate Bayesian Computation |
title_fullStr | Complex genetic admixture histories reconstructed with Approximate Bayesian Computation |
title_full_unstemmed | Complex genetic admixture histories reconstructed with Approximate Bayesian Computation |
title_short | Complex genetic admixture histories reconstructed with Approximate Bayesian Computation |
title_sort | complex genetic admixture histories reconstructed with approximate bayesian computation |
topic | RESOURCE ARTICLES |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8247995/ https://www.ncbi.nlm.nih.gov/pubmed/33452723 http://dx.doi.org/10.1111/1755-0998.13325 |
work_keys_str_mv | AT forteslimacesara complexgeneticadmixturehistoriesreconstructedwithapproximatebayesiancomputation AT laurentromain complexgeneticadmixturehistoriesreconstructedwithapproximatebayesiancomputation AT thouzeauvalentin complexgeneticadmixturehistoriesreconstructedwithapproximatebayesiancomputation AT toupancebruno complexgeneticadmixturehistoriesreconstructedwithapproximatebayesiancomputation AT verdupaul complexgeneticadmixturehistoriesreconstructedwithapproximatebayesiancomputation |