Cargando…

ACG: rapid inference of population history from recombining nucleotide sequences

BACKGROUND: Reconstruction of population history from genetic data often requires Monte Carlo integration over the genealogy of the samples. Among tools that perform such computations, few are able to consider genetic histories including recombination events, precluding their use on most alignments...

Descripción completa

Detalles Bibliográficos
Autor principal: O'Fallon, Brendan D
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3575405/
https://www.ncbi.nlm.nih.gov/pubmed/23379678
http://dx.doi.org/10.1186/1471-2105-14-40
_version_ 1782259721012510720
author O'Fallon, Brendan D
author_facet O'Fallon, Brendan D
author_sort O'Fallon, Brendan D
collection PubMed
description BACKGROUND: Reconstruction of population history from genetic data often requires Monte Carlo integration over the genealogy of the samples. Among tools that perform such computations, few are able to consider genetic histories including recombination events, precluding their use on most alignments of nuclear DNA. Explicit consideration of recombinations requires modeling the history of the sequences with an Ancestral Recombination Graph (ARG) in place of a simple tree, which presents significant computational challenges. RESULTS: ACG is an extensible desktop application that uses a Bayesian Markov chain Monte Carlo procedure to estimate the posterior likelihood of an evolutionary model conditional on an alignment of genetic data. The ancestry of the sequences is represented by an ARG, which is estimated from the data with other model parameters. Importantly, ACG computes the full, Felsenstein likelihood of the ARG, not a pairwise or composite likelihood. Several strategies are used to speed computations, and ACG is roughly 100x faster than a similar, recombination-aware program. CONCLUSIONS: Modeling the ancestry of the sequences with an ARG allows ACG to estimate the evolutionary history of recombining nucleotide sequences. ACG can accurately estimate the posterior distribution of population parameters such as the (scaled) population size and recombination rate, as well as many aspects of the recombinant history, including the positions of recombination breakpoints, the distribution of time to most recent common ancestor along the sequence, and the non-recombining trees at individual sites. Multiple substitution models and population size models are provided. ACG also provides a richly informative graphical interface that allows users to view the evolution of model parameters and likelihoods in real time.
format Online
Article
Text
id pubmed-3575405
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35754052013-02-20 ACG: rapid inference of population history from recombining nucleotide sequences O'Fallon, Brendan D BMC Bioinformatics Software BACKGROUND: Reconstruction of population history from genetic data often requires Monte Carlo integration over the genealogy of the samples. Among tools that perform such computations, few are able to consider genetic histories including recombination events, precluding their use on most alignments of nuclear DNA. Explicit consideration of recombinations requires modeling the history of the sequences with an Ancestral Recombination Graph (ARG) in place of a simple tree, which presents significant computational challenges. RESULTS: ACG is an extensible desktop application that uses a Bayesian Markov chain Monte Carlo procedure to estimate the posterior likelihood of an evolutionary model conditional on an alignment of genetic data. The ancestry of the sequences is represented by an ARG, which is estimated from the data with other model parameters. Importantly, ACG computes the full, Felsenstein likelihood of the ARG, not a pairwise or composite likelihood. Several strategies are used to speed computations, and ACG is roughly 100x faster than a similar, recombination-aware program. CONCLUSIONS: Modeling the ancestry of the sequences with an ARG allows ACG to estimate the evolutionary history of recombining nucleotide sequences. ACG can accurately estimate the posterior distribution of population parameters such as the (scaled) population size and recombination rate, as well as many aspects of the recombinant history, including the positions of recombination breakpoints, the distribution of time to most recent common ancestor along the sequence, and the non-recombining trees at individual sites. Multiple substitution models and population size models are provided. ACG also provides a richly informative graphical interface that allows users to view the evolution of model parameters and likelihoods in real time. BioMed Central 2013-02-05 /pmc/articles/PMC3575405/ /pubmed/23379678 http://dx.doi.org/10.1186/1471-2105-14-40 Text en Copyright ©2013 O'Fallon; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
O'Fallon, Brendan D
ACG: rapid inference of population history from recombining nucleotide sequences
title ACG: rapid inference of population history from recombining nucleotide sequences
title_full ACG: rapid inference of population history from recombining nucleotide sequences
title_fullStr ACG: rapid inference of population history from recombining nucleotide sequences
title_full_unstemmed ACG: rapid inference of population history from recombining nucleotide sequences
title_short ACG: rapid inference of population history from recombining nucleotide sequences
title_sort acg: rapid inference of population history from recombining nucleotide sequences
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3575405/
https://www.ncbi.nlm.nih.gov/pubmed/23379678
http://dx.doi.org/10.1186/1471-2105-14-40
work_keys_str_mv AT ofallonbrendand acgrapidinferenceofpopulationhistoryfromrecombiningnucleotidesequences