Cargando…

Bayesian inference of ancestral recombination graphs

We present a novel algorithm, implemented in the software ARGinfer, for probabilistic inference of the Ancestral Recombination Graph under the Coalescent with Recombination. Our Markov Chain Monte Carlo algorithm takes advantage of the Succinct Tree Sequence data structure that has allowed great adv...

Descripción completa

Detalles Bibliográficos
Autores principales: Mahmoudi, Ali, Koskela, Jere, Kelleher, Jerome, Chan, Yao-ban, Balding, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8936483/
https://www.ncbi.nlm.nih.gov/pubmed/35263345
http://dx.doi.org/10.1371/journal.pcbi.1009960
_version_ 1784672229131288576
author Mahmoudi, Ali
Koskela, Jere
Kelleher, Jerome
Chan, Yao-ban
Balding, David
author_facet Mahmoudi, Ali
Koskela, Jere
Kelleher, Jerome
Chan, Yao-ban
Balding, David
author_sort Mahmoudi, Ali
collection PubMed
description We present a novel algorithm, implemented in the software ARGinfer, for probabilistic inference of the Ancestral Recombination Graph under the Coalescent with Recombination. Our Markov Chain Monte Carlo algorithm takes advantage of the Succinct Tree Sequence data structure that has allowed great advances in simulation and point estimation, but not yet probabilistic inference. Unlike previous methods, which employ the Sequentially Markov Coalescent approximation, ARGinfer uses the Coalescent with Recombination, allowing more accurate inference of key evolutionary parameters. We show using simulations that ARGinfer can accurately estimate many properties of the evolutionary history of the sample, including the topology and branch lengths of the genealogical tree at each sequence site, and the times and locations of mutation and recombination events. ARGinfer approximates posterior probability distributions for these and other quantities, providing interpretable assessments of uncertainty that we show to be well calibrated. ARGinfer is currently limited to tens of DNA sequences of several hundreds of kilobases, but has scope for further computational improvements to increase its applicability.
format Online
Article
Text
id pubmed-8936483
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-89364832022-03-22 Bayesian inference of ancestral recombination graphs Mahmoudi, Ali Koskela, Jere Kelleher, Jerome Chan, Yao-ban Balding, David PLoS Comput Biol Research Article We present a novel algorithm, implemented in the software ARGinfer, for probabilistic inference of the Ancestral Recombination Graph under the Coalescent with Recombination. Our Markov Chain Monte Carlo algorithm takes advantage of the Succinct Tree Sequence data structure that has allowed great advances in simulation and point estimation, but not yet probabilistic inference. Unlike previous methods, which employ the Sequentially Markov Coalescent approximation, ARGinfer uses the Coalescent with Recombination, allowing more accurate inference of key evolutionary parameters. We show using simulations that ARGinfer can accurately estimate many properties of the evolutionary history of the sample, including the topology and branch lengths of the genealogical tree at each sequence site, and the times and locations of mutation and recombination events. ARGinfer approximates posterior probability distributions for these and other quantities, providing interpretable assessments of uncertainty that we show to be well calibrated. ARGinfer is currently limited to tens of DNA sequences of several hundreds of kilobases, but has scope for further computational improvements to increase its applicability. Public Library of Science 2022-03-09 /pmc/articles/PMC8936483/ /pubmed/35263345 http://dx.doi.org/10.1371/journal.pcbi.1009960 Text en © 2022 Mahmoudi et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Mahmoudi, Ali
Koskela, Jere
Kelleher, Jerome
Chan, Yao-ban
Balding, David
Bayesian inference of ancestral recombination graphs
title Bayesian inference of ancestral recombination graphs
title_full Bayesian inference of ancestral recombination graphs
title_fullStr Bayesian inference of ancestral recombination graphs
title_full_unstemmed Bayesian inference of ancestral recombination graphs
title_short Bayesian inference of ancestral recombination graphs
title_sort bayesian inference of ancestral recombination graphs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8936483/
https://www.ncbi.nlm.nih.gov/pubmed/35263345
http://dx.doi.org/10.1371/journal.pcbi.1009960
work_keys_str_mv AT mahmoudiali bayesianinferenceofancestralrecombinationgraphs
AT koskelajere bayesianinferenceofancestralrecombinationgraphs
AT kelleherjerome bayesianinferenceofancestralrecombinationgraphs
AT chanyaoban bayesianinferenceofancestralrecombinationgraphs
AT baldingdavid bayesianinferenceofancestralrecombinationgraphs