Cargando…

A divide-and-conquer method for scalable phylogenetic network inference from multilocus data

MOTIVATION: Reticulate evolutionary histories, such as those arising in the presence of hybridization, are best modeled as phylogenetic networks. Recently developed methods allow for statistical inference of phylogenetic networks while also accounting for other processes, such as incomplete lineage...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Jiafan, Liu, Xinhao, Ogilvie, Huw A, Nakhleh, Luay K
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612858/
https://www.ncbi.nlm.nih.gov/pubmed/31510688
http://dx.doi.org/10.1093/bioinformatics/btz359
_version_ 1783432953157648384
author Zhu, Jiafan
Liu, Xinhao
Ogilvie, Huw A
Nakhleh, Luay K
author_facet Zhu, Jiafan
Liu, Xinhao
Ogilvie, Huw A
Nakhleh, Luay K
author_sort Zhu, Jiafan
collection PubMed
description MOTIVATION: Reticulate evolutionary histories, such as those arising in the presence of hybridization, are best modeled as phylogenetic networks. Recently developed methods allow for statistical inference of phylogenetic networks while also accounting for other processes, such as incomplete lineage sorting. However, these methods can only handle a small number of loci from a handful of genomes. RESULTS: In this article, we introduce a novel two-step method for scalable inference of phylogenetic networks from the sequence alignments of multiple, unlinked loci. The method infers networks on subproblems and then merges them into a network on the full set of taxa. To reduce the number of trinets to infer, we formulate a Hitting Set version of the problem of finding a small number of subsets, and implement a simple heuristic to solve it. We studied their performance, in terms of both running time and accuracy, on simulated as well as on biological datasets. The two-step method accurately infers phylogenetic networks at a scale that is infeasible with existing methods. The results are a significant and promising step towards accurate, large-scale phylogenetic network inference. AVAILABILITY AND IMPLEMENTATION: We implemented the algorithms in the publicly available software package PhyloNet (https://bioinfocs.rice.edu/PhyloNet). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6612858
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-66128582019-07-12 A divide-and-conquer method for scalable phylogenetic network inference from multilocus data Zhu, Jiafan Liu, Xinhao Ogilvie, Huw A Nakhleh, Luay K Bioinformatics Ismb/Eccb 2019 Conference Proceedings MOTIVATION: Reticulate evolutionary histories, such as those arising in the presence of hybridization, are best modeled as phylogenetic networks. Recently developed methods allow for statistical inference of phylogenetic networks while also accounting for other processes, such as incomplete lineage sorting. However, these methods can only handle a small number of loci from a handful of genomes. RESULTS: In this article, we introduce a novel two-step method for scalable inference of phylogenetic networks from the sequence alignments of multiple, unlinked loci. The method infers networks on subproblems and then merges them into a network on the full set of taxa. To reduce the number of trinets to infer, we formulate a Hitting Set version of the problem of finding a small number of subsets, and implement a simple heuristic to solve it. We studied their performance, in terms of both running time and accuracy, on simulated as well as on biological datasets. The two-step method accurately infers phylogenetic networks at a scale that is infeasible with existing methods. The results are a significant and promising step towards accurate, large-scale phylogenetic network inference. AVAILABILITY AND IMPLEMENTATION: We implemented the algorithms in the publicly available software package PhyloNet (https://bioinfocs.rice.edu/PhyloNet). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-07 2019-07-05 /pmc/articles/PMC6612858/ /pubmed/31510688 http://dx.doi.org/10.1093/bioinformatics/btz359 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb/Eccb 2019 Conference Proceedings
Zhu, Jiafan
Liu, Xinhao
Ogilvie, Huw A
Nakhleh, Luay K
A divide-and-conquer method for scalable phylogenetic network inference from multilocus data
title A divide-and-conquer method for scalable phylogenetic network inference from multilocus data
title_full A divide-and-conquer method for scalable phylogenetic network inference from multilocus data
title_fullStr A divide-and-conquer method for scalable phylogenetic network inference from multilocus data
title_full_unstemmed A divide-and-conquer method for scalable phylogenetic network inference from multilocus data
title_short A divide-and-conquer method for scalable phylogenetic network inference from multilocus data
title_sort divide-and-conquer method for scalable phylogenetic network inference from multilocus data
topic Ismb/Eccb 2019 Conference Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612858/
https://www.ncbi.nlm.nih.gov/pubmed/31510688
http://dx.doi.org/10.1093/bioinformatics/btz359
work_keys_str_mv AT zhujiafan adivideandconquermethodforscalablephylogeneticnetworkinferencefrommultilocusdata
AT liuxinhao adivideandconquermethodforscalablephylogeneticnetworkinferencefrommultilocusdata
AT ogilviehuwa adivideandconquermethodforscalablephylogeneticnetworkinferencefrommultilocusdata
AT nakhlehluayk adivideandconquermethodforscalablephylogeneticnetworkinferencefrommultilocusdata
AT zhujiafan divideandconquermethodforscalablephylogeneticnetworkinferencefrommultilocusdata
AT liuxinhao divideandconquermethodforscalablephylogeneticnetworkinferencefrommultilocusdata
AT ogilviehuwa divideandconquermethodforscalablephylogeneticnetworkinferencefrommultilocusdata
AT nakhlehluayk divideandconquermethodforscalablephylogeneticnetworkinferencefrommultilocusdata