Cargando…

Harnessing machine learning to guide phylogenetic-tree search algorithms

Inferring a phylogenetic tree is a fundamental challenge in evolutionary studies. Current paradigms for phylogenetic tree reconstruction rely on performing costly likelihood optimizations. With the aim of making tree inference feasible for problems involving more than a handful of sequences, inferen...

Descripción completa

Detalles Bibliográficos
Autores principales: Azouri, Dana, Abadi, Shiran, Mansour, Yishay, Mayrose, Itay, Pupko, Tal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8012635/
https://www.ncbi.nlm.nih.gov/pubmed/33790270
http://dx.doi.org/10.1038/s41467-021-22073-8
_version_ 1783673405837410304
author Azouri, Dana
Abadi, Shiran
Mansour, Yishay
Mayrose, Itay
Pupko, Tal
author_facet Azouri, Dana
Abadi, Shiran
Mansour, Yishay
Mayrose, Itay
Pupko, Tal
author_sort Azouri, Dana
collection PubMed
description Inferring a phylogenetic tree is a fundamental challenge in evolutionary studies. Current paradigms for phylogenetic tree reconstruction rely on performing costly likelihood optimizations. With the aim of making tree inference feasible for problems involving more than a handful of sequences, inference under the maximum-likelihood paradigm integrates heuristic approaches to evaluate only a subset of all potential trees. Consequently, existing methods suffer from the known tradeoff between accuracy and running time. In this proof-of-concept study, we train a machine-learning algorithm over an extensive cohort of empirical data to predict the neighboring trees that increase the likelihood, without actually computing their likelihood. This provides means to safely discard a large set of the search space, thus potentially accelerating heuristic tree searches without losing accuracy. Our analyses suggest that machine learning can guide tree-search methodologies towards the most promising candidate trees.
format Online
Article
Text
id pubmed-8012635
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-80126352021-04-16 Harnessing machine learning to guide phylogenetic-tree search algorithms Azouri, Dana Abadi, Shiran Mansour, Yishay Mayrose, Itay Pupko, Tal Nat Commun Article Inferring a phylogenetic tree is a fundamental challenge in evolutionary studies. Current paradigms for phylogenetic tree reconstruction rely on performing costly likelihood optimizations. With the aim of making tree inference feasible for problems involving more than a handful of sequences, inference under the maximum-likelihood paradigm integrates heuristic approaches to evaluate only a subset of all potential trees. Consequently, existing methods suffer from the known tradeoff between accuracy and running time. In this proof-of-concept study, we train a machine-learning algorithm over an extensive cohort of empirical data to predict the neighboring trees that increase the likelihood, without actually computing their likelihood. This provides means to safely discard a large set of the search space, thus potentially accelerating heuristic tree searches without losing accuracy. Our analyses suggest that machine learning can guide tree-search methodologies towards the most promising candidate trees. Nature Publishing Group UK 2021-03-31 /pmc/articles/PMC8012635/ /pubmed/33790270 http://dx.doi.org/10.1038/s41467-021-22073-8 Text en © The Author(s) 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Azouri, Dana
Abadi, Shiran
Mansour, Yishay
Mayrose, Itay
Pupko, Tal
Harnessing machine learning to guide phylogenetic-tree search algorithms
title Harnessing machine learning to guide phylogenetic-tree search algorithms
title_full Harnessing machine learning to guide phylogenetic-tree search algorithms
title_fullStr Harnessing machine learning to guide phylogenetic-tree search algorithms
title_full_unstemmed Harnessing machine learning to guide phylogenetic-tree search algorithms
title_short Harnessing machine learning to guide phylogenetic-tree search algorithms
title_sort harnessing machine learning to guide phylogenetic-tree search algorithms
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8012635/
https://www.ncbi.nlm.nih.gov/pubmed/33790270
http://dx.doi.org/10.1038/s41467-021-22073-8
work_keys_str_mv AT azouridana harnessingmachinelearningtoguidephylogenetictreesearchalgorithms
AT abadishiran harnessingmachinelearningtoguidephylogenetictreesearchalgorithms
AT mansouryishay harnessingmachinelearningtoguidephylogenetictreesearchalgorithms
AT mayroseitay harnessingmachinelearningtoguidephylogenetictreesearchalgorithms
AT pupkotal harnessingmachinelearningtoguidephylogenetictreesearchalgorithms