Cargando…

FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments

BACKGROUND: We recently described FastTree, a tool for inferring phylogenies for alignments with up to hundreds of thousands of sequences. Here, we describe improvements to FastTree that improve its accuracy without sacrificing scalability. METHODOLOGY/PRINCIPAL FINDINGS: Where FastTree 1 used neare...

Descripción completa

Detalles Bibliográficos
Autores principales: Price, Morgan N., Dehal, Paramvir S., Arkin, Adam P.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2835736/
https://www.ncbi.nlm.nih.gov/pubmed/20224823
http://dx.doi.org/10.1371/journal.pone.0009490
_version_ 1782178659120971776
author Price, Morgan N.
Dehal, Paramvir S.
Arkin, Adam P.
author_facet Price, Morgan N.
Dehal, Paramvir S.
Arkin, Adam P.
author_sort Price, Morgan N.
collection PubMed
description BACKGROUND: We recently described FastTree, a tool for inferring phylogenies for alignments with up to hundreds of thousands of sequences. Here, we describe improvements to FastTree that improve its accuracy without sacrificing scalability. METHODOLOGY/PRINCIPAL FINDINGS: Where FastTree 1 used nearest-neighbor interchanges (NNIs) and the minimum-evolution criterion to improve the tree, FastTree 2 adds minimum-evolution subtree-pruning-regrafting (SPRs) and maximum-likelihood NNIs. FastTree 2 uses heuristics to restrict the search for better trees and estimates a rate of evolution for each site (the “CAT” approximation). Nevertheless, for both simulated and genuine alignments, FastTree 2 is slightly more accurate than a standard implementation of maximum-likelihood NNIs (PhyML 3 with default settings). Although FastTree 2 is not quite as accurate as methods that use maximum-likelihood SPRs, most of the splits that disagree are poorly supported, and for large alignments, FastTree 2 is 100–1,000 times faster. FastTree 2 inferred a topology and likelihood-based local support values for 237,882 distinct 16S ribosomal RNAs on a desktop computer in 22 hours and 5.8 gigabytes of memory. CONCLUSIONS/SIGNIFICANCE: FastTree 2 allows the inference of maximum-likelihood phylogenies for huge alignments. FastTree 2 is freely available at http://www.microbesonline.org/fasttree.
format Text
id pubmed-2835736
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-28357362010-03-12 FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments Price, Morgan N. Dehal, Paramvir S. Arkin, Adam P. PLoS One Research Article BACKGROUND: We recently described FastTree, a tool for inferring phylogenies for alignments with up to hundreds of thousands of sequences. Here, we describe improvements to FastTree that improve its accuracy without sacrificing scalability. METHODOLOGY/PRINCIPAL FINDINGS: Where FastTree 1 used nearest-neighbor interchanges (NNIs) and the minimum-evolution criterion to improve the tree, FastTree 2 adds minimum-evolution subtree-pruning-regrafting (SPRs) and maximum-likelihood NNIs. FastTree 2 uses heuristics to restrict the search for better trees and estimates a rate of evolution for each site (the “CAT” approximation). Nevertheless, for both simulated and genuine alignments, FastTree 2 is slightly more accurate than a standard implementation of maximum-likelihood NNIs (PhyML 3 with default settings). Although FastTree 2 is not quite as accurate as methods that use maximum-likelihood SPRs, most of the splits that disagree are poorly supported, and for large alignments, FastTree 2 is 100–1,000 times faster. FastTree 2 inferred a topology and likelihood-based local support values for 237,882 distinct 16S ribosomal RNAs on a desktop computer in 22 hours and 5.8 gigabytes of memory. CONCLUSIONS/SIGNIFICANCE: FastTree 2 allows the inference of maximum-likelihood phylogenies for huge alignments. FastTree 2 is freely available at http://www.microbesonline.org/fasttree. Public Library of Science 2010-03-10 /pmc/articles/PMC2835736/ /pubmed/20224823 http://dx.doi.org/10.1371/journal.pone.0009490 Text en Price et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Price, Morgan N.
Dehal, Paramvir S.
Arkin, Adam P.
FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments
title FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments
title_full FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments
title_fullStr FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments
title_full_unstemmed FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments
title_short FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments
title_sort fasttree 2 – approximately maximum-likelihood trees for large alignments
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2835736/
https://www.ncbi.nlm.nih.gov/pubmed/20224823
http://dx.doi.org/10.1371/journal.pone.0009490
work_keys_str_mv AT pricemorgann fasttree2approximatelymaximumlikelihoodtreesforlargealignments
AT dehalparamvirs fasttree2approximatelymaximumlikelihoodtreesforlargealignments
AT arkinadamp fasttree2approximatelymaximumlikelihoodtreesforlargealignments