
Recent progress on methods for estimating and updating large phylogenies

With the increased availability of sequence data and even of fully sequenced and assembled genomes, phylogeny estimation of very large trees (even of hundreds of thousands of sequences) is now a goal for some biologists. Yet, the construction of these phylogenies is a complex pipeline presenting ana...

Bibliographic Details
Main Authors: Zaharias, Paul, Warnow, Tandy
Format: Online Article Text
Language: English
Published: The Royal Society 2022
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9393559/
https://www.ncbi.nlm.nih.gov/pubmed/35989607
http://dx.doi.org/10.1098/rstb.2021.0244
_version_ 1784771295786827776
author Zaharias, Paul
Warnow, Tandy
author_facet Zaharias, Paul
Warnow, Tandy
author_sort Zaharias, Paul
collection PubMed
description With the increased availability of sequence data and even of fully sequenced and assembled genomes, phylogeny estimation of very large trees (even of hundreds of thousands of sequences) is now a goal for some biologists. Yet, the construction of these phylogenies is a complex pipeline presenting analytical and computational challenges, especially when the number of sequences is very large. In the past few years, new methods have been developed that aim to enable highly accurate phylogeny estimations on these large datasets, including divide-and-conquer techniques for multiple sequence alignment and/or tree estimation, methods that can estimate species trees from multi-locus datasets while addressing heterogeneity due to biological processes (e.g. incomplete lineage sorting and gene duplication and loss), and methods to add sequences into large gene trees or species trees. Here we present some of these recent advances and discuss opportunities for future improvements. This article is part of a discussion meeting issue ‘Genomic population structures of microbial pathogens’.
format Online
Article
Text
id pubmed-9393559
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher The Royal Society
record_format MEDLINE/PubMed
spelling pubmed-93935592022-08-30 Recent progress on methods for estimating and updating large phylogenies Zaharias, Paul Warnow, Tandy Philos Trans R Soc Lond B Biol Sci Articles With the increased availability of sequence data and even of fully sequenced and assembled genomes, phylogeny estimation of very large trees (even of hundreds of thousands of sequences) is now a goal for some biologists. Yet, the construction of these phylogenies is a complex pipeline presenting analytical and computational challenges, especially when the number of sequences is very large. In the past few years, new methods have been developed that aim to enable highly accurate phylogeny estimations on these large datasets, including divide-and-conquer techniques for multiple sequence alignment and/or tree estimation, methods that can estimate species trees from multi-locus datasets while addressing heterogeneity due to biological processes (e.g. incomplete lineage sorting and gene duplication and loss), and methods to add sequences into large gene trees or species trees. Here we present some of these recent advances and discuss opportunities for future improvements. This article is part of a discussion meeting issue ‘Genomic population structures of microbial pathogens’. The Royal Society 2022-10-10 2022-08-22 /pmc/articles/PMC9393559/ /pubmed/35989607 http://dx.doi.org/10.1098/rstb.2021.0244 Text en © 2022 The Authors. https://creativecommons.org/licenses/by/4.0/Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, provided the original author and source are credited.
title Recent progress on methods for estimating and updating large phylogenies
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9393559/
https://www.ncbi.nlm.nih.gov/pubmed/35989607
http://dx.doi.org/10.1098/rstb.2021.0244