Cargando…
Recent progress on methods for estimating and updating large phylogenies
With the increased availability of sequence data and even of fully sequenced and assembled genomes, phylogeny estimation of very large trees (even of hundreds of thousands of sequences) is now a goal for some biologists. Yet, the construction of these phylogenies is a complex pipeline presenting ana...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9393559/ https://www.ncbi.nlm.nih.gov/pubmed/35989607 http://dx.doi.org/10.1098/rstb.2021.0244 |
_version_ | 1784771295786827776 |
---|---|
author | Zaharias, Paul Warnow, Tandy |
author_facet | Zaharias, Paul Warnow, Tandy |
author_sort | Zaharias, Paul |
collection | PubMed |
description | With the increased availability of sequence data and even of fully sequenced and assembled genomes, phylogeny estimation of very large trees (even of hundreds of thousands of sequences) is now a goal for some biologists. Yet, the construction of these phylogenies is a complex pipeline presenting analytical and computational challenges, especially when the number of sequences is very large. In the past few years, new methods have been developed that aim to enable highly accurate phylogeny estimations on these large datasets, including divide-and-conquer techniques for multiple sequence alignment and/or tree estimation, methods that can estimate species trees from multi-locus datasets while addressing heterogeneity due to biological processes (e.g. incomplete lineage sorting and gene duplication and loss), and methods to add sequences into large gene trees or species trees. Here we present some of these recent advances and discuss opportunities for future improvements. This article is part of a discussion meeting issue ‘Genomic population structures of microbial pathogens’. |
format | Online Article Text |
id | pubmed-9393559 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | The Royal Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-93935592022-08-30 Recent progress on methods for estimating and updating large phylogenies Zaharias, Paul Warnow, Tandy Philos Trans R Soc Lond B Biol Sci Articles With the increased availability of sequence data and even of fully sequenced and assembled genomes, phylogeny estimation of very large trees (even of hundreds of thousands of sequences) is now a goal for some biologists. Yet, the construction of these phylogenies is a complex pipeline presenting analytical and computational challenges, especially when the number of sequences is very large. In the past few years, new methods have been developed that aim to enable highly accurate phylogeny estimations on these large datasets, including divide-and-conquer techniques for multiple sequence alignment and/or tree estimation, methods that can estimate species trees from multi-locus datasets while addressing heterogeneity due to biological processes (e.g. incomplete lineage sorting and gene duplication and loss), and methods to add sequences into large gene trees or species trees. Here we present some of these recent advances and discuss opportunities for future improvements. This article is part of a discussion meeting issue ‘Genomic population structures of microbial pathogens’. The Royal Society 2022-10-10 2022-08-22 /pmc/articles/PMC9393559/ /pubmed/35989607 http://dx.doi.org/10.1098/rstb.2021.0244 Text en © 2022 The Authors. https://creativecommons.org/licenses/by/4.0/Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, provided the original author and source are credited. |
spellingShingle | Articles Zaharias, Paul Warnow, Tandy Recent progress on methods for estimating and updating large phylogenies |
title | Recent progress on methods for estimating and updating large phylogenies |
title_full | Recent progress on methods for estimating and updating large phylogenies |
title_fullStr | Recent progress on methods for estimating and updating large phylogenies |
title_full_unstemmed | Recent progress on methods for estimating and updating large phylogenies |
title_short | Recent progress on methods for estimating and updating large phylogenies |
title_sort | recent progress on methods for estimating and updating large phylogenies |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9393559/ https://www.ncbi.nlm.nih.gov/pubmed/35989607 http://dx.doi.org/10.1098/rstb.2021.0244 |
work_keys_str_mv | AT zahariaspaul recentprogressonmethodsforestimatingandupdatinglargephylogenies AT warnowtandy recentprogressonmethodsforestimatingandupdatinglargephylogenies |