Cargando…

Disk covering methods improve phylogenomic analyses

MOTIVATION: With the rapid growth rate of newly sequenced genomes, species tree inference from multiple genes has become a basic bioinformatics task in comparative and evolutionary biology. However, accurate species tree estimation is difficult in the presence of gene tree discordance, which is ofte...

Descripción completa

Detalles Bibliográficos
Autores principales: Bayzid, Md Shamsuzzoha, Hunt, Tyler, Warnow, Tandy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4239662/
https://www.ncbi.nlm.nih.gov/pubmed/25572610
http://dx.doi.org/10.1186/1471-2164-15-S6-S7
_version_ 1782345618791858176
author Bayzid, Md Shamsuzzoha
Hunt, Tyler
Warnow, Tandy
author_facet Bayzid, Md Shamsuzzoha
Hunt, Tyler
Warnow, Tandy
author_sort Bayzid, Md Shamsuzzoha
collection PubMed
description MOTIVATION: With the rapid growth rate of newly sequenced genomes, species tree inference from multiple genes has become a basic bioinformatics task in comparative and evolutionary biology. However, accurate species tree estimation is difficult in the presence of gene tree discordance, which is often due to incomplete lineage sorting (ILS), modelled by the multi-species coalescent. Several highly accurate coalescent-based species tree estimation methods have been developed over the last decade, including MP-EST. However, the running time for MP-EST increases rapidly as the number of species grows. RESULTS: We present divide-and-conquer techniques that improve the scalability of MP-EST so that it can run efficiently on large datasets. Surprisingly, this technique also improves the accuracy of species trees estimated by MP-EST, as our study shows on a collection of simulated and biological datasets.
format Online
Article
Text
id pubmed-4239662
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42396622014-11-25 Disk covering methods improve phylogenomic analyses Bayzid, Md Shamsuzzoha Hunt, Tyler Warnow, Tandy BMC Genomics Research MOTIVATION: With the rapid growth rate of newly sequenced genomes, species tree inference from multiple genes has become a basic bioinformatics task in comparative and evolutionary biology. However, accurate species tree estimation is difficult in the presence of gene tree discordance, which is often due to incomplete lineage sorting (ILS), modelled by the multi-species coalescent. Several highly accurate coalescent-based species tree estimation methods have been developed over the last decade, including MP-EST. However, the running time for MP-EST increases rapidly as the number of species grows. RESULTS: We present divide-and-conquer techniques that improve the scalability of MP-EST so that it can run efficiently on large datasets. Surprisingly, this technique also improves the accuracy of species trees estimated by MP-EST, as our study shows on a collection of simulated and biological datasets. BioMed Central 2014-10-17 /pmc/articles/PMC4239662/ /pubmed/25572610 http://dx.doi.org/10.1186/1471-2164-15-S6-S7 Text en Copyright © 2014 Bayzid et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Bayzid, Md Shamsuzzoha
Hunt, Tyler
Warnow, Tandy
Disk covering methods improve phylogenomic analyses
title Disk covering methods improve phylogenomic analyses
title_full Disk covering methods improve phylogenomic analyses
title_fullStr Disk covering methods improve phylogenomic analyses
title_full_unstemmed Disk covering methods improve phylogenomic analyses
title_short Disk covering methods improve phylogenomic analyses
title_sort disk covering methods improve phylogenomic analyses
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4239662/
https://www.ncbi.nlm.nih.gov/pubmed/25572610
http://dx.doi.org/10.1186/1471-2164-15-S6-S7
work_keys_str_mv AT bayzidmdshamsuzzoha diskcoveringmethodsimprovephylogenomicanalyses
AT hunttyler diskcoveringmethodsimprovephylogenomicanalyses
AT warnowtandy diskcoveringmethodsimprovephylogenomicanalyses