Cargando…

Quartet Fiduccia–Mattheyses revisited for larger phylogenetic studies

MOTIVATION: With the recent breakthroughs in sequencing technology, phylogeny estimation at a larger scale has become a huge opportunity. For accurate estimation of large-scale phylogeny, substantial endeavor is being devoted in introducing new algorithms or upgrading current approaches. In this wor...

Descripción completa

Detalles Bibliográficos
Autores principales: Mim, Sharmin Akter, Zarif-Ul-Alam, Md, Reaz, Rezwana, Bayzid, Md Shamsuzzoha, Rahman, Mohammad Saifur
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10260390/
https://www.ncbi.nlm.nih.gov/pubmed/37285316
http://dx.doi.org/10.1093/bioinformatics/btad332
_version_ 1785057849803538432
author Mim, Sharmin Akter
Zarif-Ul-Alam, Md
Reaz, Rezwana
Bayzid, Md Shamsuzzoha
Rahman, Mohammad Saifur
author_facet Mim, Sharmin Akter
Zarif-Ul-Alam, Md
Reaz, Rezwana
Bayzid, Md Shamsuzzoha
Rahman, Mohammad Saifur
author_sort Mim, Sharmin Akter
collection PubMed
description MOTIVATION: With the recent breakthroughs in sequencing technology, phylogeny estimation at a larger scale has become a huge opportunity. For accurate estimation of large-scale phylogeny, substantial endeavor is being devoted in introducing new algorithms or upgrading current approaches. In this work, we endeavor to improve the Quartet Fiduccia and Mattheyses (QFM) algorithm to resolve phylogenetic trees of better quality with better running time. QFM was already being appreciated by researchers for its good tree quality, but fell short in larger phylogenomic studies due to its excessively slow running time. RESULTS: We have re-designed QFM so that it can amalgamate millions of quartets over thousands of taxa into a species tree with a great level of accuracy within a short amount of time. Named “QFM Fast and Improved (QFM-FI)”, our version is 20 000× faster than the previous version and 400× faster than the widely used variant of QFM implemented in PAUP* on larger datasets. We have also provided a theoretical analysis of the running time and memory requirements of QFM-FI. We have conducted a comparative study of QFM-FI with other state-of-the-art phylogeny reconstruction methods, such as QFM, QMC, wQMC, wQFM, and ASTRAL, on simulated as well as real biological datasets. Our results show that QFM-FI improves on the running time and tree quality of QFM and produces trees that are comparable with state-of-the-art methods. AVAILABILITY AND IMPLEMENTATION: QFM-FI is open source and available at https://github.com/sharmin-mim/qfm_java.
format Online
Article
Text
id pubmed-10260390
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-102603902023-06-15 Quartet Fiduccia–Mattheyses revisited for larger phylogenetic studies Mim, Sharmin Akter Zarif-Ul-Alam, Md Reaz, Rezwana Bayzid, Md Shamsuzzoha Rahman, Mohammad Saifur Bioinformatics Original Paper MOTIVATION: With the recent breakthroughs in sequencing technology, phylogeny estimation at a larger scale has become a huge opportunity. For accurate estimation of large-scale phylogeny, substantial endeavor is being devoted in introducing new algorithms or upgrading current approaches. In this work, we endeavor to improve the Quartet Fiduccia and Mattheyses (QFM) algorithm to resolve phylogenetic trees of better quality with better running time. QFM was already being appreciated by researchers for its good tree quality, but fell short in larger phylogenomic studies due to its excessively slow running time. RESULTS: We have re-designed QFM so that it can amalgamate millions of quartets over thousands of taxa into a species tree with a great level of accuracy within a short amount of time. Named “QFM Fast and Improved (QFM-FI)”, our version is 20 000× faster than the previous version and 400× faster than the widely used variant of QFM implemented in PAUP* on larger datasets. We have also provided a theoretical analysis of the running time and memory requirements of QFM-FI. We have conducted a comparative study of QFM-FI with other state-of-the-art phylogeny reconstruction methods, such as QFM, QMC, wQMC, wQFM, and ASTRAL, on simulated as well as real biological datasets. Our results show that QFM-FI improves on the running time and tree quality of QFM and produces trees that are comparable with state-of-the-art methods. AVAILABILITY AND IMPLEMENTATION: QFM-FI is open source and available at https://github.com/sharmin-mim/qfm_java. Oxford University Press 2023-06-07 /pmc/articles/PMC10260390/ /pubmed/37285316 http://dx.doi.org/10.1093/bioinformatics/btad332 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Mim, Sharmin Akter
Zarif-Ul-Alam, Md
Reaz, Rezwana
Bayzid, Md Shamsuzzoha
Rahman, Mohammad Saifur
Quartet Fiduccia–Mattheyses revisited for larger phylogenetic studies
title Quartet Fiduccia–Mattheyses revisited for larger phylogenetic studies
title_full Quartet Fiduccia–Mattheyses revisited for larger phylogenetic studies
title_fullStr Quartet Fiduccia–Mattheyses revisited for larger phylogenetic studies
title_full_unstemmed Quartet Fiduccia–Mattheyses revisited for larger phylogenetic studies
title_short Quartet Fiduccia–Mattheyses revisited for larger phylogenetic studies
title_sort quartet fiduccia–mattheyses revisited for larger phylogenetic studies
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10260390/
https://www.ncbi.nlm.nih.gov/pubmed/37285316
http://dx.doi.org/10.1093/bioinformatics/btad332
work_keys_str_mv AT mimsharminakter quartetfiducciamattheysesrevisitedforlargerphylogeneticstudies
AT zarifulalammd quartetfiducciamattheysesrevisitedforlargerphylogeneticstudies
AT reazrezwana quartetfiducciamattheysesrevisitedforlargerphylogeneticstudies
AT bayzidmdshamsuzzoha quartetfiducciamattheysesrevisitedforlargerphylogeneticstudies
AT rahmanmohammadsaifur quartetfiducciamattheysesrevisitedforlargerphylogeneticstudies