Cargando…

Rec-DCM-Eigen: Reconstructing a Less Parsimonious but More Accurate Tree in Shorter Time

Maximum parsimony (MP) methods aim to reconstruct the phylogeny of extant species by finding the most parsimonious evolutionary scenario using the species' genome data. MP methods are considered to be accurate, but they are also computationally expensive especially for a large number of species...

Descripción completa

Detalles Bibliográficos
Autores principales: Kang, Seunghwa, Tang, Jijun, Schaeffer, Stephen W., Bader, David A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3160844/
https://www.ncbi.nlm.nih.gov/pubmed/21887219
http://dx.doi.org/10.1371/journal.pone.0022483
_version_ 1782210588505538560
author Kang, Seunghwa
Tang, Jijun
Schaeffer, Stephen W.
Bader, David A.
author_facet Kang, Seunghwa
Tang, Jijun
Schaeffer, Stephen W.
Bader, David A.
author_sort Kang, Seunghwa
collection PubMed
description Maximum parsimony (MP) methods aim to reconstruct the phylogeny of extant species by finding the most parsimonious evolutionary scenario using the species' genome data. MP methods are considered to be accurate, but they are also computationally expensive especially for a large number of species. Several disk-covering methods (DCMs), which decompose the input species to multiple overlapping subgroups (or disks), have been proposed to solve the problem in a divide-and-conquer way. We design a new DCM based on the spectral method and also develop the COGNAC (Comparing Orders of Genes using Novel Algorithms and high-performance Computers) software package. COGNAC uses the new DCM to reduce the phylogenetic tree search space and selects an output tree from the reduced search space based on the MP principle. We test the new DCM using gene order data and inversion distance. The new DCM not only reduces the number of candidate tree topologies but also excludes erroneous tree topologies which can be selected by original MP methods. Initial labeling of internal genomes affects the accuracy of MP methods using gene order data, and the new DCM enables more accurate initial labeling as well. COGNAC demonstrates superior accuracy as a consequence. We compare COGNAC with FastME and the combination of the state of the art DCM (Rec-I-DCM3) and GRAPPA . COGNAC clearly outperforms FastME in accuracy. COGNAC –using the new DCM–also reconstructs a much more accurate tree in significantly shorter time than GRAPPA with Rec-I-DCM3.
format Online
Article
Text
id pubmed-3160844
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31608442011-09-01 Rec-DCM-Eigen: Reconstructing a Less Parsimonious but More Accurate Tree in Shorter Time Kang, Seunghwa Tang, Jijun Schaeffer, Stephen W. Bader, David A. PLoS One Research Article Maximum parsimony (MP) methods aim to reconstruct the phylogeny of extant species by finding the most parsimonious evolutionary scenario using the species' genome data. MP methods are considered to be accurate, but they are also computationally expensive especially for a large number of species. Several disk-covering methods (DCMs), which decompose the input species to multiple overlapping subgroups (or disks), have been proposed to solve the problem in a divide-and-conquer way. We design a new DCM based on the spectral method and also develop the COGNAC (Comparing Orders of Genes using Novel Algorithms and high-performance Computers) software package. COGNAC uses the new DCM to reduce the phylogenetic tree search space and selects an output tree from the reduced search space based on the MP principle. We test the new DCM using gene order data and inversion distance. The new DCM not only reduces the number of candidate tree topologies but also excludes erroneous tree topologies which can be selected by original MP methods. Initial labeling of internal genomes affects the accuracy of MP methods using gene order data, and the new DCM enables more accurate initial labeling as well. COGNAC demonstrates superior accuracy as a consequence. We compare COGNAC with FastME and the combination of the state of the art DCM (Rec-I-DCM3) and GRAPPA . COGNAC clearly outperforms FastME in accuracy. COGNAC –using the new DCM–also reconstructs a much more accurate tree in significantly shorter time than GRAPPA with Rec-I-DCM3. Public Library of Science 2011-08-24 /pmc/articles/PMC3160844/ /pubmed/21887219 http://dx.doi.org/10.1371/journal.pone.0022483 Text en Kang et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Kang, Seunghwa
Tang, Jijun
Schaeffer, Stephen W.
Bader, David A.
Rec-DCM-Eigen: Reconstructing a Less Parsimonious but More Accurate Tree in Shorter Time
title Rec-DCM-Eigen: Reconstructing a Less Parsimonious but More Accurate Tree in Shorter Time
title_full Rec-DCM-Eigen: Reconstructing a Less Parsimonious but More Accurate Tree in Shorter Time
title_fullStr Rec-DCM-Eigen: Reconstructing a Less Parsimonious but More Accurate Tree in Shorter Time
title_full_unstemmed Rec-DCM-Eigen: Reconstructing a Less Parsimonious but More Accurate Tree in Shorter Time
title_short Rec-DCM-Eigen: Reconstructing a Less Parsimonious but More Accurate Tree in Shorter Time
title_sort rec-dcm-eigen: reconstructing a less parsimonious but more accurate tree in shorter time
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3160844/
https://www.ncbi.nlm.nih.gov/pubmed/21887219
http://dx.doi.org/10.1371/journal.pone.0022483
work_keys_str_mv AT kangseunghwa recdcmeigenreconstructingalessparsimoniousbutmoreaccuratetreeinshortertime
AT tangjijun recdcmeigenreconstructingalessparsimoniousbutmoreaccuratetreeinshortertime
AT schaefferstephenw recdcmeigenreconstructingalessparsimoniousbutmoreaccuratetreeinshortertime
AT baderdavida recdcmeigenreconstructingalessparsimoniousbutmoreaccuratetreeinshortertime