Cargando…

Whole-proteome phylogeny of large dsDNA viruses and parvoviruses through a composition vector method related to dynamical language model

BACKGROUND: The vast sequence divergence among different virus groups has presented a great challenge to alignment-based analysis of virus phylogeny. Due to the problems caused by the uncertainty in alignment, existing tools for phylogenetic analysis based on multiple alignment could not be directly...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Zu-Guo, Chu, Ka Hou, Li, Chi Pang, Anh, Vo, Zhou, Li-Qian, Wang, Roger Wei
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2898692/
https://www.ncbi.nlm.nih.gov/pubmed/20565983
http://dx.doi.org/10.1186/1471-2148-10-192
_version_ 1782183505518657536
author Yu, Zu-Guo
Chu, Ka Hou
Li, Chi Pang
Anh, Vo
Zhou, Li-Qian
Wang, Roger Wei
author_facet Yu, Zu-Guo
Chu, Ka Hou
Li, Chi Pang
Anh, Vo
Zhou, Li-Qian
Wang, Roger Wei
author_sort Yu, Zu-Guo
collection PubMed
description BACKGROUND: The vast sequence divergence among different virus groups has presented a great challenge to alignment-based analysis of virus phylogeny. Due to the problems caused by the uncertainty in alignment, existing tools for phylogenetic analysis based on multiple alignment could not be directly applied to the whole-genome comparison and phylogenomic studies of viruses. There has been a growing interest in alignment-free methods for phylogenetic analysis using complete genome data. Among the alignment-free methods, a dynamical language (DL) method proposed by our group has successfully been applied to the phylogenetic analysis of bacteria and chloroplast genomes. RESULTS: In this paper, the DL method is used to analyze the whole-proteome phylogeny of 124 large dsDNA viruses and 30 parvoviruses, two data sets with large difference in genome size. The trees from our analyses are in good agreement to the latest classification of large dsDNA viruses and parvoviruses by the International Committee on Taxonomy of Viruses (ICTV). CONCLUSIONS: The present method provides a new way for recovering the phylogeny of large dsDNA viruses and parvoviruses, and also some insights on the affiliation of a number of unclassified viruses. In comparison, some alignment-free methods such as the CV Tree method can be used for recovering the phylogeny of large dsDNA viruses, but they are not suitable for resolving the phylogeny of parvoviruses with a much smaller genome size.
format Text
id pubmed-2898692
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28986922010-07-08 Whole-proteome phylogeny of large dsDNA viruses and parvoviruses through a composition vector method related to dynamical language model Yu, Zu-Guo Chu, Ka Hou Li, Chi Pang Anh, Vo Zhou, Li-Qian Wang, Roger Wei BMC Evol Biol Research article BACKGROUND: The vast sequence divergence among different virus groups has presented a great challenge to alignment-based analysis of virus phylogeny. Due to the problems caused by the uncertainty in alignment, existing tools for phylogenetic analysis based on multiple alignment could not be directly applied to the whole-genome comparison and phylogenomic studies of viruses. There has been a growing interest in alignment-free methods for phylogenetic analysis using complete genome data. Among the alignment-free methods, a dynamical language (DL) method proposed by our group has successfully been applied to the phylogenetic analysis of bacteria and chloroplast genomes. RESULTS: In this paper, the DL method is used to analyze the whole-proteome phylogeny of 124 large dsDNA viruses and 30 parvoviruses, two data sets with large difference in genome size. The trees from our analyses are in good agreement to the latest classification of large dsDNA viruses and parvoviruses by the International Committee on Taxonomy of Viruses (ICTV). CONCLUSIONS: The present method provides a new way for recovering the phylogeny of large dsDNA viruses and parvoviruses, and also some insights on the affiliation of a number of unclassified viruses. In comparison, some alignment-free methods such as the CV Tree method can be used for recovering the phylogeny of large dsDNA viruses, but they are not suitable for resolving the phylogeny of parvoviruses with a much smaller genome size. BioMed Central 2010-06-22 /pmc/articles/PMC2898692/ /pubmed/20565983 http://dx.doi.org/10.1186/1471-2148-10-192 Text en Copyright ©2010 Yu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Yu, Zu-Guo
Chu, Ka Hou
Li, Chi Pang
Anh, Vo
Zhou, Li-Qian
Wang, Roger Wei
Whole-proteome phylogeny of large dsDNA viruses and parvoviruses through a composition vector method related to dynamical language model
title Whole-proteome phylogeny of large dsDNA viruses and parvoviruses through a composition vector method related to dynamical language model
title_full Whole-proteome phylogeny of large dsDNA viruses and parvoviruses through a composition vector method related to dynamical language model
title_fullStr Whole-proteome phylogeny of large dsDNA viruses and parvoviruses through a composition vector method related to dynamical language model
title_full_unstemmed Whole-proteome phylogeny of large dsDNA viruses and parvoviruses through a composition vector method related to dynamical language model
title_short Whole-proteome phylogeny of large dsDNA viruses and parvoviruses through a composition vector method related to dynamical language model
title_sort whole-proteome phylogeny of large dsdna viruses and parvoviruses through a composition vector method related to dynamical language model
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2898692/
https://www.ncbi.nlm.nih.gov/pubmed/20565983
http://dx.doi.org/10.1186/1471-2148-10-192
work_keys_str_mv AT yuzuguo wholeproteomephylogenyoflargedsdnavirusesandparvovirusesthroughacompositionvectormethodrelatedtodynamicallanguagemodel
AT chukahou wholeproteomephylogenyoflargedsdnavirusesandparvovirusesthroughacompositionvectormethodrelatedtodynamicallanguagemodel
AT lichipang wholeproteomephylogenyoflargedsdnavirusesandparvovirusesthroughacompositionvectormethodrelatedtodynamicallanguagemodel
AT anhvo wholeproteomephylogenyoflargedsdnavirusesandparvovirusesthroughacompositionvectormethodrelatedtodynamicallanguagemodel
AT zhouliqian wholeproteomephylogenyoflargedsdnavirusesandparvovirusesthroughacompositionvectormethodrelatedtodynamicallanguagemodel
AT wangrogerwei wholeproteomephylogenyoflargedsdnavirusesandparvovirusesthroughacompositionvectormethodrelatedtodynamicallanguagemodel