Cargando…

ComPhy: prokaryotic composite distance phylogenies inferred from whole-genome gene sets

BACKGROUND: With the increasing availability of whole genome sequences, it is becoming more and more important to use complete genome sequences for inferring species phylogenies. We developed a new tool ComPhy, 'Composite Distance Phylogeny', based on a composite distance matrix calculated...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Guan Ning, Cai, Zhipeng, Lin, Guohui, Chakraborty, Sounak, Xu, Dong
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648732/
https://www.ncbi.nlm.nih.gov/pubmed/19208152
http://dx.doi.org/10.1186/1471-2105-10-S1-S5
_version_ 1782164975204171776
author Lin, Guan Ning
Cai, Zhipeng
Lin, Guohui
Chakraborty, Sounak
Xu, Dong
author_facet Lin, Guan Ning
Cai, Zhipeng
Lin, Guohui
Chakraborty, Sounak
Xu, Dong
author_sort Lin, Guan Ning
collection PubMed
description BACKGROUND: With the increasing availability of whole genome sequences, it is becoming more and more important to use complete genome sequences for inferring species phylogenies. We developed a new tool ComPhy, 'Composite Distance Phylogeny', based on a composite distance matrix calculated from the comparison of complete gene sets between genome pairs to produce a prokaryotic phylogeny. RESULTS: The composite distance between two genomes is defined by three components: Gene Dispersion Distance (GDD), Genome Breakpoint Distance (GBD) and Gene Content Distance (GCD). GDD quantifies the dispersion of orthologous genes along the genomic coordinates from one genome to another; GBD measures the shared breakpoints between two genomes; GCD measures the level of shared orthologs between two genomes. The phylogenetic tree is constructed from the composite distance matrix using a neighbor joining method. We tested our method on 9 datasets from 398 completely sequenced prokaryotic genomes. We have achieved above 90% agreement in quartet topologies between the tree created by our method and the tree from the Bergey's taxonomy. In comparison to several other phylogenetic analysis methods, our method showed consistently better performance. CONCLUSION: ComPhy is a fast and robust tool for genome-wide inference of evolutionary relationship among genomes. It can be downloaded from .
format Text
id pubmed-2648732
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26487322009-03-03 ComPhy: prokaryotic composite distance phylogenies inferred from whole-genome gene sets Lin, Guan Ning Cai, Zhipeng Lin, Guohui Chakraborty, Sounak Xu, Dong BMC Bioinformatics Research BACKGROUND: With the increasing availability of whole genome sequences, it is becoming more and more important to use complete genome sequences for inferring species phylogenies. We developed a new tool ComPhy, 'Composite Distance Phylogeny', based on a composite distance matrix calculated from the comparison of complete gene sets between genome pairs to produce a prokaryotic phylogeny. RESULTS: The composite distance between two genomes is defined by three components: Gene Dispersion Distance (GDD), Genome Breakpoint Distance (GBD) and Gene Content Distance (GCD). GDD quantifies the dispersion of orthologous genes along the genomic coordinates from one genome to another; GBD measures the shared breakpoints between two genomes; GCD measures the level of shared orthologs between two genomes. The phylogenetic tree is constructed from the composite distance matrix using a neighbor joining method. We tested our method on 9 datasets from 398 completely sequenced prokaryotic genomes. We have achieved above 90% agreement in quartet topologies between the tree created by our method and the tree from the Bergey's taxonomy. In comparison to several other phylogenetic analysis methods, our method showed consistently better performance. CONCLUSION: ComPhy is a fast and robust tool for genome-wide inference of evolutionary relationship among genomes. It can be downloaded from . BioMed Central 2009-01-30 /pmc/articles/PMC2648732/ /pubmed/19208152 http://dx.doi.org/10.1186/1471-2105-10-S1-S5 Text en Copyright © 2009 Lin et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Lin, Guan Ning
Cai, Zhipeng
Lin, Guohui
Chakraborty, Sounak
Xu, Dong
ComPhy: prokaryotic composite distance phylogenies inferred from whole-genome gene sets
title ComPhy: prokaryotic composite distance phylogenies inferred from whole-genome gene sets
title_full ComPhy: prokaryotic composite distance phylogenies inferred from whole-genome gene sets
title_fullStr ComPhy: prokaryotic composite distance phylogenies inferred from whole-genome gene sets
title_full_unstemmed ComPhy: prokaryotic composite distance phylogenies inferred from whole-genome gene sets
title_short ComPhy: prokaryotic composite distance phylogenies inferred from whole-genome gene sets
title_sort comphy: prokaryotic composite distance phylogenies inferred from whole-genome gene sets
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648732/
https://www.ncbi.nlm.nih.gov/pubmed/19208152
http://dx.doi.org/10.1186/1471-2105-10-S1-S5
work_keys_str_mv AT linguanning comphyprokaryoticcompositedistancephylogeniesinferredfromwholegenomegenesets
AT caizhipeng comphyprokaryoticcompositedistancephylogeniesinferredfromwholegenomegenesets
AT linguohui comphyprokaryoticcompositedistancephylogeniesinferredfromwholegenomegenesets
AT chakrabortysounak comphyprokaryoticcompositedistancephylogeniesinferredfromwholegenomegenesets
AT xudong comphyprokaryoticcompositedistancephylogeniesinferredfromwholegenomegenesets