Cargando…

Genome-wide gene order distances support clustering the gram-positive bacteria

Initially using 143 genomes, we developed a method for calculating the pair-wise distance between prokaryotic genomes using a Monte Carlo method to estimate the conservation of gene order. The method was based on repeatedly selecting five or six non-adjacent random orthologs from each of two genomes...

Descripción completa

Detalles Bibliográficos
Autores principales: House, Christopher H., Pellegrini, Matteo, Fitz-Gibbon, Sorel T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4299520/
https://www.ncbi.nlm.nih.gov/pubmed/25653643
http://dx.doi.org/10.3389/fmicb.2014.00785
_version_ 1782353409637089280
author House, Christopher H.
Pellegrini, Matteo
Fitz-Gibbon, Sorel T.
author_facet House, Christopher H.
Pellegrini, Matteo
Fitz-Gibbon, Sorel T.
author_sort House, Christopher H.
collection PubMed
description Initially using 143 genomes, we developed a method for calculating the pair-wise distance between prokaryotic genomes using a Monte Carlo method to estimate the conservation of gene order. The method was based on repeatedly selecting five or six non-adjacent random orthologs from each of two genomes and determining if the chosen orthologs were in the same order. The raw distances were then corrected for gene order convergence using an adaptation of the Jukes-Cantor model, as well as using the common distance correction D′ = −ln(1-D). First, we compared the distances found via the order of six orthologs to distances found based on ortholog gene content and small subunit rRNA sequences. The Jukes-Cantor gene order distances are reasonably well correlated with the divergence of rRNA (R(2) = 0.24), especially at rRNA Jukes-Cantor distances of less than 0.2 (R(2) = 0.52). Gene content is only weakly correlated with rRNA divergence (R(2) = 0.04) over all distances, however, it is especially strongly correlated at rRNA Jukes-Cantor distances of less than 0.1 (R(2) = 0.67). This initial work suggests that gene order may be useful in conjunction with other methods to help understand the relatedness of genomes. Using the gene order distances in 143 genomes, the relations of prokaryotes were studied using neighbor joining and agreement subtrees. We then repeated our study of the relations of prokaryotes using gene order in 172 complete genomes better representing a wider-diversity of prokaryotes. Consistently, our trees show the Actinobacteria as a sister group to the bulk of the Firmicutes. In fact, the robustness of gene order support was found to be considerably greater for uniting these two phyla than for uniting any of the proteobacterial classes together. The results are supportive of the idea that Actinobacteria and Firmicutes are closely related, which in turn implies a single origin for the gram-positive cell.
format Online
Article
Text
id pubmed-4299520
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-42995202015-02-04 Genome-wide gene order distances support clustering the gram-positive bacteria House, Christopher H. Pellegrini, Matteo Fitz-Gibbon, Sorel T. Front Microbiol Microbiology Initially using 143 genomes, we developed a method for calculating the pair-wise distance between prokaryotic genomes using a Monte Carlo method to estimate the conservation of gene order. The method was based on repeatedly selecting five or six non-adjacent random orthologs from each of two genomes and determining if the chosen orthologs were in the same order. The raw distances were then corrected for gene order convergence using an adaptation of the Jukes-Cantor model, as well as using the common distance correction D′ = −ln(1-D). First, we compared the distances found via the order of six orthologs to distances found based on ortholog gene content and small subunit rRNA sequences. The Jukes-Cantor gene order distances are reasonably well correlated with the divergence of rRNA (R(2) = 0.24), especially at rRNA Jukes-Cantor distances of less than 0.2 (R(2) = 0.52). Gene content is only weakly correlated with rRNA divergence (R(2) = 0.04) over all distances, however, it is especially strongly correlated at rRNA Jukes-Cantor distances of less than 0.1 (R(2) = 0.67). This initial work suggests that gene order may be useful in conjunction with other methods to help understand the relatedness of genomes. Using the gene order distances in 143 genomes, the relations of prokaryotes were studied using neighbor joining and agreement subtrees. We then repeated our study of the relations of prokaryotes using gene order in 172 complete genomes better representing a wider-diversity of prokaryotes. Consistently, our trees show the Actinobacteria as a sister group to the bulk of the Firmicutes. In fact, the robustness of gene order support was found to be considerably greater for uniting these two phyla than for uniting any of the proteobacterial classes together. The results are supportive of the idea that Actinobacteria and Firmicutes are closely related, which in turn implies a single origin for the gram-positive cell. Frontiers Media S.A. 2015-01-20 /pmc/articles/PMC4299520/ /pubmed/25653643 http://dx.doi.org/10.3389/fmicb.2014.00785 Text en Copyright © 2015 House, Pellegrini and Fitz-Gibbon. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
House, Christopher H.
Pellegrini, Matteo
Fitz-Gibbon, Sorel T.
Genome-wide gene order distances support clustering the gram-positive bacteria
title Genome-wide gene order distances support clustering the gram-positive bacteria
title_full Genome-wide gene order distances support clustering the gram-positive bacteria
title_fullStr Genome-wide gene order distances support clustering the gram-positive bacteria
title_full_unstemmed Genome-wide gene order distances support clustering the gram-positive bacteria
title_short Genome-wide gene order distances support clustering the gram-positive bacteria
title_sort genome-wide gene order distances support clustering the gram-positive bacteria
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4299520/
https://www.ncbi.nlm.nih.gov/pubmed/25653643
http://dx.doi.org/10.3389/fmicb.2014.00785
work_keys_str_mv AT housechristopherh genomewidegeneorderdistancessupportclusteringthegrampositivebacteria
AT pellegrinimatteo genomewidegeneorderdistancessupportclusteringthegrampositivebacteria
AT fitzgibbonsorelt genomewidegeneorderdistancessupportclusteringthegrampositivebacteria