Cargando…
Genome-wide gene order distances support clustering the gram-positive bacteria
Initially using 143 genomes, we developed a method for calculating the pair-wise distance between prokaryotic genomes using a Monte Carlo method to estimate the conservation of gene order. The method was based on repeatedly selecting five or six non-adjacent random orthologs from each of two genomes...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4299520/ https://www.ncbi.nlm.nih.gov/pubmed/25653643 http://dx.doi.org/10.3389/fmicb.2014.00785 |
_version_ | 1782353409637089280 |
---|---|
author | House, Christopher H. Pellegrini, Matteo Fitz-Gibbon, Sorel T. |
author_facet | House, Christopher H. Pellegrini, Matteo Fitz-Gibbon, Sorel T. |
author_sort | House, Christopher H. |
collection | PubMed |
description | Initially using 143 genomes, we developed a method for calculating the pair-wise distance between prokaryotic genomes using a Monte Carlo method to estimate the conservation of gene order. The method was based on repeatedly selecting five or six non-adjacent random orthologs from each of two genomes and determining if the chosen orthologs were in the same order. The raw distances were then corrected for gene order convergence using an adaptation of the Jukes-Cantor model, as well as using the common distance correction D′ = −ln(1-D). First, we compared the distances found via the order of six orthologs to distances found based on ortholog gene content and small subunit rRNA sequences. The Jukes-Cantor gene order distances are reasonably well correlated with the divergence of rRNA (R(2) = 0.24), especially at rRNA Jukes-Cantor distances of less than 0.2 (R(2) = 0.52). Gene content is only weakly correlated with rRNA divergence (R(2) = 0.04) over all distances, however, it is especially strongly correlated at rRNA Jukes-Cantor distances of less than 0.1 (R(2) = 0.67). This initial work suggests that gene order may be useful in conjunction with other methods to help understand the relatedness of genomes. Using the gene order distances in 143 genomes, the relations of prokaryotes were studied using neighbor joining and agreement subtrees. We then repeated our study of the relations of prokaryotes using gene order in 172 complete genomes better representing a wider-diversity of prokaryotes. Consistently, our trees show the Actinobacteria as a sister group to the bulk of the Firmicutes. In fact, the robustness of gene order support was found to be considerably greater for uniting these two phyla than for uniting any of the proteobacterial classes together. The results are supportive of the idea that Actinobacteria and Firmicutes are closely related, which in turn implies a single origin for the gram-positive cell. |
format | Online Article Text |
id | pubmed-4299520 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-42995202015-02-04 Genome-wide gene order distances support clustering the gram-positive bacteria House, Christopher H. Pellegrini, Matteo Fitz-Gibbon, Sorel T. Front Microbiol Microbiology Initially using 143 genomes, we developed a method for calculating the pair-wise distance between prokaryotic genomes using a Monte Carlo method to estimate the conservation of gene order. The method was based on repeatedly selecting five or six non-adjacent random orthologs from each of two genomes and determining if the chosen orthologs were in the same order. The raw distances were then corrected for gene order convergence using an adaptation of the Jukes-Cantor model, as well as using the common distance correction D′ = −ln(1-D). First, we compared the distances found via the order of six orthologs to distances found based on ortholog gene content and small subunit rRNA sequences. The Jukes-Cantor gene order distances are reasonably well correlated with the divergence of rRNA (R(2) = 0.24), especially at rRNA Jukes-Cantor distances of less than 0.2 (R(2) = 0.52). Gene content is only weakly correlated with rRNA divergence (R(2) = 0.04) over all distances, however, it is especially strongly correlated at rRNA Jukes-Cantor distances of less than 0.1 (R(2) = 0.67). This initial work suggests that gene order may be useful in conjunction with other methods to help understand the relatedness of genomes. Using the gene order distances in 143 genomes, the relations of prokaryotes were studied using neighbor joining and agreement subtrees. We then repeated our study of the relations of prokaryotes using gene order in 172 complete genomes better representing a wider-diversity of prokaryotes. Consistently, our trees show the Actinobacteria as a sister group to the bulk of the Firmicutes. In fact, the robustness of gene order support was found to be considerably greater for uniting these two phyla than for uniting any of the proteobacterial classes together. The results are supportive of the idea that Actinobacteria and Firmicutes are closely related, which in turn implies a single origin for the gram-positive cell. Frontiers Media S.A. 2015-01-20 /pmc/articles/PMC4299520/ /pubmed/25653643 http://dx.doi.org/10.3389/fmicb.2014.00785 Text en Copyright © 2015 House, Pellegrini and Fitz-Gibbon. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Microbiology House, Christopher H. Pellegrini, Matteo Fitz-Gibbon, Sorel T. Genome-wide gene order distances support clustering the gram-positive bacteria |
title | Genome-wide gene order distances support clustering the gram-positive bacteria |
title_full | Genome-wide gene order distances support clustering the gram-positive bacteria |
title_fullStr | Genome-wide gene order distances support clustering the gram-positive bacteria |
title_full_unstemmed | Genome-wide gene order distances support clustering the gram-positive bacteria |
title_short | Genome-wide gene order distances support clustering the gram-positive bacteria |
title_sort | genome-wide gene order distances support clustering the gram-positive bacteria |
topic | Microbiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4299520/ https://www.ncbi.nlm.nih.gov/pubmed/25653643 http://dx.doi.org/10.3389/fmicb.2014.00785 |
work_keys_str_mv | AT housechristopherh genomewidegeneorderdistancessupportclusteringthegrampositivebacteria AT pellegrinimatteo genomewidegeneorderdistancessupportclusteringthegrampositivebacteria AT fitzgibbonsorelt genomewidegeneorderdistancessupportclusteringthegrampositivebacteria |