Cargando…

Multiple genome alignment for identifying the core structure among moderately related microbial genomes

BACKGROUND: Identifying the set of intrinsically conserved genes, or the genomic core, among related genomes is crucial for understanding prokaryotic genomes where horizontal gene transfers are common. Although core genome identification appears to be obvious among very closely related genomes, it b...

Descripción completa

Detalles Bibliográficos
Autor principal: Uchiyama, Ikuo
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2615449/
https://www.ncbi.nlm.nih.gov/pubmed/18976470
http://dx.doi.org/10.1186/1471-2164-9-515
_version_ 1782163336136228864
author Uchiyama, Ikuo
author_facet Uchiyama, Ikuo
author_sort Uchiyama, Ikuo
collection PubMed
description BACKGROUND: Identifying the set of intrinsically conserved genes, or the genomic core, among related genomes is crucial for understanding prokaryotic genomes where horizontal gene transfers are common. Although core genome identification appears to be obvious among very closely related genomes, it becomes more difficult when more distantly related genomes are compared. Here, we consider the core structure as a set of sufficiently long segments in which gene orders are conserved so that they are likely to have been inherited mainly through vertical transfer, and developed a method for identifying the core structure by finding the order of pre-identified orthologous groups (OGs) that maximally retains the conserved gene orders. RESULTS: The method was applied to genome comparisons of two well-characterized families, Bacillaceae and Enterobacteriaceae, and identified their core structures comprising 1438 and 2125 OGs, respectively. The core sets contained most of the essential genes and their related genes, which were primarily included in the intersection of the two core sets comprising around 700 OGs. The definition of the genomic core based on gene order conservation was demonstrated to be more robust than the simpler approach based only on gene conservation. We also investigated the core structures in terms of G+C content homogeneity and phylogenetic congruence, and found that the core genes primarily exhibited the expected characteristic, i.e., being indigenous and sharing the same history, more than the non-core genes. CONCLUSION: The results demonstrate that our strategy of genome alignment based on gene order conservation can provide an effective approach to identify the genomic core among moderately related microbial genomes.
format Text
id pubmed-2615449
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26154492009-01-12 Multiple genome alignment for identifying the core structure among moderately related microbial genomes Uchiyama, Ikuo BMC Genomics Research Article BACKGROUND: Identifying the set of intrinsically conserved genes, or the genomic core, among related genomes is crucial for understanding prokaryotic genomes where horizontal gene transfers are common. Although core genome identification appears to be obvious among very closely related genomes, it becomes more difficult when more distantly related genomes are compared. Here, we consider the core structure as a set of sufficiently long segments in which gene orders are conserved so that they are likely to have been inherited mainly through vertical transfer, and developed a method for identifying the core structure by finding the order of pre-identified orthologous groups (OGs) that maximally retains the conserved gene orders. RESULTS: The method was applied to genome comparisons of two well-characterized families, Bacillaceae and Enterobacteriaceae, and identified their core structures comprising 1438 and 2125 OGs, respectively. The core sets contained most of the essential genes and their related genes, which were primarily included in the intersection of the two core sets comprising around 700 OGs. The definition of the genomic core based on gene order conservation was demonstrated to be more robust than the simpler approach based only on gene conservation. We also investigated the core structures in terms of G+C content homogeneity and phylogenetic congruence, and found that the core genes primarily exhibited the expected characteristic, i.e., being indigenous and sharing the same history, more than the non-core genes. CONCLUSION: The results demonstrate that our strategy of genome alignment based on gene order conservation can provide an effective approach to identify the genomic core among moderately related microbial genomes. BioMed Central 2008-10-31 /pmc/articles/PMC2615449/ /pubmed/18976470 http://dx.doi.org/10.1186/1471-2164-9-515 Text en Copyright © 2008 Uchiyama; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Uchiyama, Ikuo
Multiple genome alignment for identifying the core structure among moderately related microbial genomes
title Multiple genome alignment for identifying the core structure among moderately related microbial genomes
title_full Multiple genome alignment for identifying the core structure among moderately related microbial genomes
title_fullStr Multiple genome alignment for identifying the core structure among moderately related microbial genomes
title_full_unstemmed Multiple genome alignment for identifying the core structure among moderately related microbial genomes
title_short Multiple genome alignment for identifying the core structure among moderately related microbial genomes
title_sort multiple genome alignment for identifying the core structure among moderately related microbial genomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2615449/
https://www.ncbi.nlm.nih.gov/pubmed/18976470
http://dx.doi.org/10.1186/1471-2164-9-515
work_keys_str_mv AT uchiyamaikuo multiplegenomealignmentforidentifyingthecorestructureamongmoderatelyrelatedmicrobialgenomes