SIGI: score-based identification of genomic islands

BACKGROUND: Genomic islands can be observed in many microbial genomes. These stretches of DNA have a conspicuous composition with regard to sequence or encoded functions. Genomic islands are assumed to be frequently acquired via horizontal gene transfer. For the analysis of genome structure and the...

Bibliographic Details
Main Author: Merkl, Rainer
Format: Text
Language: English
Published: BioMed Central 2004
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC394314/
https://www.ncbi.nlm.nih.gov/pubmed/15113412
http://dx.doi.org/10.1186/1471-2105-5-22
author Merkl, Rainer
collection PubMed
description BACKGROUND: Genomic islands can be observed in many microbial genomes. These stretches of DNA have a conspicuous composition with regard to sequence or encoded functions. Genomic islands are assumed to be frequently acquired via horizontal gene transfer. For the analysis of genome structure and the study of horizontal gene transfer, it is necessary to reliably identify and characterize these islands. RESULTS: A scoring scheme based on codon frequencies, Score_G1G2(cdn) = log(f_G2(cdn) / f_G1(cdn)), was used. To analyse genes of a species G1 and to test their relatedness to a species G2, scores were determined by applying the formula to log-odds derived from the mean codon frequencies of the two genomes. A non-redundant set of nearly 400 codon usage tables of microbial species was derived; its members were used in turn as G2. Genes having at least one score above a species-specific, dynamically determined cut-off value were analysed further. By means of cluster analysis, genes were identified that form clusters of statistically significant size. These clusters were predicted to be genomic islands. Finally, for each of these genes individually, the taxonomical relation among the species responsible for significant scores was interpreted. The validity of the approach and its limitations were made plausible by an extensive analysis of natural genes and of synthetic genes designed to model the process of gene amelioration. CONCLUSIONS: The method reliably identifies genomic islands and the likely origin of alien genes.
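The scoring formula in the description, Score_G1G2(cdn) = log(f_G2(cdn) / f_G1(cdn)), can be illustrated with a minimal Python sketch. It is not the SIGI implementation. The function names, the pseudocount that guards against codons missing from one table, the pooling of codon counts across sequences (a simplification of the mean codon frequencies the description mentions), and the summing of codon scores over a gene are all assumptions made for illustration.

```python
import math
from collections import Counter


def split_codons(seq):
    """Split an in-frame coding sequence into codons, dropping a trailing partial codon."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def codon_frequencies(sequences):
    """Relative codon frequencies pooled over a set of coding sequences (assumed simplification)."""
    counts = Counter()
    for seq in sequences:
        counts.update(split_codons(seq))
    total = sum(counts.values())
    return {cdn: n / total for cdn, n in counts.items()}


def codon_score(cdn, f_g1, f_g2, pseudo=1e-6):
    """Score_G1G2(cdn) = log(f_G2(cdn) / f_G1(cdn)); the pseudocount is an assumption."""
    return math.log((f_g2.get(cdn, 0.0) + pseudo) / (f_g1.get(cdn, 0.0) + pseudo))


def gene_score(gene, f_g1, f_g2):
    """Sum of codon scores over a gene (assumed aggregation, not stated in the abstract)."""
    return sum(codon_score(cdn, f_g1, f_g2) for cdn in split_codons(gene))


if __name__ == "__main__":
    # Toy codon usage tables built from made-up sequences, not real genomes.
    f_g1 = codon_frequencies(["ATGAAAAAAGCG"])
    f_g2 = codon_frequencies(["ATGAAGAAGGCC"])
    # A positive score means the gene's codon usage looks more like G2 than G1.
    print(gene_score("ATGAAGAAG", f_g1, f_g2))
    print(gene_score("ATGAAAAAA", f_g1, f_g2))
```

Under these assumptions, a gene of G1 whose score against some G2 is strongly positive would be a candidate for the further analysis the description outlines; the cut-off and the cluster analysis are not modelled here.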
format Text
id pubmed-394314
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-394314 2004-04-22 SIGI: score-based identification of genomic islands Merkl, Rainer BMC Bioinformatics Methodology Article BioMed Central 2004-03-03 /pmc/articles/PMC394314/ /pubmed/15113412 http://dx.doi.org/10.1186/1471-2105-5-22 Text en Copyright © 2004 Merkl; licensee BioMed Central Ltd.
This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
title SIGI: score-based identification of genomic islands
topic Methodology Article