Cargando…

A universally applicable method of operon map prediction on minimally annotated genomes using conserved genomic context

An important step in understanding the regulation of a prokaryotic genome is the generation of its transcription unit map. The current strongest operon predictor depends on the distributions of intergenic distances (IGD) separating adjacent genes within and between operons. Unfortunately, experiment...

Descripción completa

Detalles Bibliográficos
Autores principales: Edwards, Martin T., Rison, Stuart C. G., Stoker, Neil G., Wernisch, Lorenz
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1143694/
https://www.ncbi.nlm.nih.gov/pubmed/15942028
http://dx.doi.org/10.1093/nar/gki634
_version_ 1782124295287209984
author Edwards, Martin T.
Rison, Stuart C. G.
Stoker, Neil G.
Wernisch, Lorenz
author_facet Edwards, Martin T.
Rison, Stuart C. G.
Stoker, Neil G.
Wernisch, Lorenz
author_sort Edwards, Martin T.
collection PubMed
description An important step in understanding the regulation of a prokaryotic genome is the generation of its transcription unit map. The current strongest operon predictor depends on the distributions of intergenic distances (IGD) separating adjacent genes within and between operons. Unfortunately, experimental data on these distance distributions are limited to Escherichia coli and Bacillus subtilis. We suggest a new graph algorithmic approach based on comparative genomics to identify clusters of conserved genes independent of IGD and conservation of gene order. As a consequence, distance distributions of operon pairs for any arbitrary prokaryotic genome can be inferred. For E.coli, the algorithm predicts 854 conserved adjacent pairs with a precision of 85%. The IGD distribution for these pairs is virtually identical to the E.coli operon pair distribution. Statistical analysis of the predicted pair IGD distribution allows estimation of a genome-specific operon IGD cut-off, obviating the requirement for a training set in IGD-based operon prediction. We apply the method to a representative set of eight genomes, and show that these genome-specific IGD distributions differ considerably from each other and from the distribution in E.coli.
format Text
id pubmed-1143694
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-11436942005-06-08 A universally applicable method of operon map prediction on minimally annotated genomes using conserved genomic context Edwards, Martin T. Rison, Stuart C. G. Stoker, Neil G. Wernisch, Lorenz Nucleic Acids Res Article An important step in understanding the regulation of a prokaryotic genome is the generation of its transcription unit map. The current strongest operon predictor depends on the distributions of intergenic distances (IGD) separating adjacent genes within and between operons. Unfortunately, experimental data on these distance distributions are limited to Escherichia coli and Bacillus subtilis. We suggest a new graph algorithmic approach based on comparative genomics to identify clusters of conserved genes independent of IGD and conservation of gene order. As a consequence, distance distributions of operon pairs for any arbitrary prokaryotic genome can be inferred. For E.coli, the algorithm predicts 854 conserved adjacent pairs with a precision of 85%. The IGD distribution for these pairs is virtually identical to the E.coli operon pair distribution. Statistical analysis of the predicted pair IGD distribution allows estimation of a genome-specific operon IGD cut-off, obviating the requirement for a training set in IGD-based operon prediction. We apply the method to a representative set of eight genomes, and show that these genome-specific IGD distributions differ considerably from each other and from the distribution in E.coli. Oxford University Press 2005 2005-06-07 /pmc/articles/PMC1143694/ /pubmed/15942028 http://dx.doi.org/10.1093/nar/gki634 Text en © The Author 2005. Published by Oxford University Press. All rights reserved
spellingShingle Article
Edwards, Martin T.
Rison, Stuart C. G.
Stoker, Neil G.
Wernisch, Lorenz
A universally applicable method of operon map prediction on minimally annotated genomes using conserved genomic context
title A universally applicable method of operon map prediction on minimally annotated genomes using conserved genomic context
title_full A universally applicable method of operon map prediction on minimally annotated genomes using conserved genomic context
title_fullStr A universally applicable method of operon map prediction on minimally annotated genomes using conserved genomic context
title_full_unstemmed A universally applicable method of operon map prediction on minimally annotated genomes using conserved genomic context
title_short A universally applicable method of operon map prediction on minimally annotated genomes using conserved genomic context
title_sort universally applicable method of operon map prediction on minimally annotated genomes using conserved genomic context
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1143694/
https://www.ncbi.nlm.nih.gov/pubmed/15942028
http://dx.doi.org/10.1093/nar/gki634
work_keys_str_mv AT edwardsmartint auniversallyapplicablemethodofoperonmappredictiononminimallyannotatedgenomesusingconservedgenomiccontext
AT risonstuartcg auniversallyapplicablemethodofoperonmappredictiononminimallyannotatedgenomesusingconservedgenomiccontext
AT stokerneilg auniversallyapplicablemethodofoperonmappredictiononminimallyannotatedgenomesusingconservedgenomiccontext
AT wernischlorenz auniversallyapplicablemethodofoperonmappredictiononminimallyannotatedgenomesusingconservedgenomiccontext
AT edwardsmartint universallyapplicablemethodofoperonmappredictiononminimallyannotatedgenomesusingconservedgenomiccontext
AT risonstuartcg universallyapplicablemethodofoperonmappredictiononminimallyannotatedgenomesusingconservedgenomiccontext
AT stokerneilg universallyapplicablemethodofoperonmappredictiononminimallyannotatedgenomesusingconservedgenomiccontext
AT wernischlorenz universallyapplicablemethodofoperonmappredictiononminimallyannotatedgenomesusingconservedgenomiccontext