Cargando…

A versatile computational pipeline for bacterial genome annotation improvement and comparative analysis, with Brucella as a use case

We present a bacterial genome computational analysis pipeline, called GenVar. The pipeline, based on the program GeneWise, is designed to analyze an annotated genome and automatically identify missed gene calls and sequence variants such as genes with disrupted reading frames (split genes) and those...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, G.X., Snyder, E.E., Boyle, S.M., Crasta, O.R., Czar, M., Mane, S.P., Purkayastha, A., Sobral, B., Setubal, J.C.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1919506/
https://www.ncbi.nlm.nih.gov/pubmed/17553834
http://dx.doi.org/10.1093/nar/gkm377
_version_ 1782134184567898112
author Yu, G.X.
Snyder, E.E.
Boyle, S.M.
Crasta, O.R.
Czar, M.
Mane, S.P.
Purkayastha, A.
Sobral, B.
Setubal, J.C.
author_facet Yu, G.X.
Snyder, E.E.
Boyle, S.M.
Crasta, O.R.
Czar, M.
Mane, S.P.
Purkayastha, A.
Sobral, B.
Setubal, J.C.
author_sort Yu, G.X.
collection PubMed
description We present a bacterial genome computational analysis pipeline, called GenVar. The pipeline, based on the program GeneWise, is designed to analyze an annotated genome and automatically identify missed gene calls and sequence variants such as genes with disrupted reading frames (split genes) and those with insertions and deletions (indels). For a given genome to be analyzed, GenVar relies on a database containing closely related genomes (such as other species or strains) as well as a few additional reference genomes. GenVar also helps identify gene disruptions probably caused by sequencing errors. We exemplify GenVar's capabilities by presenting results from the analysis of four Brucella genomes. Brucella is an important human pathogen and zoonotic agent. The analysis revealed hundreds of missed gene calls, new split genes and indels, several of which are species specific and hence provide valuable clues to the understanding of the genome basis of Brucella pathogenicity and host specificity.
format Text
id pubmed-1919506
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-19195062007-07-24 A versatile computational pipeline for bacterial genome annotation improvement and comparative analysis, with Brucella as a use case Yu, G.X. Snyder, E.E. Boyle, S.M. Crasta, O.R. Czar, M. Mane, S.P. Purkayastha, A. Sobral, B. Setubal, J.C. Nucleic Acids Res Computational Biology We present a bacterial genome computational analysis pipeline, called GenVar. The pipeline, based on the program GeneWise, is designed to analyze an annotated genome and automatically identify missed gene calls and sequence variants such as genes with disrupted reading frames (split genes) and those with insertions and deletions (indels). For a given genome to be analyzed, GenVar relies on a database containing closely related genomes (such as other species or strains) as well as a few additional reference genomes. GenVar also helps identify gene disruptions probably caused by sequencing errors. We exemplify GenVar's capabilities by presenting results from the analysis of four Brucella genomes. Brucella is an important human pathogen and zoonotic agent. The analysis revealed hundreds of missed gene calls, new split genes and indels, several of which are species specific and hence provide valuable clues to the understanding of the genome basis of Brucella pathogenicity and host specificity. Oxford University Press 2007-06 2007-06-06 /pmc/articles/PMC1919506/ /pubmed/17553834 http://dx.doi.org/10.1093/nar/gkm377 Text en © 2007 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Yu, G.X.
Snyder, E.E.
Boyle, S.M.
Crasta, O.R.
Czar, M.
Mane, S.P.
Purkayastha, A.
Sobral, B.
Setubal, J.C.
A versatile computational pipeline for bacterial genome annotation improvement and comparative analysis, with Brucella as a use case
title A versatile computational pipeline for bacterial genome annotation improvement and comparative analysis, with Brucella as a use case
title_full A versatile computational pipeline for bacterial genome annotation improvement and comparative analysis, with Brucella as a use case
title_fullStr A versatile computational pipeline for bacterial genome annotation improvement and comparative analysis, with Brucella as a use case
title_full_unstemmed A versatile computational pipeline for bacterial genome annotation improvement and comparative analysis, with Brucella as a use case
title_short A versatile computational pipeline for bacterial genome annotation improvement and comparative analysis, with Brucella as a use case
title_sort versatile computational pipeline for bacterial genome annotation improvement and comparative analysis, with brucella as a use case
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1919506/
https://www.ncbi.nlm.nih.gov/pubmed/17553834
http://dx.doi.org/10.1093/nar/gkm377
work_keys_str_mv AT yugx aversatilecomputationalpipelineforbacterialgenomeannotationimprovementandcomparativeanalysiswithbrucellaasausecase
AT snyderee aversatilecomputationalpipelineforbacterialgenomeannotationimprovementandcomparativeanalysiswithbrucellaasausecase
AT boylesm aversatilecomputationalpipelineforbacterialgenomeannotationimprovementandcomparativeanalysiswithbrucellaasausecase
AT crastaor aversatilecomputationalpipelineforbacterialgenomeannotationimprovementandcomparativeanalysiswithbrucellaasausecase
AT czarm aversatilecomputationalpipelineforbacterialgenomeannotationimprovementandcomparativeanalysiswithbrucellaasausecase
AT manesp aversatilecomputationalpipelineforbacterialgenomeannotationimprovementandcomparativeanalysiswithbrucellaasausecase
AT purkayasthaa aversatilecomputationalpipelineforbacterialgenomeannotationimprovementandcomparativeanalysiswithbrucellaasausecase
AT sobralb aversatilecomputationalpipelineforbacterialgenomeannotationimprovementandcomparativeanalysiswithbrucellaasausecase
AT setubaljc aversatilecomputationalpipelineforbacterialgenomeannotationimprovementandcomparativeanalysiswithbrucellaasausecase
AT yugx versatilecomputationalpipelineforbacterialgenomeannotationimprovementandcomparativeanalysiswithbrucellaasausecase
AT snyderee versatilecomputationalpipelineforbacterialgenomeannotationimprovementandcomparativeanalysiswithbrucellaasausecase
AT boylesm versatilecomputationalpipelineforbacterialgenomeannotationimprovementandcomparativeanalysiswithbrucellaasausecase
AT crastaor versatilecomputationalpipelineforbacterialgenomeannotationimprovementandcomparativeanalysiswithbrucellaasausecase
AT czarm versatilecomputationalpipelineforbacterialgenomeannotationimprovementandcomparativeanalysiswithbrucellaasausecase
AT manesp versatilecomputationalpipelineforbacterialgenomeannotationimprovementandcomparativeanalysiswithbrucellaasausecase
AT purkayasthaa versatilecomputationalpipelineforbacterialgenomeannotationimprovementandcomparativeanalysiswithbrucellaasausecase
AT sobralb versatilecomputationalpipelineforbacterialgenomeannotationimprovementandcomparativeanalysiswithbrucellaasausecase
AT setubaljc versatilecomputationalpipelineforbacterialgenomeannotationimprovementandcomparativeanalysiswithbrucellaasausecase