Cargando…

Improving pan-genome annotation using whole genome multiple alignment

BACKGROUND: Rapid annotation and comparisons of genomes from multiple isolates (pan-genomes) is becoming commonplace due to advances in sequencing technology. Genome annotations can contain inconsistencies and errors that hinder comparative analysis even within a single species. Tools are needed to...

Descripción completa

Detalles Bibliográficos
Autores principales: Angiuoli, Samuel V, Dunning Hotopp, Julie C, Salzberg, Steven L, Tettelin, Hervé
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3142524/
https://www.ncbi.nlm.nih.gov/pubmed/21718539
http://dx.doi.org/10.1186/1471-2105-12-272
_version_ 1782208838825410560
author Angiuoli, Samuel V
Dunning Hotopp, Julie C
Salzberg, Steven L
Tettelin, Hervé
author_facet Angiuoli, Samuel V
Dunning Hotopp, Julie C
Salzberg, Steven L
Tettelin, Hervé
author_sort Angiuoli, Samuel V
collection PubMed
description BACKGROUND: Rapid annotation and comparisons of genomes from multiple isolates (pan-genomes) is becoming commonplace due to advances in sequencing technology. Genome annotations can contain inconsistencies and errors that hinder comparative analysis even within a single species. Tools are needed to compare and improve annotation quality across sets of closely related genomes. RESULTS: We introduce a new tool, Mugsy-Annotator, that identifies orthologs and evaluates annotation quality in prokaryotic genomes using whole genome multiple alignment. Mugsy-Annotator identifies anomalies in annotated gene structures, including inconsistently located translation initiation sites and disrupted genes due to draft genome sequencing or pseudogenes. An evaluation of species pan-genomes using the tool indicates that such anomalies are common, especially at translation initiation sites. Mugsy-Annotator reports alternate annotations that improve consistency and are candidates for further review. CONCLUSIONS: Whole genome multiple alignment can be used to efficiently identify orthologs and annotation problem areas in a bacterial pan-genome. Comparisons of annotated gene structures within a species may show more variation than is actually present in the genome, indicating errors in genome annotation. Our new tool Mugsy-Annotator assists re-annotation efforts by highlighting edits that improve annotation consistency.
format Online
Article
Text
id pubmed-3142524
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31425242011-07-24 Improving pan-genome annotation using whole genome multiple alignment Angiuoli, Samuel V Dunning Hotopp, Julie C Salzberg, Steven L Tettelin, Hervé BMC Bioinformatics Methodology Article BACKGROUND: Rapid annotation and comparisons of genomes from multiple isolates (pan-genomes) is becoming commonplace due to advances in sequencing technology. Genome annotations can contain inconsistencies and errors that hinder comparative analysis even within a single species. Tools are needed to compare and improve annotation quality across sets of closely related genomes. RESULTS: We introduce a new tool, Mugsy-Annotator, that identifies orthologs and evaluates annotation quality in prokaryotic genomes using whole genome multiple alignment. Mugsy-Annotator identifies anomalies in annotated gene structures, including inconsistently located translation initiation sites and disrupted genes due to draft genome sequencing or pseudogenes. An evaluation of species pan-genomes using the tool indicates that such anomalies are common, especially at translation initiation sites. Mugsy-Annotator reports alternate annotations that improve consistency and are candidates for further review. CONCLUSIONS: Whole genome multiple alignment can be used to efficiently identify orthologs and annotation problem areas in a bacterial pan-genome. Comparisons of annotated gene structures within a species may show more variation than is actually present in the genome, indicating errors in genome annotation. Our new tool Mugsy-Annotator assists re-annotation efforts by highlighting edits that improve annotation consistency. BioMed Central 2011-06-30 /pmc/articles/PMC3142524/ /pubmed/21718539 http://dx.doi.org/10.1186/1471-2105-12-272 Text en Copyright ©2011 Angiuoli et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Angiuoli, Samuel V
Dunning Hotopp, Julie C
Salzberg, Steven L
Tettelin, Hervé
Improving pan-genome annotation using whole genome multiple alignment
title Improving pan-genome annotation using whole genome multiple alignment
title_full Improving pan-genome annotation using whole genome multiple alignment
title_fullStr Improving pan-genome annotation using whole genome multiple alignment
title_full_unstemmed Improving pan-genome annotation using whole genome multiple alignment
title_short Improving pan-genome annotation using whole genome multiple alignment
title_sort improving pan-genome annotation using whole genome multiple alignment
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3142524/
https://www.ncbi.nlm.nih.gov/pubmed/21718539
http://dx.doi.org/10.1186/1471-2105-12-272
work_keys_str_mv AT angiuolisamuelv improvingpangenomeannotationusingwholegenomemultiplealignment
AT dunninghotoppjuliec improvingpangenomeannotationusingwholegenomemultiplealignment
AT salzbergstevenl improvingpangenomeannotationusingwholegenomemultiplealignment
AT tettelinherve improvingpangenomeannotationusingwholegenomemultiplealignment