Cargando…
Improving pan-genome annotation using whole genome multiple alignment
BACKGROUND: Rapid annotation and comparisons of genomes from multiple isolates (pan-genomes) is becoming commonplace due to advances in sequencing technology. Genome annotations can contain inconsistencies and errors that hinder comparative analysis even within a single species. Tools are needed to...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3142524/ https://www.ncbi.nlm.nih.gov/pubmed/21718539 http://dx.doi.org/10.1186/1471-2105-12-272 |
_version_ | 1782208838825410560 |
---|---|
author | Angiuoli, Samuel V Dunning Hotopp, Julie C Salzberg, Steven L Tettelin, Hervé |
author_facet | Angiuoli, Samuel V Dunning Hotopp, Julie C Salzberg, Steven L Tettelin, Hervé |
author_sort | Angiuoli, Samuel V |
collection | PubMed |
description | BACKGROUND: Rapid annotation and comparisons of genomes from multiple isolates (pan-genomes) is becoming commonplace due to advances in sequencing technology. Genome annotations can contain inconsistencies and errors that hinder comparative analysis even within a single species. Tools are needed to compare and improve annotation quality across sets of closely related genomes. RESULTS: We introduce a new tool, Mugsy-Annotator, that identifies orthologs and evaluates annotation quality in prokaryotic genomes using whole genome multiple alignment. Mugsy-Annotator identifies anomalies in annotated gene structures, including inconsistently located translation initiation sites and disrupted genes due to draft genome sequencing or pseudogenes. An evaluation of species pan-genomes using the tool indicates that such anomalies are common, especially at translation initiation sites. Mugsy-Annotator reports alternate annotations that improve consistency and are candidates for further review. CONCLUSIONS: Whole genome multiple alignment can be used to efficiently identify orthologs and annotation problem areas in a bacterial pan-genome. Comparisons of annotated gene structures within a species may show more variation than is actually present in the genome, indicating errors in genome annotation. Our new tool Mugsy-Annotator assists re-annotation efforts by highlighting edits that improve annotation consistency. |
format | Online Article Text |
id | pubmed-3142524 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-31425242011-07-24 Improving pan-genome annotation using whole genome multiple alignment Angiuoli, Samuel V Dunning Hotopp, Julie C Salzberg, Steven L Tettelin, Hervé BMC Bioinformatics Methodology Article BACKGROUND: Rapid annotation and comparisons of genomes from multiple isolates (pan-genomes) is becoming commonplace due to advances in sequencing technology. Genome annotations can contain inconsistencies and errors that hinder comparative analysis even within a single species. Tools are needed to compare and improve annotation quality across sets of closely related genomes. RESULTS: We introduce a new tool, Mugsy-Annotator, that identifies orthologs and evaluates annotation quality in prokaryotic genomes using whole genome multiple alignment. Mugsy-Annotator identifies anomalies in annotated gene structures, including inconsistently located translation initiation sites and disrupted genes due to draft genome sequencing or pseudogenes. An evaluation of species pan-genomes using the tool indicates that such anomalies are common, especially at translation initiation sites. Mugsy-Annotator reports alternate annotations that improve consistency and are candidates for further review. CONCLUSIONS: Whole genome multiple alignment can be used to efficiently identify orthologs and annotation problem areas in a bacterial pan-genome. Comparisons of annotated gene structures within a species may show more variation than is actually present in the genome, indicating errors in genome annotation. Our new tool Mugsy-Annotator assists re-annotation efforts by highlighting edits that improve annotation consistency. BioMed Central 2011-06-30 /pmc/articles/PMC3142524/ /pubmed/21718539 http://dx.doi.org/10.1186/1471-2105-12-272 Text en Copyright ©2011 Angiuoli et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Angiuoli, Samuel V Dunning Hotopp, Julie C Salzberg, Steven L Tettelin, Hervé Improving pan-genome annotation using whole genome multiple alignment |
title | Improving pan-genome annotation using whole genome multiple alignment |
title_full | Improving pan-genome annotation using whole genome multiple alignment |
title_fullStr | Improving pan-genome annotation using whole genome multiple alignment |
title_full_unstemmed | Improving pan-genome annotation using whole genome multiple alignment |
title_short | Improving pan-genome annotation using whole genome multiple alignment |
title_sort | improving pan-genome annotation using whole genome multiple alignment |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3142524/ https://www.ncbi.nlm.nih.gov/pubmed/21718539 http://dx.doi.org/10.1186/1471-2105-12-272 |
work_keys_str_mv | AT angiuolisamuelv improvingpangenomeannotationusingwholegenomemultiplealignment AT dunninghotoppjuliec improvingpangenomeannotationusingwholegenomemultiplealignment AT salzbergstevenl improvingpangenomeannotationusingwholegenomemultiplealignment AT tettelinherve improvingpangenomeannotationusingwholegenomemultiplealignment |