Cargando…
In search of genome annotation consistency: solid gene clusters and how to use them
Maintaining consistency in genome annotations is important for supporting many computational tasks, particularly metabolic modeling. The SEED project has implemented a process that improves annotation consistencies across microbial genomes for proteins with conserved sequences and genomic context. I...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4026451/ https://www.ncbi.nlm.nih.gov/pubmed/28324432 http://dx.doi.org/10.1007/s13205-013-0152-2 |
_version_ | 1782316840554332160 |
---|---|
author | Davis, James J. Olsen, Gary J. Overbeek, Ross Vonstein, Veronika Xia, Fangfang |
author_facet | Davis, James J. Olsen, Gary J. Overbeek, Ross Vonstein, Veronika Xia, Fangfang |
author_sort | Davis, James J. |
collection | PubMed |
description | Maintaining consistency in genome annotations is important for supporting many computational tasks, particularly metabolic modeling. The SEED project has implemented a process that improves annotation consistencies across microbial genomes for proteins with conserved sequences and genomic context. In this research report, we describe this process and show how this effort has resulted in improvements to microbial genome annotations in the SEED. We also compare SEED annotation consistencies with other commonly used resources such as IMG (the Joint Genome Institute’s Integrated Microbial Genomes system), RefSeq (the National Center for Biotechnology Information’s Reference Sequence Database), Swiss-Prot (the annotated protein sequence database of the Swiss Institute of Bioinformatics, European Molecular Biology Laboratory and the European Bioinformatics Institute) and TrEMBL (Translated European Molecular Biology Laboratory nucleotide sequence data Library). Our analysis indicates that manual and computational efforts are paying off for the databases where consistency is a major goal. |
format | Online Article Text |
id | pubmed-4026451 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-40264512014-05-22 In search of genome annotation consistency: solid gene clusters and how to use them Davis, James J. Olsen, Gary J. Overbeek, Ross Vonstein, Veronika Xia, Fangfang 3 Biotech Short Report Maintaining consistency in genome annotations is important for supporting many computational tasks, particularly metabolic modeling. The SEED project has implemented a process that improves annotation consistencies across microbial genomes for proteins with conserved sequences and genomic context. In this research report, we describe this process and show how this effort has resulted in improvements to microbial genome annotations in the SEED. We also compare SEED annotation consistencies with other commonly used resources such as IMG (the Joint Genome Institute’s Integrated Microbial Genomes system), RefSeq (the National Center for Biotechnology Information’s Reference Sequence Database), Swiss-Prot (the annotated protein sequence database of the Swiss Institute of Bioinformatics, European Molecular Biology Laboratory and the European Bioinformatics Institute) and TrEMBL (Translated European Molecular Biology Laboratory nucleotide sequence data Library). Our analysis indicates that manual and computational efforts are paying off for the databases where consistency is a major goal. Springer Berlin Heidelberg 2013-07-06 2014-06 /pmc/articles/PMC4026451/ /pubmed/28324432 http://dx.doi.org/10.1007/s13205-013-0152-2 Text en © The Author(s) 2013 https://creativecommons.org/licenses/by/2.0/Open AccessThis article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited. |
spellingShingle | Short Report Davis, James J. Olsen, Gary J. Overbeek, Ross Vonstein, Veronika Xia, Fangfang In search of genome annotation consistency: solid gene clusters and how to use them |
title | In search of genome annotation consistency: solid gene clusters and how to use them |
title_full | In search of genome annotation consistency: solid gene clusters and how to use them |
title_fullStr | In search of genome annotation consistency: solid gene clusters and how to use them |
title_full_unstemmed | In search of genome annotation consistency: solid gene clusters and how to use them |
title_short | In search of genome annotation consistency: solid gene clusters and how to use them |
title_sort | in search of genome annotation consistency: solid gene clusters and how to use them |
topic | Short Report |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4026451/ https://www.ncbi.nlm.nih.gov/pubmed/28324432 http://dx.doi.org/10.1007/s13205-013-0152-2 |
work_keys_str_mv | AT davisjamesj insearchofgenomeannotationconsistencysolidgeneclustersandhowtousethem AT olsengaryj insearchofgenomeannotationconsistencysolidgeneclustersandhowtousethem AT overbeekross insearchofgenomeannotationconsistencysolidgeneclustersandhowtousethem AT vonsteinveronika insearchofgenomeannotationconsistencysolidgeneclustersandhowtousethem AT xiafangfang insearchofgenomeannotationconsistencysolidgeneclustersandhowtousethem |