Cargando…

RNRdb, a curated database of the universal enzyme family ribonucleotide reductase, reveals a high level of misannotation in sequences deposited to Genbank

BACKGROUND: Ribonucleotide reductases (RNRs) catalyse the only known de novo pathway for deoxyribonucleotide synthesis, and are therefore essential to DNA-based life. While ribonucleotide reduction has a single evolutionary origin, significant differences between RNRs nevertheless exist, notably in...

Descripción completa

Detalles Bibliográficos
Autores principales: Lundin, Daniel, Torrents, Eduard, Poole, Anthony M, Sjöberg, Britt-Marie
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795772/
https://www.ncbi.nlm.nih.gov/pubmed/19995434
http://dx.doi.org/10.1186/1471-2164-10-589
_version_ 1782175450433323008
author Lundin, Daniel
Torrents, Eduard
Poole, Anthony M
Sjöberg, Britt-Marie
author_facet Lundin, Daniel
Torrents, Eduard
Poole, Anthony M
Sjöberg, Britt-Marie
author_sort Lundin, Daniel
collection PubMed
description BACKGROUND: Ribonucleotide reductases (RNRs) catalyse the only known de novo pathway for deoxyribonucleotide synthesis, and are therefore essential to DNA-based life. While ribonucleotide reduction has a single evolutionary origin, significant differences between RNRs nevertheless exist, notably in cofactor requirements, subunit composition and allosteric regulation. These differences result in distinct operational constraints (anaerobicity, iron/oxygen dependence and cobalamin dependence), and form the basis for the classification of RNRs into three classes. DESCRIPTION: In RNRdb (Ribonucleotide Reductase database), we have collated and curated all known RNR protein sequences with the aim of providing a resource for exploration of RNR diversity and distribution. By comparing expert manual annotations with annotations stored in Genbank, we find that significant inaccuracies exist in larger databases. To our surprise, only 23% of protein sequences included in RNRdb are correctly annotated across the key attributes of class, role and function, with 17% being incorrectly annotated across all three categories. This illustrates the utility of specialist databases for applications where a high degree of annotation accuracy may be important. The database houses information on annotation, distribution and diversity of RNRs, and links to solved RNR structures, and can be searched through a BLAST interface. RNRdb is accessible through a public web interface at http://rnrdb.molbio.su.se. CONCLUSION: RNRdb is a specialist database that provides a reliable annotation and classification resource for RNR proteins, as well as a tool to explore distribution patterns of RNR classes. The recent expansion in available genome sequence data have provided us with a picture of RNR distribution that is more complex than believed only a few years ago; our database indicates that RNRs of all three classes are found across all three cellular domains. Moreover, we find a number of organisms that encode all three classes.
format Text
id pubmed-2795772
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27957722009-12-18 RNRdb, a curated database of the universal enzyme family ribonucleotide reductase, reveals a high level of misannotation in sequences deposited to Genbank Lundin, Daniel Torrents, Eduard Poole, Anthony M Sjöberg, Britt-Marie BMC Genomics Database BACKGROUND: Ribonucleotide reductases (RNRs) catalyse the only known de novo pathway for deoxyribonucleotide synthesis, and are therefore essential to DNA-based life. While ribonucleotide reduction has a single evolutionary origin, significant differences between RNRs nevertheless exist, notably in cofactor requirements, subunit composition and allosteric regulation. These differences result in distinct operational constraints (anaerobicity, iron/oxygen dependence and cobalamin dependence), and form the basis for the classification of RNRs into three classes. DESCRIPTION: In RNRdb (Ribonucleotide Reductase database), we have collated and curated all known RNR protein sequences with the aim of providing a resource for exploration of RNR diversity and distribution. By comparing expert manual annotations with annotations stored in Genbank, we find that significant inaccuracies exist in larger databases. To our surprise, only 23% of protein sequences included in RNRdb are correctly annotated across the key attributes of class, role and function, with 17% being incorrectly annotated across all three categories. This illustrates the utility of specialist databases for applications where a high degree of annotation accuracy may be important. The database houses information on annotation, distribution and diversity of RNRs, and links to solved RNR structures, and can be searched through a BLAST interface. RNRdb is accessible through a public web interface at http://rnrdb.molbio.su.se. CONCLUSION: RNRdb is a specialist database that provides a reliable annotation and classification resource for RNR proteins, as well as a tool to explore distribution patterns of RNR classes. The recent expansion in available genome sequence data have provided us with a picture of RNR distribution that is more complex than believed only a few years ago; our database indicates that RNRs of all three classes are found across all three cellular domains. Moreover, we find a number of organisms that encode all three classes. BioMed Central 2009-12-08 /pmc/articles/PMC2795772/ /pubmed/19995434 http://dx.doi.org/10.1186/1471-2164-10-589 Text en Copyright ©2009 Lundin et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database
Lundin, Daniel
Torrents, Eduard
Poole, Anthony M
Sjöberg, Britt-Marie
RNRdb, a curated database of the universal enzyme family ribonucleotide reductase, reveals a high level of misannotation in sequences deposited to Genbank
title RNRdb, a curated database of the universal enzyme family ribonucleotide reductase, reveals a high level of misannotation in sequences deposited to Genbank
title_full RNRdb, a curated database of the universal enzyme family ribonucleotide reductase, reveals a high level of misannotation in sequences deposited to Genbank
title_fullStr RNRdb, a curated database of the universal enzyme family ribonucleotide reductase, reveals a high level of misannotation in sequences deposited to Genbank
title_full_unstemmed RNRdb, a curated database of the universal enzyme family ribonucleotide reductase, reveals a high level of misannotation in sequences deposited to Genbank
title_short RNRdb, a curated database of the universal enzyme family ribonucleotide reductase, reveals a high level of misannotation in sequences deposited to Genbank
title_sort rnrdb, a curated database of the universal enzyme family ribonucleotide reductase, reveals a high level of misannotation in sequences deposited to genbank
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795772/
https://www.ncbi.nlm.nih.gov/pubmed/19995434
http://dx.doi.org/10.1186/1471-2164-10-589
work_keys_str_mv AT lundindaniel rnrdbacurateddatabaseoftheuniversalenzymefamilyribonucleotidereductaserevealsahighlevelofmisannotationinsequencesdepositedtogenbank
AT torrentseduard rnrdbacurateddatabaseoftheuniversalenzymefamilyribonucleotidereductaserevealsahighlevelofmisannotationinsequencesdepositedtogenbank
AT pooleanthonym rnrdbacurateddatabaseoftheuniversalenzymefamilyribonucleotidereductaserevealsahighlevelofmisannotationinsequencesdepositedtogenbank
AT sjobergbrittmarie rnrdbacurateddatabaseoftheuniversalenzymefamilyribonucleotidereductaserevealsahighlevelofmisannotationinsequencesdepositedtogenbank