Cargando…

PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics

BACKGROUND: The number of protein structures from structural genomics centers dramatically increases in the Protein Data Bank (PDB). Many of these structures are functionally unannotated because they have no sequence similarity to proteins of known function. However, it is possible to successfully i...

Descripción completa

Detalles Bibliográficos
Autores principales: von Grotthuss, Marcin, Plewczynski, Dariusz, Ginalski, Krzysztof, Rychlewski, Leszek, Shakhnovich, Eugene I
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1409798/
https://www.ncbi.nlm.nih.gov/pubmed/16460560
http://dx.doi.org/10.1186/1471-2105-7-53
_version_ 1782127057363271680
author von Grotthuss, Marcin
Plewczynski, Dariusz
Ginalski, Krzysztof
Rychlewski, Leszek
Shakhnovich, Eugene I
author_facet von Grotthuss, Marcin
Plewczynski, Dariusz
Ginalski, Krzysztof
Rychlewski, Leszek
Shakhnovich, Eugene I
author_sort von Grotthuss, Marcin
collection PubMed
description BACKGROUND: The number of protein structures from structural genomics centers dramatically increases in the Protein Data Bank (PDB). Many of these structures are functionally unannotated because they have no sequence similarity to proteins of known function. However, it is possible to successfully infer function using only structural similarity. RESULTS: Here we present the PDB-UF database, a web-accessible collection of predictions of enzymatic properties using structure-function relationship. The assignments were conducted for three-dimensional protein structures of unknown function that come from structural genomics initiatives. We show that 4 hypothetical proteins (with PDB accession codes: 1VH0, 1NS5, 1O6D, and 1TO0), for which standard BLAST tools such as PSI-BLAST or RPS-BLAST failed to assign any function, are probably methyltransferase enzymes. CONCLUSION: We suggest that the structure-based prediction of an EC number should be conducted having the different similarity score cutoff for different protein folds. Moreover, performing the annotation using two different algorithms can reduce the rate of false positive assignments. We believe, that the presented web-based repository will help to decrease the number of protein structures that have functions marked as "unknown" in the PDB file. AVAILABILITY: and
format Text
id pubmed-1409798
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-14097982006-03-23 PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics von Grotthuss, Marcin Plewczynski, Dariusz Ginalski, Krzysztof Rychlewski, Leszek Shakhnovich, Eugene I BMC Bioinformatics Database BACKGROUND: The number of protein structures from structural genomics centers dramatically increases in the Protein Data Bank (PDB). Many of these structures are functionally unannotated because they have no sequence similarity to proteins of known function. However, it is possible to successfully infer function using only structural similarity. RESULTS: Here we present the PDB-UF database, a web-accessible collection of predictions of enzymatic properties using structure-function relationship. The assignments were conducted for three-dimensional protein structures of unknown function that come from structural genomics initiatives. We show that 4 hypothetical proteins (with PDB accession codes: 1VH0, 1NS5, 1O6D, and 1TO0), for which standard BLAST tools such as PSI-BLAST or RPS-BLAST failed to assign any function, are probably methyltransferase enzymes. CONCLUSION: We suggest that the structure-based prediction of an EC number should be conducted having the different similarity score cutoff for different protein folds. Moreover, performing the annotation using two different algorithms can reduce the rate of false positive assignments. We believe, that the presented web-based repository will help to decrease the number of protein structures that have functions marked as "unknown" in the PDB file. AVAILABILITY: and BioMed Central 2006-02-06 /pmc/articles/PMC1409798/ /pubmed/16460560 http://dx.doi.org/10.1186/1471-2105-7-53 Text en Copyright © 2006 von Grotthuss et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database
von Grotthuss, Marcin
Plewczynski, Dariusz
Ginalski, Krzysztof
Rychlewski, Leszek
Shakhnovich, Eugene I
PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics
title PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics
title_full PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics
title_fullStr PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics
title_full_unstemmed PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics
title_short PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics
title_sort pdb-uf: database of predicted enzymatic functions for unannotated protein structures from structural genomics
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1409798/
https://www.ncbi.nlm.nih.gov/pubmed/16460560
http://dx.doi.org/10.1186/1471-2105-7-53
work_keys_str_mv AT vongrotthussmarcin pdbufdatabaseofpredictedenzymaticfunctionsforunannotatedproteinstructuresfromstructuralgenomics
AT plewczynskidariusz pdbufdatabaseofpredictedenzymaticfunctionsforunannotatedproteinstructuresfromstructuralgenomics
AT ginalskikrzysztof pdbufdatabaseofpredictedenzymaticfunctionsforunannotatedproteinstructuresfromstructuralgenomics
AT rychlewskileszek pdbufdatabaseofpredictedenzymaticfunctionsforunannotatedproteinstructuresfromstructuralgenomics
AT shakhnovicheugenei pdbufdatabaseofpredictedenzymaticfunctionsforunannotatedproteinstructuresfromstructuralgenomics