Cargando…
PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics
BACKGROUND: The number of protein structures from structural genomics centers dramatically increases in the Protein Data Bank (PDB). Many of these structures are functionally unannotated because they have no sequence similarity to proteins of known function. However, it is possible to successfully i...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1409798/ https://www.ncbi.nlm.nih.gov/pubmed/16460560 http://dx.doi.org/10.1186/1471-2105-7-53 |
_version_ | 1782127057363271680 |
---|---|
author | von Grotthuss, Marcin Plewczynski, Dariusz Ginalski, Krzysztof Rychlewski, Leszek Shakhnovich, Eugene I |
author_facet | von Grotthuss, Marcin Plewczynski, Dariusz Ginalski, Krzysztof Rychlewski, Leszek Shakhnovich, Eugene I |
author_sort | von Grotthuss, Marcin |
collection | PubMed |
description | BACKGROUND: The number of protein structures from structural genomics centers dramatically increases in the Protein Data Bank (PDB). Many of these structures are functionally unannotated because they have no sequence similarity to proteins of known function. However, it is possible to successfully infer function using only structural similarity. RESULTS: Here we present the PDB-UF database, a web-accessible collection of predictions of enzymatic properties using structure-function relationship. The assignments were conducted for three-dimensional protein structures of unknown function that come from structural genomics initiatives. We show that 4 hypothetical proteins (with PDB accession codes: 1VH0, 1NS5, 1O6D, and 1TO0), for which standard BLAST tools such as PSI-BLAST or RPS-BLAST failed to assign any function, are probably methyltransferase enzymes. CONCLUSION: We suggest that the structure-based prediction of an EC number should be conducted having the different similarity score cutoff for different protein folds. Moreover, performing the annotation using two different algorithms can reduce the rate of false positive assignments. We believe, that the presented web-based repository will help to decrease the number of protein structures that have functions marked as "unknown" in the PDB file. AVAILABILITY: and |
format | Text |
id | pubmed-1409798 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-14097982006-03-23 PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics von Grotthuss, Marcin Plewczynski, Dariusz Ginalski, Krzysztof Rychlewski, Leszek Shakhnovich, Eugene I BMC Bioinformatics Database BACKGROUND: The number of protein structures from structural genomics centers dramatically increases in the Protein Data Bank (PDB). Many of these structures are functionally unannotated because they have no sequence similarity to proteins of known function. However, it is possible to successfully infer function using only structural similarity. RESULTS: Here we present the PDB-UF database, a web-accessible collection of predictions of enzymatic properties using structure-function relationship. The assignments were conducted for three-dimensional protein structures of unknown function that come from structural genomics initiatives. We show that 4 hypothetical proteins (with PDB accession codes: 1VH0, 1NS5, 1O6D, and 1TO0), for which standard BLAST tools such as PSI-BLAST or RPS-BLAST failed to assign any function, are probably methyltransferase enzymes. CONCLUSION: We suggest that the structure-based prediction of an EC number should be conducted having the different similarity score cutoff for different protein folds. Moreover, performing the annotation using two different algorithms can reduce the rate of false positive assignments. We believe, that the presented web-based repository will help to decrease the number of protein structures that have functions marked as "unknown" in the PDB file. AVAILABILITY: and BioMed Central 2006-02-06 /pmc/articles/PMC1409798/ /pubmed/16460560 http://dx.doi.org/10.1186/1471-2105-7-53 Text en Copyright © 2006 von Grotthuss et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Database von Grotthuss, Marcin Plewczynski, Dariusz Ginalski, Krzysztof Rychlewski, Leszek Shakhnovich, Eugene I PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics |
title | PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics |
title_full | PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics |
title_fullStr | PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics |
title_full_unstemmed | PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics |
title_short | PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics |
title_sort | pdb-uf: database of predicted enzymatic functions for unannotated protein structures from structural genomics |
topic | Database |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1409798/ https://www.ncbi.nlm.nih.gov/pubmed/16460560 http://dx.doi.org/10.1186/1471-2105-7-53 |
work_keys_str_mv | AT vongrotthussmarcin pdbufdatabaseofpredictedenzymaticfunctionsforunannotatedproteinstructuresfromstructuralgenomics AT plewczynskidariusz pdbufdatabaseofpredictedenzymaticfunctionsforunannotatedproteinstructuresfromstructuralgenomics AT ginalskikrzysztof pdbufdatabaseofpredictedenzymaticfunctionsforunannotatedproteinstructuresfromstructuralgenomics AT rychlewskileszek pdbufdatabaseofpredictedenzymaticfunctionsforunannotatedproteinstructuresfromstructuralgenomics AT shakhnovicheugenei pdbufdatabaseofpredictedenzymaticfunctionsforunannotatedproteinstructuresfromstructuralgenomics |