The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver

Here, we present a major update to the SUPERFAMILY database and the webserver. We describe the addition of a new SUPERFAMILY 2.0 profile HMM library containing a total of 27 623 HMMs. The database now includes Superfamily domain annotations for millions of protein sequences taken from the Universal Pr...

Bibliographic Details
Main Authors: Pandurangan, Arun Prasad, Stahlhacke, Jonathan, Oates, Matt E, Smithers, Ben, Gough, Julian
Format: Online Article Text
Language: English
Published: Oxford University Press 2019
Subjects: Database Issue
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6324026/
https://www.ncbi.nlm.nih.gov/pubmed/30445555
http://dx.doi.org/10.1093/nar/gky1130
_version_ 1783385894379585536
author Pandurangan, Arun Prasad
Stahlhacke, Jonathan
Oates, Matt E
Smithers, Ben
Gough, Julian
author_facet Pandurangan, Arun Prasad
Stahlhacke, Jonathan
Oates, Matt E
Smithers, Ben
Gough, Julian
author_sort Pandurangan, Arun Prasad
collection PubMed
description Here, we present a major update to the SUPERFAMILY database and the webserver. We describe the addition of a new SUPERFAMILY 2.0 profile HMM library containing a total of 27 623 HMMs. The database now includes Superfamily domain annotations for millions of protein sequences taken from the Universal Protein Resource Knowledgebase (UniProtKB) and the National Center for Biotechnology Information (NCBI). This addition constitutes about 51 and 45 million distinct protein sequences obtained from UniProtKB and NCBI, respectively. Currently, the database contains annotations for 63 244 and 102 151 complete genomes taken from UniProtKB and NCBI, respectively. The current sequence collection and genome update is the biggest so far in the history of SUPERFAMILY updates. To deal with the massive wealth of information, we introduce a new SUPERFAMILY 2.0 webserver (http://supfam.org). Currently, the webserver mainly focuses on the search, retrieval and display of Superfamily annotation for the entire sequence and genome collection in the database.
format Online
Article
Text
id pubmed-6324026
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-63240262019-01-10 The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver Pandurangan, Arun Prasad Stahlhacke, Jonathan Oates, Matt E Smithers, Ben Gough, Julian Nucleic Acids Res Database Issue Here, we present a major update to the SUPERFAMILY database and the webserver. We describe the addition of a new SUPERFAMILY 2.0 profile HMM library containing a total of 27 623 HMMs. The database now includes Superfamily domain annotations for millions of protein sequences taken from the Universal Protein Resource Knowledgebase (UniProtKB) and the National Center for Biotechnology Information (NCBI). This addition constitutes about 51 and 45 million distinct protein sequences obtained from UniProtKB and NCBI, respectively. Currently, the database contains annotations for 63 244 and 102 151 complete genomes taken from UniProtKB and NCBI, respectively. The current sequence collection and genome update is the biggest so far in the history of SUPERFAMILY updates. To deal with the massive wealth of information, we introduce a new SUPERFAMILY 2.0 webserver (http://supfam.org). Currently, the webserver mainly focuses on the search, retrieval and display of Superfamily annotation for the entire sequence and genome collection in the database. Oxford University Press 2019-01-08 2018-11-16 /pmc/articles/PMC6324026/ /pubmed/30445555 http://dx.doi.org/10.1093/nar/gky1130 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Issue
Pandurangan, Arun Prasad
Stahlhacke, Jonathan
Oates, Matt E
Smithers, Ben
Gough, Julian
The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver
title The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver
title_full The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver
title_fullStr The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver
title_full_unstemmed The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver
title_short The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver
title_sort superfamily 2.0 database: a significant proteome update and a new webserver
topic Database Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6324026/
https://www.ncbi.nlm.nih.gov/pubmed/30445555
http://dx.doi.org/10.1093/nar/gky1130
work_keys_str_mv AT panduranganarunprasad thesuperfamily20databaseasignificantproteomeupdateandanewwebserver
AT stahlhackejonathan thesuperfamily20databaseasignificantproteomeupdateandanewwebserver
AT oatesmatte thesuperfamily20databaseasignificantproteomeupdateandanewwebserver
AT smithersben thesuperfamily20databaseasignificantproteomeupdateandanewwebserver
AT goughjulian thesuperfamily20databaseasignificantproteomeupdateandanewwebserver
AT panduranganarunprasad superfamily20databaseasignificantproteomeupdateandanewwebserver
AT stahlhackejonathan superfamily20databaseasignificantproteomeupdateandanewwebserver
AT oatesmatte superfamily20databaseasignificantproteomeupdateandanewwebserver
AT smithersben superfamily20databaseasignificantproteomeupdateandanewwebserver
AT goughjulian superfamily20databaseasignificantproteomeupdateandanewwebserver