Cargando…
The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver
Here, we present a major update to the SUPERFAMILY database and the webserver. We describe the addition of new SUPERFAMILY 2.0 profile HMM library containing a total of 27 623 HMMs. The database now includes Superfamily domain annotations for millions of protein sequences taken from the Universal Pr...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6324026/ https://www.ncbi.nlm.nih.gov/pubmed/30445555 http://dx.doi.org/10.1093/nar/gky1130 |
_version_ | 1783385894379585536 |
---|---|
author | Pandurangan, Arun Prasad Stahlhacke, Jonathan Oates, Matt E Smithers, Ben Gough, Julian |
author_facet | Pandurangan, Arun Prasad Stahlhacke, Jonathan Oates, Matt E Smithers, Ben Gough, Julian |
author_sort | Pandurangan, Arun Prasad |
collection | PubMed |
description | Here, we present a major update to the SUPERFAMILY database and the webserver. We describe the addition of new SUPERFAMILY 2.0 profile HMM library containing a total of 27 623 HMMs. The database now includes Superfamily domain annotations for millions of protein sequences taken from the Universal Protein Recourse Knowledgebase (UniProtKB) and the National Center for Biotechnology Information (NCBI). This addition constitutes about 51 and 45 million distinct protein sequences obtained from UniProtKB and NCBI respectively. Currently, the database contains annotations for 63 244 and 102 151 complete genomes taken from UniProtKB and NCBI respectively. The current sequence collection and genome update is the biggest so far in the history of SUPERFAMILY updates. In order to the deal with the massive wealth of information, here we introduce a new SUPERFAMILY 2.0 webserver (http://supfam.org). Currently, the webserver mainly focuses on the search, retrieval and display of Superfamily annotation for the entire sequence and genome collection in the database. |
format | Online Article Text |
id | pubmed-6324026 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-63240262019-01-10 The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver Pandurangan, Arun Prasad Stahlhacke, Jonathan Oates, Matt E Smithers, Ben Gough, Julian Nucleic Acids Res Database Issue Here, we present a major update to the SUPERFAMILY database and the webserver. We describe the addition of new SUPERFAMILY 2.0 profile HMM library containing a total of 27 623 HMMs. The database now includes Superfamily domain annotations for millions of protein sequences taken from the Universal Protein Recourse Knowledgebase (UniProtKB) and the National Center for Biotechnology Information (NCBI). This addition constitutes about 51 and 45 million distinct protein sequences obtained from UniProtKB and NCBI respectively. Currently, the database contains annotations for 63 244 and 102 151 complete genomes taken from UniProtKB and NCBI respectively. The current sequence collection and genome update is the biggest so far in the history of SUPERFAMILY updates. In order to the deal with the massive wealth of information, here we introduce a new SUPERFAMILY 2.0 webserver (http://supfam.org). Currently, the webserver mainly focuses on the search, retrieval and display of Superfamily annotation for the entire sequence and genome collection in the database. Oxford University Press 2019-01-08 2018-11-16 /pmc/articles/PMC6324026/ /pubmed/30445555 http://dx.doi.org/10.1093/nar/gky1130 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Database Issue Pandurangan, Arun Prasad Stahlhacke, Jonathan Oates, Matt E Smithers, Ben Gough, Julian The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver |
title | The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver |
title_full | The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver |
title_fullStr | The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver |
title_full_unstemmed | The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver |
title_short | The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver |
title_sort | superfamily 2.0 database: a significant proteome update and a new webserver |
topic | Database Issue |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6324026/ https://www.ncbi.nlm.nih.gov/pubmed/30445555 http://dx.doi.org/10.1093/nar/gky1130 |
work_keys_str_mv | AT panduranganarunprasad thesuperfamily20databaseasignificantproteomeupdateandanewwebserver AT stahlhackejonathan thesuperfamily20databaseasignificantproteomeupdateandanewwebserver AT oatesmatte thesuperfamily20databaseasignificantproteomeupdateandanewwebserver AT smithersben thesuperfamily20databaseasignificantproteomeupdateandanewwebserver AT goughjulian thesuperfamily20databaseasignificantproteomeupdateandanewwebserver AT panduranganarunprasad superfamily20databaseasignificantproteomeupdateandanewwebserver AT stahlhackejonathan superfamily20databaseasignificantproteomeupdateandanewwebserver AT oatesmatte superfamily20databaseasignificantproteomeupdateandanewwebserver AT smithersben superfamily20databaseasignificantproteomeupdateandanewwebserver AT goughjulian superfamily20databaseasignificantproteomeupdateandanewwebserver |