Cargando…

The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver

Here, we present a major update to the SUPERFAMILY database and the webserver. We describe the addition of new SUPERFAMILY 2.0 profile HMM library containing a total of 27 623 HMMs. The database now includes Superfamily domain annotations for millions of protein sequences taken from the Universal Pr...

Descripción completa

Detalles Bibliográficos
Autores principales:	Pandurangan, Arun Prasad, Stahlhacke, Jonathan, Oates, Matt E, Smithers, Ben, Gough, Julian
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2019
Materias:	Database Issue
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6324026/ https://www.ncbi.nlm.nih.gov/pubmed/30445555 http://dx.doi.org/10.1093/nar/gky1130

_version_	1783385894379585536
author	Pandurangan, Arun Prasad Stahlhacke, Jonathan Oates, Matt E Smithers, Ben Gough, Julian
author_facet	Pandurangan, Arun Prasad Stahlhacke, Jonathan Oates, Matt E Smithers, Ben Gough, Julian
author_sort	Pandurangan, Arun Prasad
collection	PubMed
description	Here, we present a major update to the SUPERFAMILY database and the webserver. We describe the addition of new SUPERFAMILY 2.0 profile HMM library containing a total of 27 623 HMMs. The database now includes Superfamily domain annotations for millions of protein sequences taken from the Universal Protein Recourse Knowledgebase (UniProtKB) and the National Center for Biotechnology Information (NCBI). This addition constitutes about 51 and 45 million distinct protein sequences obtained from UniProtKB and NCBI respectively. Currently, the database contains annotations for 63 244 and 102 151 complete genomes taken from UniProtKB and NCBI respectively. The current sequence collection and genome update is the biggest so far in the history of SUPERFAMILY updates. In order to the deal with the massive wealth of information, here we introduce a new SUPERFAMILY 2.0 webserver (http://supfam.org). Currently, the webserver mainly focuses on the search, retrieval and display of Superfamily annotation for the entire sequence and genome collection in the database.
format	Online Article Text
id	pubmed-6324026
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-63240262019-01-10 The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver Pandurangan, Arun Prasad Stahlhacke, Jonathan Oates, Matt E Smithers, Ben Gough, Julian Nucleic Acids Res Database Issue Here, we present a major update to the SUPERFAMILY database and the webserver. We describe the addition of new SUPERFAMILY 2.0 profile HMM library containing a total of 27 623 HMMs. The database now includes Superfamily domain annotations for millions of protein sequences taken from the Universal Protein Recourse Knowledgebase (UniProtKB) and the National Center for Biotechnology Information (NCBI). This addition constitutes about 51 and 45 million distinct protein sequences obtained from UniProtKB and NCBI respectively. Currently, the database contains annotations for 63 244 and 102 151 complete genomes taken from UniProtKB and NCBI respectively. The current sequence collection and genome update is the biggest so far in the history of SUPERFAMILY updates. In order to the deal with the massive wealth of information, here we introduce a new SUPERFAMILY 2.0 webserver (http://supfam.org). Currently, the webserver mainly focuses on the search, retrieval and display of Superfamily annotation for the entire sequence and genome collection in the database. Oxford University Press 2019-01-08 2018-11-16 /pmc/articles/PMC6324026/ /pubmed/30445555 http://dx.doi.org/10.1093/nar/gky1130 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Database Issue Pandurangan, Arun Prasad Stahlhacke, Jonathan Oates, Matt E Smithers, Ben Gough, Julian The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver
title	The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver
title_full	The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver
title_fullStr	The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver
title_full_unstemmed	The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver
title_short	The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver
title_sort	superfamily 2.0 database: a significant proteome update and a new webserver
topic	Database Issue
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6324026/ https://www.ncbi.nlm.nih.gov/pubmed/30445555 http://dx.doi.org/10.1093/nar/gky1130
work_keys_str_mv	AT panduranganarunprasad thesuperfamily20databaseasignificantproteomeupdateandanewwebserver AT stahlhackejonathan thesuperfamily20databaseasignificantproteomeupdateandanewwebserver AT oatesmatte thesuperfamily20databaseasignificantproteomeupdateandanewwebserver AT smithersben thesuperfamily20databaseasignificantproteomeupdateandanewwebserver AT goughjulian thesuperfamily20databaseasignificantproteomeupdateandanewwebserver AT panduranganarunprasad superfamily20databaseasignificantproteomeupdateandanewwebserver AT stahlhackejonathan superfamily20databaseasignificantproteomeupdateandanewwebserver AT oatesmatte superfamily20databaseasignificantproteomeupdateandanewwebserver AT smithersben superfamily20databaseasignificantproteomeupdateandanewwebserver AT goughjulian superfamily20databaseasignificantproteomeupdateandanewwebserver

The SUPERFAMILY 2.0 database: a significant proteome update and a new webserver

Ejemplares similares