Cargando…

The SUPERFAMILY 1.75 database in 2014: a doubling of data

We present updates to the SUPERFAMILY 1.75 (http://supfam.org) online resource and protein sequence collection. The hidden Markov model library that provides sequence homology to SCOP structural domains remains unchanged at version 1.75. In the last 4 years SUPERFAMILY has more than doubled its hold...

Descripción completa

Detalles Bibliográficos
Autores principales: Oates, Matt E., Stahlhacke, Jonathan, Vavoulis, Dimitrios V., Smithers, Ben, Rackham, Owen J.L., Sardar, Adam J., Zaucha, Jan, Thurlby, Natalie, Fang, Hai, Gough, Julian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4383889/
https://www.ncbi.nlm.nih.gov/pubmed/25414345
http://dx.doi.org/10.1093/nar/gku1041
_version_ 1782364806498484224
author Oates, Matt E.
Stahlhacke, Jonathan
Vavoulis, Dimitrios V.
Smithers, Ben
Rackham, Owen J.L.
Sardar, Adam J.
Zaucha, Jan
Thurlby, Natalie
Fang, Hai
Gough, Julian
author_facet Oates, Matt E.
Stahlhacke, Jonathan
Vavoulis, Dimitrios V.
Smithers, Ben
Rackham, Owen J.L.
Sardar, Adam J.
Zaucha, Jan
Thurlby, Natalie
Fang, Hai
Gough, Julian
author_sort Oates, Matt E.
collection PubMed
description We present updates to the SUPERFAMILY 1.75 (http://supfam.org) online resource and protein sequence collection. The hidden Markov model library that provides sequence homology to SCOP structural domains remains unchanged at version 1.75. In the last 4 years SUPERFAMILY has more than doubled its holding of curated complete proteomes over all cellular life, from 1400 proteomes reported previously in 2010 up to 3258 at present. Outside of the main sequence collection, SUPERFAMILY continues to provide domain annotation for sequences provided by other resources such as: UniProt, Ensembl, PDB, much of JGI Phytozome and selected subcollections of NCBI RefSeq. Despite this growth in data volume, SUPERFAMILY now provides users with an expanded and daily updated phylogenetic tree of life (sTOL). This tree is built with genomic-scale domain annotation data as before, but constantly updated when new species are introduced to the sequence library. Our Gene Ontology and other functional and phenotypic annotations previously reported have stood up to critical assessment by the function prediction community. We have now introduced these data in an integrated manner online at the level of an individual sequence, and—in the case of whole genomes—with enrichment analysis against a taxonomically defined background.
format Online
Article
Text
id pubmed-4383889
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-43838892015-04-08 The SUPERFAMILY 1.75 database in 2014: a doubling of data Oates, Matt E. Stahlhacke, Jonathan Vavoulis, Dimitrios V. Smithers, Ben Rackham, Owen J.L. Sardar, Adam J. Zaucha, Jan Thurlby, Natalie Fang, Hai Gough, Julian Nucleic Acids Res Database Issue We present updates to the SUPERFAMILY 1.75 (http://supfam.org) online resource and protein sequence collection. The hidden Markov model library that provides sequence homology to SCOP structural domains remains unchanged at version 1.75. In the last 4 years SUPERFAMILY has more than doubled its holding of curated complete proteomes over all cellular life, from 1400 proteomes reported previously in 2010 up to 3258 at present. Outside of the main sequence collection, SUPERFAMILY continues to provide domain annotation for sequences provided by other resources such as: UniProt, Ensembl, PDB, much of JGI Phytozome and selected subcollections of NCBI RefSeq. Despite this growth in data volume, SUPERFAMILY now provides users with an expanded and daily updated phylogenetic tree of life (sTOL). This tree is built with genomic-scale domain annotation data as before, but constantly updated when new species are introduced to the sequence library. Our Gene Ontology and other functional and phenotypic annotations previously reported have stood up to critical assessment by the function prediction community. We have now introduced these data in an integrated manner online at the level of an individual sequence, and—in the case of whole genomes—with enrichment analysis against a taxonomically defined background. Oxford University Press 2014-11-20 2015-01-28 /pmc/articles/PMC4383889/ /pubmed/25414345 http://dx.doi.org/10.1093/nar/gku1041 Text en © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Issue
Oates, Matt E.
Stahlhacke, Jonathan
Vavoulis, Dimitrios V.
Smithers, Ben
Rackham, Owen J.L.
Sardar, Adam J.
Zaucha, Jan
Thurlby, Natalie
Fang, Hai
Gough, Julian
The SUPERFAMILY 1.75 database in 2014: a doubling of data
title The SUPERFAMILY 1.75 database in 2014: a doubling of data
title_full The SUPERFAMILY 1.75 database in 2014: a doubling of data
title_fullStr The SUPERFAMILY 1.75 database in 2014: a doubling of data
title_full_unstemmed The SUPERFAMILY 1.75 database in 2014: a doubling of data
title_short The SUPERFAMILY 1.75 database in 2014: a doubling of data
title_sort superfamily 1.75 database in 2014: a doubling of data
topic Database Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4383889/
https://www.ncbi.nlm.nih.gov/pubmed/25414345
http://dx.doi.org/10.1093/nar/gku1041
work_keys_str_mv AT oatesmatte thesuperfamily175databasein2014adoublingofdata
AT stahlhackejonathan thesuperfamily175databasein2014adoublingofdata
AT vavoulisdimitriosv thesuperfamily175databasein2014adoublingofdata
AT smithersben thesuperfamily175databasein2014adoublingofdata
AT rackhamowenjl thesuperfamily175databasein2014adoublingofdata
AT sardaradamj thesuperfamily175databasein2014adoublingofdata
AT zauchajan thesuperfamily175databasein2014adoublingofdata
AT thurlbynatalie thesuperfamily175databasein2014adoublingofdata
AT fanghai thesuperfamily175databasein2014adoublingofdata
AT goughjulian thesuperfamily175databasein2014adoublingofdata
AT oatesmatte superfamily175databasein2014adoublingofdata
AT stahlhackejonathan superfamily175databasein2014adoublingofdata
AT vavoulisdimitriosv superfamily175databasein2014adoublingofdata
AT smithersben superfamily175databasein2014adoublingofdata
AT rackhamowenjl superfamily175databasein2014adoublingofdata
AT sardaradamj superfamily175databasein2014adoublingofdata
AT zauchajan superfamily175databasein2014adoublingofdata
AT thurlbynatalie superfamily175databasein2014adoublingofdata
AT fanghai superfamily175databasein2014adoublingofdata
AT goughjulian superfamily175databasein2014adoublingofdata