Cargando…

The SUPERFAMILY database in 2007: families and functions

The SUPERFAMILY database provides protein domain assignments, at the SCOP ‘superfamily’ level, for the predicted protein sequences in over 400 completed genomes. A superfamily groups together domains of different families which have a common evolutionary ancestor based on structural, functional and...

Descripción completa

Detalles Bibliográficos
Autores principales: Wilson, Derek, Madera, Martin, Vogel, Christine, Chothia, Cyrus, Gough, Julian
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1669749/
https://www.ncbi.nlm.nih.gov/pubmed/17098927
http://dx.doi.org/10.1093/nar/gkl910
_version_ 1782131122312839168
author Wilson, Derek
Madera, Martin
Vogel, Christine
Chothia, Cyrus
Gough, Julian
author_facet Wilson, Derek
Madera, Martin
Vogel, Christine
Chothia, Cyrus
Gough, Julian
author_sort Wilson, Derek
collection PubMed
description The SUPERFAMILY database provides protein domain assignments, at the SCOP ‘superfamily’ level, for the predicted protein sequences in over 400 completed genomes. A superfamily groups together domains of different families which have a common evolutionary ancestor based on structural, functional and sequence data. SUPERFAMILY domain assignments are generated using an expert curated set of profile hidden Markov models. All models and structural assignments are available for browsing and download from . The web interface includes services such as domain architectures and alignment details for all protein assignments, searchable domain combinations, domain occurrence network visualization, detection of over- or under-represented superfamilies for a given genome by comparison with other genomes, assignment of manually submitted sequences and keyword searches. In this update we describe the SUPERFAMILY database and outline two major developments: (i) incorporation of family level assignments and (ii) a superfamily-level functional annotation. The SUPERFAMILY database can be used for general protein evolution and superfamily-specific studies, genomic annotation, and structural genomics target suggestion and assessment.
format Text
id pubmed-1669749
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-16697492007-02-22 The SUPERFAMILY database in 2007: families and functions Wilson, Derek Madera, Martin Vogel, Christine Chothia, Cyrus Gough, Julian Nucleic Acids Res Articles The SUPERFAMILY database provides protein domain assignments, at the SCOP ‘superfamily’ level, for the predicted protein sequences in over 400 completed genomes. A superfamily groups together domains of different families which have a common evolutionary ancestor based on structural, functional and sequence data. SUPERFAMILY domain assignments are generated using an expert curated set of profile hidden Markov models. All models and structural assignments are available for browsing and download from . The web interface includes services such as domain architectures and alignment details for all protein assignments, searchable domain combinations, domain occurrence network visualization, detection of over- or under-represented superfamilies for a given genome by comparison with other genomes, assignment of manually submitted sequences and keyword searches. In this update we describe the SUPERFAMILY database and outline two major developments: (i) incorporation of family level assignments and (ii) a superfamily-level functional annotation. The SUPERFAMILY database can be used for general protein evolution and superfamily-specific studies, genomic annotation, and structural genomics target suggestion and assessment. Oxford University Press 2007-01 2006-11-10 /pmc/articles/PMC1669749/ /pubmed/17098927 http://dx.doi.org/10.1093/nar/gkl910 Text en © 2006 The Author(s) This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Wilson, Derek
Madera, Martin
Vogel, Christine
Chothia, Cyrus
Gough, Julian
The SUPERFAMILY database in 2007: families and functions
title The SUPERFAMILY database in 2007: families and functions
title_full The SUPERFAMILY database in 2007: families and functions
title_fullStr The SUPERFAMILY database in 2007: families and functions
title_full_unstemmed The SUPERFAMILY database in 2007: families and functions
title_short The SUPERFAMILY database in 2007: families and functions
title_sort superfamily database in 2007: families and functions
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1669749/
https://www.ncbi.nlm.nih.gov/pubmed/17098927
http://dx.doi.org/10.1093/nar/gkl910
work_keys_str_mv AT wilsonderek thesuperfamilydatabasein2007familiesandfunctions
AT maderamartin thesuperfamilydatabasein2007familiesandfunctions
AT vogelchristine thesuperfamilydatabasein2007familiesandfunctions
AT chothiacyrus thesuperfamilydatabasein2007familiesandfunctions
AT goughjulian thesuperfamilydatabasein2007familiesandfunctions
AT wilsonderek superfamilydatabasein2007familiesandfunctions
AT maderamartin superfamilydatabasein2007familiesandfunctions
AT vogelchristine superfamilydatabasein2007familiesandfunctions
AT chothiacyrus superfamilydatabasein2007familiesandfunctions
AT goughjulian superfamilydatabasein2007familiesandfunctions