
PASS2: an automated database of protein alignments organised as structural superfamilies

BACKGROUND: The functional selection and three-dimensional structural constraints of proteins in nature often relate to the retention of significant sequence similarity between proteins of similar fold and function despite poor sequence identity. Organization of structure-based sequence alignments...


Bibliographic Details
Main Authors: Bhaduri, Anirban, Pugalenthi, Ganesan, Sowdhamini, Ramanathan
Format: Text
Language: English
Published: BioMed Central 2004
Subjects: Database
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC407847/
https://www.ncbi.nlm.nih.gov/pubmed/15059245
http://dx.doi.org/10.1186/1471-2105-5-35
_version_ 1782121394313625600
author Bhaduri, Anirban
Pugalenthi, Ganesan
Sowdhamini, Ramanathan
collection PubMed
description BACKGROUND: The functional selection and three-dimensional structural constraints of proteins in nature often relate to the retention of significant sequence similarity between proteins of similar fold and function despite poor sequence identity. Organization of structure-based sequence alignments for distantly related proteins provides a map of the conserved and critical regions of the protein universe that is useful for the analysis of folding principles, for the evolutionary unification of protein families and for maximizing the information return from experimental structure determination. The Protein Alignment organised as Structural Superfamily (PASS2) database represents continuously updated structural alignments for evolutionarily related, sequentially distant proteins. DESCRIPTION: An automated and updated version of PASS2 is in direct correspondence with SCOP 1.63 and consists of sequences having identity below 40% among themselves. Protein domains have been grouped into 628 multi-member superfamilies and 566 single-member superfamilies. Structure-based sequence alignments for the superfamilies have been obtained using COMPARER, while initial equivalencies have been derived from a preliminary superposition using LSQMAN or STAMP 4.0. The final sequence alignments have been annotated for structural features using JOY4.0. The database is supplemented with sequence relatives belonging to different genomes, conserved spatially interacting and structural motifs, probabilistic hidden Markov models of superfamilies based on the alignments and useful links to other databases. Probabilistic models and sensitive position-specific profiles obtained from reliable superfamily alignments aid annotation of remote homologues and are useful tools in structural and functional genomics. PASS2 presents the phylogeny of its members based on both sequence and structural dissimilarities. Clustering of members allows us to understand diversification of the family members. The search engine has been improved for simpler browsing of the database. CONCLUSIONS: The database resolves alignments among structural domains consisting of evolutionarily diverged sets of sequences. The availability of reliable sequence alignments of distantly related proteins, despite poor sequence identity, together with single-member superfamilies, permits better sampling of structures in libraries for fold recognition of new sequences and for the understanding of protein structure-function relationships of individual superfamilies. PASS2 is accessible at
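The description states that PASS2 members share less than 40% sequence identity among themselves. The record does not say how that identity is computed or how members are selected, so the following is only a minimal sketch under stated assumptions: identity is counted over alignment columns where both sequences have a residue, and members are chosen greedily in input order. The function names and the toy sequences are hypothetical and are not taken from PASS2.

```python
from typing import Dict, List


def percent_identity(a: str, b: str) -> float:
    """Percent identity over columns where both aligned sequences have a residue.

    Assumption: '-' marks a gap and gapped columns are ignored.
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)


def filter_by_identity(aligned: Dict[str, str], cutoff: float = 40.0) -> List[str]:
    """Greedily keep members whose identity to every kept member is below cutoff.

    Assumption: a simple first-come selection; the real procedure may differ.
    """
    kept: List[str] = []
    for name, seq in aligned.items():
        if all(percent_identity(seq, aligned[k]) < cutoff for k in kept):
            kept.append(name)
    return kept


# Hypothetical toy alignment (not real PASS2 data).
toy = {
    "d1": "ACDEFGHIKL",
    "d2": "ACDEFGHIKV",  # nearly identical to d1, so it is dropped
    "d3": "AC--WWWWWW",  # shares few residues with d1, so it is kept
}
selected = filter_by_identity(toy)
print(selected)
```

With a 40% cut-off, a near-duplicate of an already selected member is discarded while a distant relative is retained, which is the kind of sequentially distant set the description refers to.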
format Text
id pubmed-407847
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-407847 2004-05-15 PASS2: an automated database of protein alignments organised as structural superfamilies Bhaduri, Anirban Pugalenthi, Ganesan Sowdhamini, Ramanathan BMC Bioinformatics Database BioMed Central 2004-04-02 /pmc/articles/PMC407847/ /pubmed/15059245 http://dx.doi.org/10.1186/1471-2105-5-35 Text en Copyright © 2004 Bhaduri et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
title PASS2: an automated database of protein alignments organised as structural superfamilies
topic Database