Cargando…

CUSP: an algorithm to distinguish structurally conserved and unconserved regions in protein domain alignments and its application in the study of large length variations

BACKGROUND: Distantly related proteins adopt and retain similar structural scaffolds despite length variations that could be as much as two-fold in some protein superfamilies. In this paper, we describe an analysis of indel regions that accommodate length variations amongst related proteins. We have...

Descripción completa

Detalles Bibliográficos
Autores principales: Sandhya, Sankaran, Pankaj, Barah, Govind, Madabosse Kande, Offmann, Bernard, Srinivasan, Narayanaswamy, Sowdhamini, Ramanathan
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2423364/
https://www.ncbi.nlm.nih.gov/pubmed/18513436
http://dx.doi.org/10.1186/1472-6807-8-28
_version_ 1782156092283813888
author Sandhya, Sankaran
Pankaj, Barah
Govind, Madabosse Kande
Offmann, Bernard
Srinivasan, Narayanaswamy
Sowdhamini, Ramanathan
author_facet Sandhya, Sankaran
Pankaj, Barah
Govind, Madabosse Kande
Offmann, Bernard
Srinivasan, Narayanaswamy
Sowdhamini, Ramanathan
author_sort Sandhya, Sankaran
collection PubMed
description BACKGROUND: Distantly related proteins adopt and retain similar structural scaffolds despite length variations that could be as much as two-fold in some protein superfamilies. In this paper, we describe an analysis of indel regions that accommodate length variations amongst related proteins. We have developed an algorithm CUSP, to examine multi-membered PASS2 superfamily alignments to identify indel regions in an automated manner. Further, we have used the method to characterize the length, structural type and biochemical features of indels in related protein domains. RESULTS: CUSP, examines protein domain structural alignments to distinguish regions of conserved structure common to related proteins from structurally unconserved regions that vary in length and type of structure. On a non-redundant dataset of 353 domain superfamily alignments from PASS2, we find that 'length- deviant' protein superfamilies show > 30% length variation from their average domain length. 60% of additional lengths that occur in indels are short-length structures (< 5 residues) while 6% of indels are > 15 residues in length. Structural types in indels also show class-specific trends. CONCLUSION: The extent of length variation varies across different superfamilies and indels show class-specific trends for preferred lengths and structural types. Such indels of different lengths even within a single protein domain superfamily could have structural and functional consequences that drive their selection, underlying their importance in similarity detection and computational modelling. The availability of systematic algorithms, like CUSP, should enable decision making in a domain superfamily-specific manner.
format Text
id pubmed-2423364
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-24233642008-06-10 CUSP: an algorithm to distinguish structurally conserved and unconserved regions in protein domain alignments and its application in the study of large length variations Sandhya, Sankaran Pankaj, Barah Govind, Madabosse Kande Offmann, Bernard Srinivasan, Narayanaswamy Sowdhamini, Ramanathan BMC Struct Biol Research Article BACKGROUND: Distantly related proteins adopt and retain similar structural scaffolds despite length variations that could be as much as two-fold in some protein superfamilies. In this paper, we describe an analysis of indel regions that accommodate length variations amongst related proteins. We have developed an algorithm CUSP, to examine multi-membered PASS2 superfamily alignments to identify indel regions in an automated manner. Further, we have used the method to characterize the length, structural type and biochemical features of indels in related protein domains. RESULTS: CUSP, examines protein domain structural alignments to distinguish regions of conserved structure common to related proteins from structurally unconserved regions that vary in length and type of structure. On a non-redundant dataset of 353 domain superfamily alignments from PASS2, we find that 'length- deviant' protein superfamilies show > 30% length variation from their average domain length. 60% of additional lengths that occur in indels are short-length structures (< 5 residues) while 6% of indels are > 15 residues in length. Structural types in indels also show class-specific trends. CONCLUSION: The extent of length variation varies across different superfamilies and indels show class-specific trends for preferred lengths and structural types. Such indels of different lengths even within a single protein domain superfamily could have structural and functional consequences that drive their selection, underlying their importance in similarity detection and computational modelling. The availability of systematic algorithms, like CUSP, should enable decision making in a domain superfamily-specific manner. BioMed Central 2008-05-31 /pmc/articles/PMC2423364/ /pubmed/18513436 http://dx.doi.org/10.1186/1472-6807-8-28 Text en Copyright © 2008 Sandhya et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Sandhya, Sankaran
Pankaj, Barah
Govind, Madabosse Kande
Offmann, Bernard
Srinivasan, Narayanaswamy
Sowdhamini, Ramanathan
CUSP: an algorithm to distinguish structurally conserved and unconserved regions in protein domain alignments and its application in the study of large length variations
title CUSP: an algorithm to distinguish structurally conserved and unconserved regions in protein domain alignments and its application in the study of large length variations
title_full CUSP: an algorithm to distinguish structurally conserved and unconserved regions in protein domain alignments and its application in the study of large length variations
title_fullStr CUSP: an algorithm to distinguish structurally conserved and unconserved regions in protein domain alignments and its application in the study of large length variations
title_full_unstemmed CUSP: an algorithm to distinguish structurally conserved and unconserved regions in protein domain alignments and its application in the study of large length variations
title_short CUSP: an algorithm to distinguish structurally conserved and unconserved regions in protein domain alignments and its application in the study of large length variations
title_sort cusp: an algorithm to distinguish structurally conserved and unconserved regions in protein domain alignments and its application in the study of large length variations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2423364/
https://www.ncbi.nlm.nih.gov/pubmed/18513436
http://dx.doi.org/10.1186/1472-6807-8-28
work_keys_str_mv AT sandhyasankaran cuspanalgorithmtodistinguishstructurallyconservedandunconservedregionsinproteindomainalignmentsanditsapplicationinthestudyoflargelengthvariations
AT pankajbarah cuspanalgorithmtodistinguishstructurallyconservedandunconservedregionsinproteindomainalignmentsanditsapplicationinthestudyoflargelengthvariations
AT govindmadabossekande cuspanalgorithmtodistinguishstructurallyconservedandunconservedregionsinproteindomainalignmentsanditsapplicationinthestudyoflargelengthvariations
AT offmannbernard cuspanalgorithmtodistinguishstructurallyconservedandunconservedregionsinproteindomainalignmentsanditsapplicationinthestudyoflargelengthvariations
AT srinivasannarayanaswamy cuspanalgorithmtodistinguishstructurallyconservedandunconservedregionsinproteindomainalignmentsanditsapplicationinthestudyoflargelengthvariations
AT sowdhaminiramanathan cuspanalgorithmtodistinguishstructurallyconservedandunconservedregionsinproteindomainalignmentsanditsapplicationinthestudyoflargelengthvariations