Cargando…

Detailed protein sequence alignment based on Spectral Similarity Score (SSS)

BACKGROUND: The chemical property and biological function of a protein is a direct consequence of its primary structure. Several algorithms have been developed which determine alignment and similarity of primary protein sequences. However, character based similarity cannot provide insight into the s...

Descripción completa

Detalles Bibliográficos
Autores principales: Gupta, Kshitiz, Thomas, Dina, Vidya, SV, Venkatesh, KV, Ramakumar, S
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1131888/
https://www.ncbi.nlm.nih.gov/pubmed/15850477
http://dx.doi.org/10.1186/1471-2105-6-105
_version_ 1782123951467528192
author Gupta, Kshitiz
Thomas, Dina
Vidya, SV
Venkatesh, KV
Ramakumar, S
author_facet Gupta, Kshitiz
Thomas, Dina
Vidya, SV
Venkatesh, KV
Ramakumar, S
author_sort Gupta, Kshitiz
collection PubMed
description BACKGROUND: The chemical property and biological function of a protein is a direct consequence of its primary structure. Several algorithms have been developed which determine alignment and similarity of primary protein sequences. However, character based similarity cannot provide insight into the structural aspects of a protein. We present a method based on spectral similarity to compare subsequences of amino acids that behave similarly but are not aligned well by considering amino acids as mere characters. This approach finds a similarity score between sequences based on any given attribute, like hydrophobicity of amino acids, on the basis of spectral information after partial conversion to the frequency domain. RESULTS: Distance matrices of various branches of the human kinome, that is the full complement of human kinases, were developed that matched the phylogenetic tree of the human kinome establishing the efficacy of the global alignment of the algorithm. PKCd and PKCe kinases share close biological properties and structural similarities but do not give high scores with character based alignments. Detailed comparison established close similarities between subsequences that do not have any significant character identity. We compared their known 3D structures to establish that the algorithm is able to pick subsequences that are not considered similar by character based matching algorithms but share structural similarities. Similarly many subsequences with low character identity were picked between xyna-theau and xyna-clotm F/10 xylanases. Comparison of 3D structures of the subsequences confirmed the claim of similarity in structure. CONCLUSION: An algorithm is developed which is inspired by successful application of spectral similarity applied to music sequences. The method captures subsequences that do not align by traditional character based alignment tools but give rise to similar secondary and tertiary structures. The Spectral Similarity Score (SSS) is an extension to the conventional similarity methods and results indicate that it holds a strong potential for analysis of various biological sequences and structural variations in proteins.
format Text
id pubmed-1131888
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-11318882005-05-20 Detailed protein sequence alignment based on Spectral Similarity Score (SSS) Gupta, Kshitiz Thomas, Dina Vidya, SV Venkatesh, KV Ramakumar, S BMC Bioinformatics Research Article BACKGROUND: The chemical property and biological function of a protein is a direct consequence of its primary structure. Several algorithms have been developed which determine alignment and similarity of primary protein sequences. However, character based similarity cannot provide insight into the structural aspects of a protein. We present a method based on spectral similarity to compare subsequences of amino acids that behave similarly but are not aligned well by considering amino acids as mere characters. This approach finds a similarity score between sequences based on any given attribute, like hydrophobicity of amino acids, on the basis of spectral information after partial conversion to the frequency domain. RESULTS: Distance matrices of various branches of the human kinome, that is the full complement of human kinases, were developed that matched the phylogenetic tree of the human kinome establishing the efficacy of the global alignment of the algorithm. PKCd and PKCe kinases share close biological properties and structural similarities but do not give high scores with character based alignments. Detailed comparison established close similarities between subsequences that do not have any significant character identity. We compared their known 3D structures to establish that the algorithm is able to pick subsequences that are not considered similar by character based matching algorithms but share structural similarities. Similarly many subsequences with low character identity were picked between xyna-theau and xyna-clotm F/10 xylanases. Comparison of 3D structures of the subsequences confirmed the claim of similarity in structure. CONCLUSION: An algorithm is developed which is inspired by successful application of spectral similarity applied to music sequences. The method captures subsequences that do not align by traditional character based alignment tools but give rise to similar secondary and tertiary structures. The Spectral Similarity Score (SSS) is an extension to the conventional similarity methods and results indicate that it holds a strong potential for analysis of various biological sequences and structural variations in proteins. BioMed Central 2005-04-23 /pmc/articles/PMC1131888/ /pubmed/15850477 http://dx.doi.org/10.1186/1471-2105-6-105 Text en Copyright © 2005 Gupta et al; licensee BioMed Central Ltd.
spellingShingle Research Article
Gupta, Kshitiz
Thomas, Dina
Vidya, SV
Venkatesh, KV
Ramakumar, S
Detailed protein sequence alignment based on Spectral Similarity Score (SSS)
title Detailed protein sequence alignment based on Spectral Similarity Score (SSS)
title_full Detailed protein sequence alignment based on Spectral Similarity Score (SSS)
title_fullStr Detailed protein sequence alignment based on Spectral Similarity Score (SSS)
title_full_unstemmed Detailed protein sequence alignment based on Spectral Similarity Score (SSS)
title_short Detailed protein sequence alignment based on Spectral Similarity Score (SSS)
title_sort detailed protein sequence alignment based on spectral similarity score (sss)
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1131888/
https://www.ncbi.nlm.nih.gov/pubmed/15850477
http://dx.doi.org/10.1186/1471-2105-6-105
work_keys_str_mv AT guptakshitiz detailedproteinsequencealignmentbasedonspectralsimilarityscoresss
AT thomasdina detailedproteinsequencealignmentbasedonspectralsimilarityscoresss
AT vidyasv detailedproteinsequencealignmentbasedonspectralsimilarityscoresss
AT venkateshkv detailedproteinsequencealignmentbasedonspectralsimilarityscoresss
AT ramakumars detailedproteinsequencealignmentbasedonspectralsimilarityscoresss