Cargando…

Sequence and structural features of carbohydrate binding in proteins and assessment of predictability using a neural network

BACKGROUND: Protein-Carbohydrate interactions are crucial in many biological processes with implications to drug targeting and gene expression. Nature of protein-carbohydrate interactions may be studied at individual residue level by analyzing local sequence and structure environments in binding reg...

Descripción completa

Detalles Bibliográficos
Autores principales: Malik, Adeel, Ahmad, Shandar
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1780050/
https://www.ncbi.nlm.nih.gov/pubmed/17201922
http://dx.doi.org/10.1186/1472-6807-7-1
_version_ 1782131841977810944
author Malik, Adeel
Ahmad, Shandar
author_facet Malik, Adeel
Ahmad, Shandar
author_sort Malik, Adeel
collection PubMed
description BACKGROUND: Protein-Carbohydrate interactions are crucial in many biological processes with implications to drug targeting and gene expression. Nature of protein-carbohydrate interactions may be studied at individual residue level by analyzing local sequence and structure environments in binding regions in comparison to non-binding regions, which provide an inherent control for such analyses. With an ultimate aim of predicting binding sites from sequence and structure, overall statistics of binding regions needs to be compiled. Sequence-based predictions of binding sites have been successfully applied to DNA-binding proteins in our earlier works. We aim to apply similar analysis to carbohydrate binding proteins. However, due to a relatively much smaller region of proteins taking part in such interactions, the methodology and results are significantly different. A comparison of protein-carbohydrate complexes has also been made with other protein-ligand complexes. RESULTS: We have compiled statistics of amino acid compositions in binding versus non-binding regions- general as well as in each different secondary structure conformation. Binding propensities of each of the 20 residue types and their structure features such as solvent accessibility, packing density and secondary structure have been calculated to assess their predisposition to carbohydrate interactions. Finally, evolutionary profiles of amino acid sequences have been used to predict binding sites using a neural network. Another set of neural networks was trained using information from single sequences and the prediction performance from the evolutionary profiles and single sequences were compared. Best of the neural network based prediction could achieve an 87% sensitivity of prediction at 23% specificity for all carbohydrate-binding sites, using evolutionary information. Single sequences gave 68% sensitivity and 55% specificity for the same data set. Sensitivity and specificity for a limited galactose binding data set were obtained as 63% and 79% respectively for evolutionary information and 62% and 68% sensitivity and specificity for single sequences. Propensity and other sequence and structural features of carbohydrate binding sites have also been compared with our similar extensive studies on DNA-binding proteins and also with protein-ligand complexes. CONCLUSION: Carbohydrates typically show a preference to bind aromatic residues and most prominently tryptophan. Higher exposed surface area of binding sites indicates a role of hydrophobic interactions. Neural networks give a moderate success of prediction, which is expected to improve when structures of more protein-carbohydrate complexes become available in future.
format Text
id pubmed-1780050
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-17800502007-01-30 Sequence and structural features of carbohydrate binding in proteins and assessment of predictability using a neural network Malik, Adeel Ahmad, Shandar BMC Struct Biol Research Article BACKGROUND: Protein-Carbohydrate interactions are crucial in many biological processes with implications to drug targeting and gene expression. Nature of protein-carbohydrate interactions may be studied at individual residue level by analyzing local sequence and structure environments in binding regions in comparison to non-binding regions, which provide an inherent control for such analyses. With an ultimate aim of predicting binding sites from sequence and structure, overall statistics of binding regions needs to be compiled. Sequence-based predictions of binding sites have been successfully applied to DNA-binding proteins in our earlier works. We aim to apply similar analysis to carbohydrate binding proteins. However, due to a relatively much smaller region of proteins taking part in such interactions, the methodology and results are significantly different. A comparison of protein-carbohydrate complexes has also been made with other protein-ligand complexes. RESULTS: We have compiled statistics of amino acid compositions in binding versus non-binding regions- general as well as in each different secondary structure conformation. Binding propensities of each of the 20 residue types and their structure features such as solvent accessibility, packing density and secondary structure have been calculated to assess their predisposition to carbohydrate interactions. Finally, evolutionary profiles of amino acid sequences have been used to predict binding sites using a neural network. Another set of neural networks was trained using information from single sequences and the prediction performance from the evolutionary profiles and single sequences were compared. Best of the neural network based prediction could achieve an 87% sensitivity of prediction at 23% specificity for all carbohydrate-binding sites, using evolutionary information. Single sequences gave 68% sensitivity and 55% specificity for the same data set. Sensitivity and specificity for a limited galactose binding data set were obtained as 63% and 79% respectively for evolutionary information and 62% and 68% sensitivity and specificity for single sequences. Propensity and other sequence and structural features of carbohydrate binding sites have also been compared with our similar extensive studies on DNA-binding proteins and also with protein-ligand complexes. CONCLUSION: Carbohydrates typically show a preference to bind aromatic residues and most prominently tryptophan. Higher exposed surface area of binding sites indicates a role of hydrophobic interactions. Neural networks give a moderate success of prediction, which is expected to improve when structures of more protein-carbohydrate complexes become available in future. BioMed Central 2007-01-03 /pmc/articles/PMC1780050/ /pubmed/17201922 http://dx.doi.org/10.1186/1472-6807-7-1 Text en Copyright © 2007 Malik and Ahmad; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Malik, Adeel
Ahmad, Shandar
Sequence and structural features of carbohydrate binding in proteins and assessment of predictability using a neural network
title Sequence and structural features of carbohydrate binding in proteins and assessment of predictability using a neural network
title_full Sequence and structural features of carbohydrate binding in proteins and assessment of predictability using a neural network
title_fullStr Sequence and structural features of carbohydrate binding in proteins and assessment of predictability using a neural network
title_full_unstemmed Sequence and structural features of carbohydrate binding in proteins and assessment of predictability using a neural network
title_short Sequence and structural features of carbohydrate binding in proteins and assessment of predictability using a neural network
title_sort sequence and structural features of carbohydrate binding in proteins and assessment of predictability using a neural network
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1780050/
https://www.ncbi.nlm.nih.gov/pubmed/17201922
http://dx.doi.org/10.1186/1472-6807-7-1
work_keys_str_mv AT malikadeel sequenceandstructuralfeaturesofcarbohydratebindinginproteinsandassessmentofpredictabilityusinganeuralnetwork
AT ahmadshandar sequenceandstructuralfeaturesofcarbohydratebindinginproteinsandassessmentofpredictabilityusinganeuralnetwork