Cargando…

SAMbinder: A Web Server for Predicting S-Adenosyl-L-Methionine Binding Residues of a Protein From Its Amino Acid Sequence

MOTIVATION: S-adenosyl-L-methionine (SAM) is an essential cofactor present in the biological system and plays a key role in many diseases. There is a need to develop a method for predicting SAM binding sites in a protein for designing drugs against SAM associated disease. To the best of our knowledg...

Descripción completa

Detalles Bibliográficos
Autores principales: Agrawal, Piyush, Mishra, Gaurav, Raghava, Gajendra P. S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7002541/
https://www.ncbi.nlm.nih.gov/pubmed/32082172
http://dx.doi.org/10.3389/fphar.2019.01690
_version_ 1783494391577444352
author Agrawal, Piyush
Mishra, Gaurav
Raghava, Gajendra P. S.
author_facet Agrawal, Piyush
Mishra, Gaurav
Raghava, Gajendra P. S.
author_sort Agrawal, Piyush
collection PubMed
description MOTIVATION: S-adenosyl-L-methionine (SAM) is an essential cofactor present in the biological system and plays a key role in many diseases. There is a need to develop a method for predicting SAM binding sites in a protein for designing drugs against SAM associated disease. To the best of our knowledge, there is no method that can predict the binding site of SAM in a given protein sequence. RESULT: This manuscript describes a method SAMbinder, developed for predicting SAM interacting residue in a protein from its primary sequence. All models were trained, tested, and evaluated on 145 SAM binding protein chains where no two chains have more than 40% sequence similarity. Firstly, models were developed using different machine learning techniques on a balanced data set containing 2,188 SAM interacting and an equal number of non-interacting residues. Our random forest based model developed using binary profile feature got maximum Matthews Correlation Coefficient (MCC) 0.42 with area under receiver operating characteristics (AUROC) 0.79 on the validation data set. The performance of our models improved significantly from MCC 0.42 to 0.61, when evolutionary information in the form of the position-specific scoring matrix (PSSM) profile is used as a feature. We also developed models on a realistic data set containing 2,188 SAM interacting and 40,029 non-interacting residues and got maximum MCC 0.61 with AUROC of 0.89. In order to evaluate the performance of our models, we used internal as well as external cross-validation technique. AVAILABILITY AND IMPLEMENTATION: https://webs.iiitd.edu.in/raghava/sambinder/.
format Online
Article
Text
id pubmed-7002541
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-70025412020-02-20 SAMbinder: A Web Server for Predicting S-Adenosyl-L-Methionine Binding Residues of a Protein From Its Amino Acid Sequence Agrawal, Piyush Mishra, Gaurav Raghava, Gajendra P. S. Front Pharmacol Pharmacology MOTIVATION: S-adenosyl-L-methionine (SAM) is an essential cofactor present in the biological system and plays a key role in many diseases. There is a need to develop a method for predicting SAM binding sites in a protein for designing drugs against SAM associated disease. To the best of our knowledge, there is no method that can predict the binding site of SAM in a given protein sequence. RESULT: This manuscript describes a method SAMbinder, developed for predicting SAM interacting residue in a protein from its primary sequence. All models were trained, tested, and evaluated on 145 SAM binding protein chains where no two chains have more than 40% sequence similarity. Firstly, models were developed using different machine learning techniques on a balanced data set containing 2,188 SAM interacting and an equal number of non-interacting residues. Our random forest based model developed using binary profile feature got maximum Matthews Correlation Coefficient (MCC) 0.42 with area under receiver operating characteristics (AUROC) 0.79 on the validation data set. The performance of our models improved significantly from MCC 0.42 to 0.61, when evolutionary information in the form of the position-specific scoring matrix (PSSM) profile is used as a feature. We also developed models on a realistic data set containing 2,188 SAM interacting and 40,029 non-interacting residues and got maximum MCC 0.61 with AUROC of 0.89. In order to evaluate the performance of our models, we used internal as well as external cross-validation technique. AVAILABILITY AND IMPLEMENTATION: https://webs.iiitd.edu.in/raghava/sambinder/. Frontiers Media S.A. 2020-01-30 /pmc/articles/PMC7002541/ /pubmed/32082172 http://dx.doi.org/10.3389/fphar.2019.01690 Text en Copyright © 2020 Agrawal, Mishra and Raghava http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Pharmacology
Agrawal, Piyush
Mishra, Gaurav
Raghava, Gajendra P. S.
SAMbinder: A Web Server for Predicting S-Adenosyl-L-Methionine Binding Residues of a Protein From Its Amino Acid Sequence
title SAMbinder: A Web Server for Predicting S-Adenosyl-L-Methionine Binding Residues of a Protein From Its Amino Acid Sequence
title_full SAMbinder: A Web Server for Predicting S-Adenosyl-L-Methionine Binding Residues of a Protein From Its Amino Acid Sequence
title_fullStr SAMbinder: A Web Server for Predicting S-Adenosyl-L-Methionine Binding Residues of a Protein From Its Amino Acid Sequence
title_full_unstemmed SAMbinder: A Web Server for Predicting S-Adenosyl-L-Methionine Binding Residues of a Protein From Its Amino Acid Sequence
title_short SAMbinder: A Web Server for Predicting S-Adenosyl-L-Methionine Binding Residues of a Protein From Its Amino Acid Sequence
title_sort sambinder: a web server for predicting s-adenosyl-l-methionine binding residues of a protein from its amino acid sequence
topic Pharmacology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7002541/
https://www.ncbi.nlm.nih.gov/pubmed/32082172
http://dx.doi.org/10.3389/fphar.2019.01690
work_keys_str_mv AT agrawalpiyush sambinderawebserverforpredictingsadenosyllmethioninebindingresiduesofaproteinfromitsaminoacidsequence
AT mishragaurav sambinderawebserverforpredictingsadenosyllmethioninebindingresiduesofaproteinfromitsaminoacidsequence
AT raghavagajendraps sambinderawebserverforpredictingsadenosyllmethioninebindingresiduesofaproteinfromitsaminoacidsequence