Cargando…
Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information
Locating ligand binding sites and finding the functionally important residues from protein sequences as well as structures became one of the challenges in understanding their function. Hence a Naïve Bayes classifier has been trained to predict whether a given amino acid residue in membrane protein s...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4329842/ https://www.ncbi.nlm.nih.gov/pubmed/25802517 http://dx.doi.org/10.1155/2015/843030 |
_version_ | 1782357504480509952 |
---|---|
author | Suresh, M. Xavier Gromiha, M. Michael Suwa, Makiko |
author_facet | Suresh, M. Xavier Gromiha, M. Michael Suwa, Makiko |
author_sort | Suresh, M. Xavier |
collection | PubMed |
description | Locating ligand binding sites and finding the functionally important residues from protein sequences as well as structures became one of the challenges in understanding their function. Hence a Naïve Bayes classifier has been trained to predict whether a given amino acid residue in membrane protein sequence is a ligand binding residue or not using only sequence based information. The input to the classifier consists of the features of the target residue and two sequence neighbors on each side of the target residue. The classifier is trained and evaluated on a nonredundant set of 42 sequences (chains with at least one transmembrane domain) from 31 alpha-helical membrane proteins. The classifier achieves an overall accuracy of 70.7% with 72.5% specificity and 61.1% sensitivity in identifying ligand binding residues from sequence. The classifier performs better when the sequence is encoded by psi-blast generated PSSM profiles. Assessment of the predictions in the context of three-dimensional structures of proteins reveals the effectiveness of this method in identifying ligand binding sites from sequence information. In 83.3% (35 out of 42) of the proteins, the classifier identifies the ligand binding sites by correctly recognizing more than half of the binding residues. This will be useful to protein engineers in exploiting potential residues for functional assessment. |
format | Online Article Text |
id | pubmed-4329842 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-43298422015-03-23 Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information Suresh, M. Xavier Gromiha, M. Michael Suwa, Makiko Adv Bioinformatics Research Article Locating ligand binding sites and finding the functionally important residues from protein sequences as well as structures became one of the challenges in understanding their function. Hence a Naïve Bayes classifier has been trained to predict whether a given amino acid residue in membrane protein sequence is a ligand binding residue or not using only sequence based information. The input to the classifier consists of the features of the target residue and two sequence neighbors on each side of the target residue. The classifier is trained and evaluated on a nonredundant set of 42 sequences (chains with at least one transmembrane domain) from 31 alpha-helical membrane proteins. The classifier achieves an overall accuracy of 70.7% with 72.5% specificity and 61.1% sensitivity in identifying ligand binding residues from sequence. The classifier performs better when the sequence is encoded by psi-blast generated PSSM profiles. Assessment of the predictions in the context of three-dimensional structures of proteins reveals the effectiveness of this method in identifying ligand binding sites from sequence information. In 83.3% (35 out of 42) of the proteins, the classifier identifies the ligand binding sites by correctly recognizing more than half of the binding residues. This will be useful to protein engineers in exploiting potential residues for functional assessment. Hindawi Publishing Corporation 2015 2015-01-31 /pmc/articles/PMC4329842/ /pubmed/25802517 http://dx.doi.org/10.1155/2015/843030 Text en Copyright © 2015 M. Xavier Suresh et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Suresh, M. Xavier Gromiha, M. Michael Suwa, Makiko Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information |
title | Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information |
title_full | Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information |
title_fullStr | Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information |
title_full_unstemmed | Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information |
title_short | Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information |
title_sort | development of a machine learning method to predict membrane protein-ligand binding residues using basic sequence information |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4329842/ https://www.ncbi.nlm.nih.gov/pubmed/25802517 http://dx.doi.org/10.1155/2015/843030 |
work_keys_str_mv | AT sureshmxavier developmentofamachinelearningmethodtopredictmembraneproteinligandbindingresiduesusingbasicsequenceinformation AT gromihammichael developmentofamachinelearningmethodtopredictmembraneproteinligandbindingresiduesusingbasicsequenceinformation AT suwamakiko developmentofamachinelearningmethodtopredictmembraneproteinligandbindingresiduesusingbasicsequenceinformation |