Cargando…

Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information

Locating ligand binding sites and finding the functionally important residues from protein sequences as well as structures became one of the challenges in understanding their function. Hence a Naïve Bayes classifier has been trained to predict whether a given amino acid residue in membrane protein s...

Descripción completa

Detalles Bibliográficos
Autores principales: Suresh, M. Xavier, Gromiha, M. Michael, Suwa, Makiko
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4329842/
https://www.ncbi.nlm.nih.gov/pubmed/25802517
http://dx.doi.org/10.1155/2015/843030
_version_ 1782357504480509952
author Suresh, M. Xavier
Gromiha, M. Michael
Suwa, Makiko
author_facet Suresh, M. Xavier
Gromiha, M. Michael
Suwa, Makiko
author_sort Suresh, M. Xavier
collection PubMed
description Locating ligand binding sites and finding the functionally important residues from protein sequences as well as structures became one of the challenges in understanding their function. Hence a Naïve Bayes classifier has been trained to predict whether a given amino acid residue in membrane protein sequence is a ligand binding residue or not using only sequence based information. The input to the classifier consists of the features of the target residue and two sequence neighbors on each side of the target residue. The classifier is trained and evaluated on a nonredundant set of 42 sequences (chains with at least one transmembrane domain) from 31 alpha-helical membrane proteins. The classifier achieves an overall accuracy of 70.7% with 72.5% specificity and 61.1% sensitivity in identifying ligand binding residues from sequence. The classifier performs better when the sequence is encoded by psi-blast generated PSSM profiles. Assessment of the predictions in the context of three-dimensional structures of proteins reveals the effectiveness of this method in identifying ligand binding sites from sequence information. In 83.3% (35 out of 42) of the proteins, the classifier identifies the ligand binding sites by correctly recognizing more than half of the binding residues. This will be useful to protein engineers in exploiting potential residues for functional assessment.
format Online
Article
Text
id pubmed-4329842
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-43298422015-03-23 Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information Suresh, M. Xavier Gromiha, M. Michael Suwa, Makiko Adv Bioinformatics Research Article Locating ligand binding sites and finding the functionally important residues from protein sequences as well as structures became one of the challenges in understanding their function. Hence a Naïve Bayes classifier has been trained to predict whether a given amino acid residue in membrane protein sequence is a ligand binding residue or not using only sequence based information. The input to the classifier consists of the features of the target residue and two sequence neighbors on each side of the target residue. The classifier is trained and evaluated on a nonredundant set of 42 sequences (chains with at least one transmembrane domain) from 31 alpha-helical membrane proteins. The classifier achieves an overall accuracy of 70.7% with 72.5% specificity and 61.1% sensitivity in identifying ligand binding residues from sequence. The classifier performs better when the sequence is encoded by psi-blast generated PSSM profiles. Assessment of the predictions in the context of three-dimensional structures of proteins reveals the effectiveness of this method in identifying ligand binding sites from sequence information. In 83.3% (35 out of 42) of the proteins, the classifier identifies the ligand binding sites by correctly recognizing more than half of the binding residues. This will be useful to protein engineers in exploiting potential residues for functional assessment. Hindawi Publishing Corporation 2015 2015-01-31 /pmc/articles/PMC4329842/ /pubmed/25802517 http://dx.doi.org/10.1155/2015/843030 Text en Copyright © 2015 M. Xavier Suresh et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Suresh, M. Xavier
Gromiha, M. Michael
Suwa, Makiko
Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information
title Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information
title_full Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information
title_fullStr Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information
title_full_unstemmed Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information
title_short Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information
title_sort development of a machine learning method to predict membrane protein-ligand binding residues using basic sequence information
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4329842/
https://www.ncbi.nlm.nih.gov/pubmed/25802517
http://dx.doi.org/10.1155/2015/843030
work_keys_str_mv AT sureshmxavier developmentofamachinelearningmethodtopredictmembraneproteinligandbindingresiduesusingbasicsequenceinformation
AT gromihammichael developmentofamachinelearningmethodtopredictmembraneproteinligandbindingresiduesusingbasicsequenceinformation
AT suwamakiko developmentofamachinelearningmethodtopredictmembraneproteinligandbindingresiduesusingbasicsequenceinformation