Cargando…

A machine learning approach for the identification of odorant binding proteins from sequence-derived properties

BACKGROUND: Odorant binding proteins (OBPs) are believed to shuttle odorants from the environment to the underlying odorant receptors, for which they could potentially serve as odorant presenters. Although several sequence based search methods have been exploited for protein family prediction, less...

Descripción completa

Detalles Bibliográficos
Autores principales: Pugalenthi, Ganesan, Tang, Ke, Suganthan, PN, Archunan, G, Sowdhamini, R
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2216042/
https://www.ncbi.nlm.nih.gov/pubmed/17880712
http://dx.doi.org/10.1186/1471-2105-8-351
_version_ 1782149098924670976
author Pugalenthi, Ganesan
Tang, Ke
Suganthan, PN
Archunan, G
Sowdhamini, R
author_facet Pugalenthi, Ganesan
Tang, Ke
Suganthan, PN
Archunan, G
Sowdhamini, R
author_sort Pugalenthi, Ganesan
collection PubMed
description BACKGROUND: Odorant binding proteins (OBPs) are believed to shuttle odorants from the environment to the underlying odorant receptors, for which they could potentially serve as odorant presenters. Although several sequence based search methods have been exploited for protein family prediction, less effort has been devoted to the prediction of OBPs from sequence data and this area is more challenging due to poor sequence identity between these proteins. RESULTS: In this paper, we propose a new algorithm that uses Regularized Least Squares Classifier (RLSC) in conjunction with multiple physicochemical properties of amino acids to predict odorant-binding proteins. The algorithm was applied to the dataset derived from Pfam and GenDiS database and we obtained overall prediction accuracy of 97.7% (94.5% and 98.4% for positive and negative classes respectively). CONCLUSION: Our study suggests that RLSC is potentially useful for predicting the odorant binding proteins from sequence-derived properties irrespective of sequence similarity. Our method predicts 92.8% of 56 odorant binding proteins non-homologous to any protein in the swissprot database and 97.1% of the 414 independent dataset proteins, suggesting the usefulness of RLSC method for facilitating the prediction of odorant binding proteins from sequence information.
format Text
id pubmed-2216042
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22160422008-01-29 A machine learning approach for the identification of odorant binding proteins from sequence-derived properties Pugalenthi, Ganesan Tang, Ke Suganthan, PN Archunan, G Sowdhamini, R BMC Bioinformatics Research Article BACKGROUND: Odorant binding proteins (OBPs) are believed to shuttle odorants from the environment to the underlying odorant receptors, for which they could potentially serve as odorant presenters. Although several sequence based search methods have been exploited for protein family prediction, less effort has been devoted to the prediction of OBPs from sequence data and this area is more challenging due to poor sequence identity between these proteins. RESULTS: In this paper, we propose a new algorithm that uses Regularized Least Squares Classifier (RLSC) in conjunction with multiple physicochemical properties of amino acids to predict odorant-binding proteins. The algorithm was applied to the dataset derived from Pfam and GenDiS database and we obtained overall prediction accuracy of 97.7% (94.5% and 98.4% for positive and negative classes respectively). CONCLUSION: Our study suggests that RLSC is potentially useful for predicting the odorant binding proteins from sequence-derived properties irrespective of sequence similarity. Our method predicts 92.8% of 56 odorant binding proteins non-homologous to any protein in the swissprot database and 97.1% of the 414 independent dataset proteins, suggesting the usefulness of RLSC method for facilitating the prediction of odorant binding proteins from sequence information. BioMed Central 2007-09-19 /pmc/articles/PMC2216042/ /pubmed/17880712 http://dx.doi.org/10.1186/1471-2105-8-351 Text en Copyright © 2007 Pugalenthi et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Pugalenthi, Ganesan
Tang, Ke
Suganthan, PN
Archunan, G
Sowdhamini, R
A machine learning approach for the identification of odorant binding proteins from sequence-derived properties
title A machine learning approach for the identification of odorant binding proteins from sequence-derived properties
title_full A machine learning approach for the identification of odorant binding proteins from sequence-derived properties
title_fullStr A machine learning approach for the identification of odorant binding proteins from sequence-derived properties
title_full_unstemmed A machine learning approach for the identification of odorant binding proteins from sequence-derived properties
title_short A machine learning approach for the identification of odorant binding proteins from sequence-derived properties
title_sort machine learning approach for the identification of odorant binding proteins from sequence-derived properties
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2216042/
https://www.ncbi.nlm.nih.gov/pubmed/17880712
http://dx.doi.org/10.1186/1471-2105-8-351
work_keys_str_mv AT pugalenthiganesan amachinelearningapproachfortheidentificationofodorantbindingproteinsfromsequencederivedproperties
AT tangke amachinelearningapproachfortheidentificationofodorantbindingproteinsfromsequencederivedproperties
AT suganthanpn amachinelearningapproachfortheidentificationofodorantbindingproteinsfromsequencederivedproperties
AT archunang amachinelearningapproachfortheidentificationofodorantbindingproteinsfromsequencederivedproperties
AT sowdhaminir amachinelearningapproachfortheidentificationofodorantbindingproteinsfromsequencederivedproperties
AT pugalenthiganesan machinelearningapproachfortheidentificationofodorantbindingproteinsfromsequencederivedproperties
AT tangke machinelearningapproachfortheidentificationofodorantbindingproteinsfromsequencederivedproperties
AT suganthanpn machinelearningapproachfortheidentificationofodorantbindingproteinsfromsequencederivedproperties
AT archunang machinelearningapproachfortheidentificationofodorantbindingproteinsfromsequencederivedproperties
AT sowdhaminir machinelearningapproachfortheidentificationofodorantbindingproteinsfromsequencederivedproperties