Cargando…

Support Vector Machine-based method for predicting subcellular localization of mycobacterial proteins using evolutionary information and motifs

BACKGROUND: In past number of methods have been developed for predicting subcellular location of eukaryotic, prokaryotic (Gram-negative and Gram-positive bacteria) and human proteins but no method has been developed for mycobacterial proteins which may represent repertoire of potent immunogens of th...

Descripción completa

Detalles Bibliográficos
Autores principales: Rashid, Mamoon, Saha, Sudipto, Raghava, Gajendra PS
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2147037/
https://www.ncbi.nlm.nih.gov/pubmed/17854501
http://dx.doi.org/10.1186/1471-2105-8-337
_version_ 1782144349193109504
author Rashid, Mamoon
Saha, Sudipto
Raghava, Gajendra PS
author_facet Rashid, Mamoon
Saha, Sudipto
Raghava, Gajendra PS
author_sort Rashid, Mamoon
collection PubMed
description BACKGROUND: In past number of methods have been developed for predicting subcellular location of eukaryotic, prokaryotic (Gram-negative and Gram-positive bacteria) and human proteins but no method has been developed for mycobacterial proteins which may represent repertoire of potent immunogens of this dreaded pathogen. In this study, attempt has been made to develop method for predicting subcellular location of mycobacterial proteins. RESULTS: The models were trained and tested on 852 mycobacterial proteins and evaluated using five-fold cross-validation technique. First SVM (Support Vector Machine) model was developed using amino acid composition and overall accuracy of 82.51% was achieved with average accuracy (mean of class-wise accuracy) of 68.47%. In order to utilize evolutionary information, a SVM model was developed using PSSM (Position-Specific Scoring Matrix) profiles obtained from PSI-BLAST (Position-Specific Iterated BLAST) and overall accuracy achieved was of 86.62% with average accuracy of 73.71%. In addition, HMM (Hidden Markov Model), MEME/MAST (Multiple Em for Motif Elicitation/Motif Alignment and Search Tool) and hybrid model that combined two or more models were also developed. We achieved maximum overall accuracy of 86.8% with average accuracy of 89.00% using combination of PSSM based SVM model and MEME/MAST. Performance of our method was compared with that of the existing methods developed for predicting subcellular locations of Gram-positive bacterial proteins. CONCLUSION: A highly accurate method has been developed for predicting subcellular location of mycobacterial proteins. This method also predicts very important class of proteins that is membrane-attached proteins. This method will be useful in annotating newly sequenced or hypothetical mycobacterial proteins. Based on above study, a freely accessible web server TBpred http://www.imtech.res.in/raghava/tbpred/ has been developed.
format Text
id pubmed-2147037
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-21470372007-12-19 Support Vector Machine-based method for predicting subcellular localization of mycobacterial proteins using evolutionary information and motifs Rashid, Mamoon Saha, Sudipto Raghava, Gajendra PS BMC Bioinformatics Research Article BACKGROUND: In past number of methods have been developed for predicting subcellular location of eukaryotic, prokaryotic (Gram-negative and Gram-positive bacteria) and human proteins but no method has been developed for mycobacterial proteins which may represent repertoire of potent immunogens of this dreaded pathogen. In this study, attempt has been made to develop method for predicting subcellular location of mycobacterial proteins. RESULTS: The models were trained and tested on 852 mycobacterial proteins and evaluated using five-fold cross-validation technique. First SVM (Support Vector Machine) model was developed using amino acid composition and overall accuracy of 82.51% was achieved with average accuracy (mean of class-wise accuracy) of 68.47%. In order to utilize evolutionary information, a SVM model was developed using PSSM (Position-Specific Scoring Matrix) profiles obtained from PSI-BLAST (Position-Specific Iterated BLAST) and overall accuracy achieved was of 86.62% with average accuracy of 73.71%. In addition, HMM (Hidden Markov Model), MEME/MAST (Multiple Em for Motif Elicitation/Motif Alignment and Search Tool) and hybrid model that combined two or more models were also developed. We achieved maximum overall accuracy of 86.8% with average accuracy of 89.00% using combination of PSSM based SVM model and MEME/MAST. Performance of our method was compared with that of the existing methods developed for predicting subcellular locations of Gram-positive bacterial proteins. CONCLUSION: A highly accurate method has been developed for predicting subcellular location of mycobacterial proteins. This method also predicts very important class of proteins that is membrane-attached proteins. This method will be useful in annotating newly sequenced or hypothetical mycobacterial proteins. Based on above study, a freely accessible web server TBpred http://www.imtech.res.in/raghava/tbpred/ has been developed. BioMed Central 2007-09-13 /pmc/articles/PMC2147037/ /pubmed/17854501 http://dx.doi.org/10.1186/1471-2105-8-337 Text en Copyright ©2007 Rashid et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Rashid, Mamoon
Saha, Sudipto
Raghava, Gajendra PS
Support Vector Machine-based method for predicting subcellular localization of mycobacterial proteins using evolutionary information and motifs
title Support Vector Machine-based method for predicting subcellular localization of mycobacterial proteins using evolutionary information and motifs
title_full Support Vector Machine-based method for predicting subcellular localization of mycobacterial proteins using evolutionary information and motifs
title_fullStr Support Vector Machine-based method for predicting subcellular localization of mycobacterial proteins using evolutionary information and motifs
title_full_unstemmed Support Vector Machine-based method for predicting subcellular localization of mycobacterial proteins using evolutionary information and motifs
title_short Support Vector Machine-based method for predicting subcellular localization of mycobacterial proteins using evolutionary information and motifs
title_sort support vector machine-based method for predicting subcellular localization of mycobacterial proteins using evolutionary information and motifs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2147037/
https://www.ncbi.nlm.nih.gov/pubmed/17854501
http://dx.doi.org/10.1186/1471-2105-8-337
work_keys_str_mv AT rashidmamoon supportvectormachinebasedmethodforpredictingsubcellularlocalizationofmycobacterialproteinsusingevolutionaryinformationandmotifs
AT sahasudipto supportvectormachinebasedmethodforpredictingsubcellularlocalizationofmycobacterialproteinsusingevolutionaryinformationandmotifs
AT raghavagajendraps supportvectormachinebasedmethodforpredictingsubcellularlocalizationofmycobacterialproteinsusingevolutionaryinformationandmotifs