Cargando…

Identification of Proteins Secreted by Malaria Parasite into Erythrocyte using SVM and PSSM profiles

BACKGROUND: Malaria parasite secretes various proteins in infected RBC for its growth and survival. Thus identification of these secretory proteins is important for developing vaccine/drug against malaria. The existing motif-based methods have got limited success due to lack of universal motif in al...

Descripción completa

Detalles Bibliográficos
Autores principales: Verma, Ruchi, Tiwari, Ajit, Kaur, Sukhwinder, Varshney, Grish C, Raghava, Gajendra PS
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2358896/
https://www.ncbi.nlm.nih.gov/pubmed/18416838
http://dx.doi.org/10.1186/1471-2105-9-201
_version_ 1782152865182121984
author Verma, Ruchi
Tiwari, Ajit
Kaur, Sukhwinder
Varshney, Grish C
Raghava, Gajendra PS
author_facet Verma, Ruchi
Tiwari, Ajit
Kaur, Sukhwinder
Varshney, Grish C
Raghava, Gajendra PS
author_sort Verma, Ruchi
collection PubMed
description BACKGROUND: Malaria parasite secretes various proteins in infected RBC for its growth and survival. Thus identification of these secretory proteins is important for developing vaccine/drug against malaria. The existing motif-based methods have got limited success due to lack of universal motif in all secretory proteins of malaria parasite. RESULTS: In this study a systematic attempt has been made to develop a general method for predicting secretory proteins of malaria parasite. All models were trained and tested on a non-redundant dataset of 252 secretory and 252 non-secretory proteins. We developed SVM models and achieved maximum MCC 0.72 with 85.65% accuracy and MCC 0.74 with 86.45% accuracy using amino acid and dipeptide composition respectively. SVM models were developed using split-amino acid and split-dipeptide composition and achieved maximum MCC 0.74 with 86.40% accuracy and MCC 0.77 with accuracy 88.22% respectively. In this study, for the first time PSSM profiles obtained from PSI-BLAST, have been used for predicting secretory proteins. We achieved maximum MCC 0.86 with 92.66% accuracy using PSSM based SVM model. All models developed in this study were evaluated using 5-fold cross-validation technique. CONCLUSION: This study demonstrates that secretory proteins have different residue composition than non-secretory proteins. Thus, it is possible to predict secretory proteins from its residue composition-using machine learning technique. The multiple sequence alignment provides more information than sequence itself. Thus performance of method based on PSSM profile is more accurate than method based on sequence composition. A web server PSEApred has been developed for predicting secretory proteins of malaria parasites,the URL can be found in the Availability and requirements section.
format Text
id pubmed-2358896
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-23588962008-04-29 Identification of Proteins Secreted by Malaria Parasite into Erythrocyte using SVM and PSSM profiles Verma, Ruchi Tiwari, Ajit Kaur, Sukhwinder Varshney, Grish C Raghava, Gajendra PS BMC Bioinformatics Research Article BACKGROUND: Malaria parasite secretes various proteins in infected RBC for its growth and survival. Thus identification of these secretory proteins is important for developing vaccine/drug against malaria. The existing motif-based methods have got limited success due to lack of universal motif in all secretory proteins of malaria parasite. RESULTS: In this study a systematic attempt has been made to develop a general method for predicting secretory proteins of malaria parasite. All models were trained and tested on a non-redundant dataset of 252 secretory and 252 non-secretory proteins. We developed SVM models and achieved maximum MCC 0.72 with 85.65% accuracy and MCC 0.74 with 86.45% accuracy using amino acid and dipeptide composition respectively. SVM models were developed using split-amino acid and split-dipeptide composition and achieved maximum MCC 0.74 with 86.40% accuracy and MCC 0.77 with accuracy 88.22% respectively. In this study, for the first time PSSM profiles obtained from PSI-BLAST, have been used for predicting secretory proteins. We achieved maximum MCC 0.86 with 92.66% accuracy using PSSM based SVM model. All models developed in this study were evaluated using 5-fold cross-validation technique. CONCLUSION: This study demonstrates that secretory proteins have different residue composition than non-secretory proteins. Thus, it is possible to predict secretory proteins from its residue composition-using machine learning technique. The multiple sequence alignment provides more information than sequence itself. Thus performance of method based on PSSM profile is more accurate than method based on sequence composition. A web server PSEApred has been developed for predicting secretory proteins of malaria parasites,the URL can be found in the Availability and requirements section. BioMed Central 2008-04-16 /pmc/articles/PMC2358896/ /pubmed/18416838 http://dx.doi.org/10.1186/1471-2105-9-201 Text en Copyright © 2008 Verma et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Verma, Ruchi
Tiwari, Ajit
Kaur, Sukhwinder
Varshney, Grish C
Raghava, Gajendra PS
Identification of Proteins Secreted by Malaria Parasite into Erythrocyte using SVM and PSSM profiles
title Identification of Proteins Secreted by Malaria Parasite into Erythrocyte using SVM and PSSM profiles
title_full Identification of Proteins Secreted by Malaria Parasite into Erythrocyte using SVM and PSSM profiles
title_fullStr Identification of Proteins Secreted by Malaria Parasite into Erythrocyte using SVM and PSSM profiles
title_full_unstemmed Identification of Proteins Secreted by Malaria Parasite into Erythrocyte using SVM and PSSM profiles
title_short Identification of Proteins Secreted by Malaria Parasite into Erythrocyte using SVM and PSSM profiles
title_sort identification of proteins secreted by malaria parasite into erythrocyte using svm and pssm profiles
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2358896/
https://www.ncbi.nlm.nih.gov/pubmed/18416838
http://dx.doi.org/10.1186/1471-2105-9-201
work_keys_str_mv AT vermaruchi identificationofproteinssecretedbymalariaparasiteintoerythrocyteusingsvmandpssmprofiles
AT tiwariajit identificationofproteinssecretedbymalariaparasiteintoerythrocyteusingsvmandpssmprofiles
AT kaursukhwinder identificationofproteinssecretedbymalariaparasiteintoerythrocyteusingsvmandpssmprofiles
AT varshneygrishc identificationofproteinssecretedbymalariaparasiteintoerythrocyteusingsvmandpssmprofiles
AT raghavagajendraps identificationofproteinssecretedbymalariaparasiteintoerythrocyteusingsvmandpssmprofiles