Cargando…

NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins

BACKGROUND: Most predictive methods currently available for the identification of protein secretion mechanisms have focused on classically secreted proteins. In fact, only two methods have been reported for predicting non-classically secreted proteins of Gram-positive bacteria. This study describes...

Descripción completa

Detalles Bibliográficos
Autores principales: Restrepo-Montoya, Daniel, Pino, Camilo, Nino, Luis F, Patarroyo, Manuel E, Patarroyo, Manuel A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3025837/
https://www.ncbi.nlm.nih.gov/pubmed/21235786
http://dx.doi.org/10.1186/1471-2105-12-21
_version_ 1782196942805139456
author Restrepo-Montoya, Daniel
Pino, Camilo
Nino, Luis F
Patarroyo, Manuel E
Patarroyo, Manuel A
author_facet Restrepo-Montoya, Daniel
Pino, Camilo
Nino, Luis F
Patarroyo, Manuel E
Patarroyo, Manuel A
author_sort Restrepo-Montoya, Daniel
collection PubMed
description BACKGROUND: Most predictive methods currently available for the identification of protein secretion mechanisms have focused on classically secreted proteins. In fact, only two methods have been reported for predicting non-classically secreted proteins of Gram-positive bacteria. This study describes the implementation of a sequence-based classifier, denoted as NClassG+, for identifying non-classically secreted Gram-positive bacterial proteins. RESULTS: Several feature-based classifiers were trained using different sequence transformation vectors (frequencies, dipeptides, physicochemical factors and PSSM) and Support Vector Machines (SVMs) with Linear, Polynomial and Gaussian kernel functions. Nested k-fold cross-validation (CV) was applied to select the best models, using the inner CV loop to tune the model parameters and the outer CV group to compute the error. The parameters and Kernel functions and the combinations between all possible feature vectors were optimized using grid search. CONCLUSIONS: The final model was tested against an independent set not previously seen by the model, obtaining better predictive performance compared to SecretomeP V2.0 and SecretPV2.0 for the identification of non-classically secreted proteins. NClassG+ is freely available on the web at http://www.biolisi.unal.edu.co/web-servers/nclassgpositive/
format Text
id pubmed-3025837
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30258372011-01-25 NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins Restrepo-Montoya, Daniel Pino, Camilo Nino, Luis F Patarroyo, Manuel E Patarroyo, Manuel A BMC Bioinformatics Research Article BACKGROUND: Most predictive methods currently available for the identification of protein secretion mechanisms have focused on classically secreted proteins. In fact, only two methods have been reported for predicting non-classically secreted proteins of Gram-positive bacteria. This study describes the implementation of a sequence-based classifier, denoted as NClassG+, for identifying non-classically secreted Gram-positive bacterial proteins. RESULTS: Several feature-based classifiers were trained using different sequence transformation vectors (frequencies, dipeptides, physicochemical factors and PSSM) and Support Vector Machines (SVMs) with Linear, Polynomial and Gaussian kernel functions. Nested k-fold cross-validation (CV) was applied to select the best models, using the inner CV loop to tune the model parameters and the outer CV group to compute the error. The parameters and Kernel functions and the combinations between all possible feature vectors were optimized using grid search. CONCLUSIONS: The final model was tested against an independent set not previously seen by the model, obtaining better predictive performance compared to SecretomeP V2.0 and SecretPV2.0 for the identification of non-classically secreted proteins. NClassG+ is freely available on the web at http://www.biolisi.unal.edu.co/web-servers/nclassgpositive/ BioMed Central 2011-01-14 /pmc/articles/PMC3025837/ /pubmed/21235786 http://dx.doi.org/10.1186/1471-2105-12-21 Text en Copyright ©2011 Restrepo-Montoya et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Restrepo-Montoya, Daniel
Pino, Camilo
Nino, Luis F
Patarroyo, Manuel E
Patarroyo, Manuel A
NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins
title NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins
title_full NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins
title_fullStr NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins
title_full_unstemmed NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins
title_short NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins
title_sort nclassg+: a classifier for non-classically secreted gram-positive bacterial proteins
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3025837/
https://www.ncbi.nlm.nih.gov/pubmed/21235786
http://dx.doi.org/10.1186/1471-2105-12-21
work_keys_str_mv AT restrepomontoyadaniel nclassgaclassifierfornonclassicallysecretedgrampositivebacterialproteins
AT pinocamilo nclassgaclassifierfornonclassicallysecretedgrampositivebacterialproteins
AT ninoluisf nclassgaclassifierfornonclassicallysecretedgrampositivebacterialproteins
AT patarroyomanuele nclassgaclassifierfornonclassicallysecretedgrampositivebacterialproteins
AT patarroyomanuela nclassgaclassifierfornonclassicallysecretedgrampositivebacterialproteins