Cargando…

Computational Comparative Study of Tuberculosis Proteomes Using a Model Learned from Signal Peptide Structures

Secretome analysis is important in pathogen studies. A fundamental and convenient way to identify secreted proteins is to first predict signal peptides, which are essential for protein secretion. However, signal peptides are highly complex functional sequences that are easily confused with transmemb...

Descripción completa

Detalles Bibliográficos
Autores principales: Lai, Jhih-Siang, Cheng, Cheng-Wei, Sung, Ting-Yi, Hsu, Wen-Lian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3322152/
https://www.ncbi.nlm.nih.gov/pubmed/22496884
http://dx.doi.org/10.1371/journal.pone.0035018
_version_ 1782229038571454464
author Lai, Jhih-Siang
Cheng, Cheng-Wei
Sung, Ting-Yi
Hsu, Wen-Lian
author_facet Lai, Jhih-Siang
Cheng, Cheng-Wei
Sung, Ting-Yi
Hsu, Wen-Lian
author_sort Lai, Jhih-Siang
collection PubMed
description Secretome analysis is important in pathogen studies. A fundamental and convenient way to identify secreted proteins is to first predict signal peptides, which are essential for protein secretion. However, signal peptides are highly complex functional sequences that are easily confused with transmembrane domains. Such confusion would obviously affect the discovery of secreted proteins. Transmembrane proteins are important drug targets, but very few transmembrane protein structures have been determined experimentally; hence, prediction of the structures is essential. In the field of structure prediction, researchers do not make assumptions about organisms, so there is a need for a general signal peptide predictor. To improve signal peptide prediction without prior knowledge of the associated organisms, we present a machine-learning method, called SVMSignal, which uses biochemical properties as features, as well as features acquired from a novel encoding, to capture biochemical profile patterns for learning the structures of signal peptides directly. We tested SVMSignal and five popular methods on two benchmark datasets from the SPdb and UniProt/Swiss-Prot databases, respectively. Although SVMSignal was trained on an old dataset, it performed well, and the results demonstrate that learning the structures of signal peptides directly is a promising approach. We also utilized SVMSignal to analyze proteomes in the entire HAMAP microbial database. Finally, we conducted a comparative study of secretome analysis on seven tuberculosis-related strains selected from the HAMAP database. We identified ten potential secreted proteins, two of which are drug resistant and four are potential transmembrane proteins. SVMSignal is publicly available at http://bio-cluster.iis.sinica.edu.tw/SVMSignal. It provides user-friendly interfaces and visualizations, and the prediction results are available for download.
format Online
Article
Text
id pubmed-3322152
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-33221522012-04-11 Computational Comparative Study of Tuberculosis Proteomes Using a Model Learned from Signal Peptide Structures Lai, Jhih-Siang Cheng, Cheng-Wei Sung, Ting-Yi Hsu, Wen-Lian PLoS One Research Article Secretome analysis is important in pathogen studies. A fundamental and convenient way to identify secreted proteins is to first predict signal peptides, which are essential for protein secretion. However, signal peptides are highly complex functional sequences that are easily confused with transmembrane domains. Such confusion would obviously affect the discovery of secreted proteins. Transmembrane proteins are important drug targets, but very few transmembrane protein structures have been determined experimentally; hence, prediction of the structures is essential. In the field of structure prediction, researchers do not make assumptions about organisms, so there is a need for a general signal peptide predictor. To improve signal peptide prediction without prior knowledge of the associated organisms, we present a machine-learning method, called SVMSignal, which uses biochemical properties as features, as well as features acquired from a novel encoding, to capture biochemical profile patterns for learning the structures of signal peptides directly. We tested SVMSignal and five popular methods on two benchmark datasets from the SPdb and UniProt/Swiss-Prot databases, respectively. Although SVMSignal was trained on an old dataset, it performed well, and the results demonstrate that learning the structures of signal peptides directly is a promising approach. We also utilized SVMSignal to analyze proteomes in the entire HAMAP microbial database. Finally, we conducted a comparative study of secretome analysis on seven tuberculosis-related strains selected from the HAMAP database. We identified ten potential secreted proteins, two of which are drug resistant and four are potential transmembrane proteins. SVMSignal is publicly available at http://bio-cluster.iis.sinica.edu.tw/SVMSignal. It provides user-friendly interfaces and visualizations, and the prediction results are available for download. Public Library of Science 2012-04-09 /pmc/articles/PMC3322152/ /pubmed/22496884 http://dx.doi.org/10.1371/journal.pone.0035018 Text en Lai et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Lai, Jhih-Siang
Cheng, Cheng-Wei
Sung, Ting-Yi
Hsu, Wen-Lian
Computational Comparative Study of Tuberculosis Proteomes Using a Model Learned from Signal Peptide Structures
title Computational Comparative Study of Tuberculosis Proteomes Using a Model Learned from Signal Peptide Structures
title_full Computational Comparative Study of Tuberculosis Proteomes Using a Model Learned from Signal Peptide Structures
title_fullStr Computational Comparative Study of Tuberculosis Proteomes Using a Model Learned from Signal Peptide Structures
title_full_unstemmed Computational Comparative Study of Tuberculosis Proteomes Using a Model Learned from Signal Peptide Structures
title_short Computational Comparative Study of Tuberculosis Proteomes Using a Model Learned from Signal Peptide Structures
title_sort computational comparative study of tuberculosis proteomes using a model learned from signal peptide structures
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3322152/
https://www.ncbi.nlm.nih.gov/pubmed/22496884
http://dx.doi.org/10.1371/journal.pone.0035018
work_keys_str_mv AT laijhihsiang computationalcomparativestudyoftuberculosisproteomesusingamodellearnedfromsignalpeptidestructures
AT chengchengwei computationalcomparativestudyoftuberculosisproteomesusingamodellearnedfromsignalpeptidestructures
AT sungtingyi computationalcomparativestudyoftuberculosisproteomesusingamodellearnedfromsignalpeptidestructures
AT hsuwenlian computationalcomparativestudyoftuberculosisproteomesusingamodellearnedfromsignalpeptidestructures