Cargando…
Computational Comparative Study of Tuberculosis Proteomes Using a Model Learned from Signal Peptide Structures
Secretome analysis is important in pathogen studies. A fundamental and convenient way to identify secreted proteins is to first predict signal peptides, which are essential for protein secretion. However, signal peptides are highly complex functional sequences that are easily confused with transmemb...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3322152/ https://www.ncbi.nlm.nih.gov/pubmed/22496884 http://dx.doi.org/10.1371/journal.pone.0035018 |
_version_ | 1782229038571454464 |
---|---|
author | Lai, Jhih-Siang Cheng, Cheng-Wei Sung, Ting-Yi Hsu, Wen-Lian |
author_facet | Lai, Jhih-Siang Cheng, Cheng-Wei Sung, Ting-Yi Hsu, Wen-Lian |
author_sort | Lai, Jhih-Siang |
collection | PubMed |
description | Secretome analysis is important in pathogen studies. A fundamental and convenient way to identify secreted proteins is to first predict signal peptides, which are essential for protein secretion. However, signal peptides are highly complex functional sequences that are easily confused with transmembrane domains. Such confusion would obviously affect the discovery of secreted proteins. Transmembrane proteins are important drug targets, but very few transmembrane protein structures have been determined experimentally; hence, prediction of the structures is essential. In the field of structure prediction, researchers do not make assumptions about organisms, so there is a need for a general signal peptide predictor. To improve signal peptide prediction without prior knowledge of the associated organisms, we present a machine-learning method, called SVMSignal, which uses biochemical properties as features, as well as features acquired from a novel encoding, to capture biochemical profile patterns for learning the structures of signal peptides directly. We tested SVMSignal and five popular methods on two benchmark datasets from the SPdb and UniProt/Swiss-Prot databases, respectively. Although SVMSignal was trained on an old dataset, it performed well, and the results demonstrate that learning the structures of signal peptides directly is a promising approach. We also utilized SVMSignal to analyze proteomes in the entire HAMAP microbial database. Finally, we conducted a comparative study of secretome analysis on seven tuberculosis-related strains selected from the HAMAP database. We identified ten potential secreted proteins, two of which are drug resistant and four are potential transmembrane proteins. SVMSignal is publicly available at http://bio-cluster.iis.sinica.edu.tw/SVMSignal. It provides user-friendly interfaces and visualizations, and the prediction results are available for download. |
format | Online Article Text |
id | pubmed-3322152 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-33221522012-04-11 Computational Comparative Study of Tuberculosis Proteomes Using a Model Learned from Signal Peptide Structures Lai, Jhih-Siang Cheng, Cheng-Wei Sung, Ting-Yi Hsu, Wen-Lian PLoS One Research Article Secretome analysis is important in pathogen studies. A fundamental and convenient way to identify secreted proteins is to first predict signal peptides, which are essential for protein secretion. However, signal peptides are highly complex functional sequences that are easily confused with transmembrane domains. Such confusion would obviously affect the discovery of secreted proteins. Transmembrane proteins are important drug targets, but very few transmembrane protein structures have been determined experimentally; hence, prediction of the structures is essential. In the field of structure prediction, researchers do not make assumptions about organisms, so there is a need for a general signal peptide predictor. To improve signal peptide prediction without prior knowledge of the associated organisms, we present a machine-learning method, called SVMSignal, which uses biochemical properties as features, as well as features acquired from a novel encoding, to capture biochemical profile patterns for learning the structures of signal peptides directly. We tested SVMSignal and five popular methods on two benchmark datasets from the SPdb and UniProt/Swiss-Prot databases, respectively. Although SVMSignal was trained on an old dataset, it performed well, and the results demonstrate that learning the structures of signal peptides directly is a promising approach. We also utilized SVMSignal to analyze proteomes in the entire HAMAP microbial database. Finally, we conducted a comparative study of secretome analysis on seven tuberculosis-related strains selected from the HAMAP database. We identified ten potential secreted proteins, two of which are drug resistant and four are potential transmembrane proteins. SVMSignal is publicly available at http://bio-cluster.iis.sinica.edu.tw/SVMSignal. It provides user-friendly interfaces and visualizations, and the prediction results are available for download. Public Library of Science 2012-04-09 /pmc/articles/PMC3322152/ /pubmed/22496884 http://dx.doi.org/10.1371/journal.pone.0035018 Text en Lai et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Lai, Jhih-Siang Cheng, Cheng-Wei Sung, Ting-Yi Hsu, Wen-Lian Computational Comparative Study of Tuberculosis Proteomes Using a Model Learned from Signal Peptide Structures |
title | Computational Comparative Study of Tuberculosis Proteomes Using a Model Learned from Signal Peptide Structures |
title_full | Computational Comparative Study of Tuberculosis Proteomes Using a Model Learned from Signal Peptide Structures |
title_fullStr | Computational Comparative Study of Tuberculosis Proteomes Using a Model Learned from Signal Peptide Structures |
title_full_unstemmed | Computational Comparative Study of Tuberculosis Proteomes Using a Model Learned from Signal Peptide Structures |
title_short | Computational Comparative Study of Tuberculosis Proteomes Using a Model Learned from Signal Peptide Structures |
title_sort | computational comparative study of tuberculosis proteomes using a model learned from signal peptide structures |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3322152/ https://www.ncbi.nlm.nih.gov/pubmed/22496884 http://dx.doi.org/10.1371/journal.pone.0035018 |
work_keys_str_mv | AT laijhihsiang computationalcomparativestudyoftuberculosisproteomesusingamodellearnedfromsignalpeptidestructures AT chengchengwei computationalcomparativestudyoftuberculosisproteomesusingamodellearnedfromsignalpeptidestructures AT sungtingyi computationalcomparativestudyoftuberculosisproteomesusingamodellearnedfromsignalpeptidestructures AT hsuwenlian computationalcomparativestudyoftuberculosisproteomesusingamodellearnedfromsignalpeptidestructures |