Cargando…
PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method
Gram-negative bacteria use various secretion systems to deliver their secreted effectors. Among them, type IV secretion system exists widely in a variety of bacterial species, and secretes type IV secreted effectors (T4SEs), which play vital roles in host-pathogen interactions. However, experimental...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6212463/ https://www.ncbi.nlm.nih.gov/pubmed/30416498 http://dx.doi.org/10.3389/fmicb.2018.02571 |
_version_ | 1783367542871425024 |
---|---|
author | Xiong, Yi Wang, Qiankun Yang, Junchen Zhu, Xiaolei Wei, Dong-Qing |
author_facet | Xiong, Yi Wang, Qiankun Yang, Junchen Zhu, Xiaolei Wei, Dong-Qing |
author_sort | Xiong, Yi |
collection | PubMed |
description | Gram-negative bacteria use various secretion systems to deliver their secreted effectors. Among them, type IV secretion system exists widely in a variety of bacterial species, and secretes type IV secreted effectors (T4SEs), which play vital roles in host-pathogen interactions. However, experimental approaches to identify T4SEs are time- and resource-consuming. In the present study, we aim to develop an in silico stacked ensemble method to predict whether a protein is an effector of type IV secretion system or not based on its sequence information. The protein sequences were encoded by the feature of position specific scoring matrix (PSSM)-composition by summing rows that correspond to the same amino acid residues in PSSM profiles. Based on the PSSM-composition features, we develop a stacked ensemble model PredT4SE-Stack to predict T4SEs, which utilized an ensemble of base-classifiers implemented by various machine learning algorithms, such as support vector machine, gradient boosting machine, and extremely randomized trees, to generate outputs for the meta-classifier in the classification system. Our results demonstrated that the framework of PredT4SE-Stack was a feasible and effective way to accurately identify T4SEs based on protein sequence information. The datasets and source code of PredT4SE-Stack are freely available at http://xbioinfo.sjtu.edu.cn/PredT4SE_Stack/index.php. |
format | Online Article Text |
id | pubmed-6212463 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-62124632018-11-09 PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method Xiong, Yi Wang, Qiankun Yang, Junchen Zhu, Xiaolei Wei, Dong-Qing Front Microbiol Microbiology Gram-negative bacteria use various secretion systems to deliver their secreted effectors. Among them, type IV secretion system exists widely in a variety of bacterial species, and secretes type IV secreted effectors (T4SEs), which play vital roles in host-pathogen interactions. However, experimental approaches to identify T4SEs are time- and resource-consuming. In the present study, we aim to develop an in silico stacked ensemble method to predict whether a protein is an effector of type IV secretion system or not based on its sequence information. The protein sequences were encoded by the feature of position specific scoring matrix (PSSM)-composition by summing rows that correspond to the same amino acid residues in PSSM profiles. Based on the PSSM-composition features, we develop a stacked ensemble model PredT4SE-Stack to predict T4SEs, which utilized an ensemble of base-classifiers implemented by various machine learning algorithms, such as support vector machine, gradient boosting machine, and extremely randomized trees, to generate outputs for the meta-classifier in the classification system. Our results demonstrated that the framework of PredT4SE-Stack was a feasible and effective way to accurately identify T4SEs based on protein sequence information. The datasets and source code of PredT4SE-Stack are freely available at http://xbioinfo.sjtu.edu.cn/PredT4SE_Stack/index.php. Frontiers Media S.A. 2018-10-26 /pmc/articles/PMC6212463/ /pubmed/30416498 http://dx.doi.org/10.3389/fmicb.2018.02571 Text en Copyright © 2018 Xiong, Wang, Yang, Zhu and Wei. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Microbiology Xiong, Yi Wang, Qiankun Yang, Junchen Zhu, Xiaolei Wei, Dong-Qing PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method |
title | PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method |
title_full | PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method |
title_fullStr | PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method |
title_full_unstemmed | PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method |
title_short | PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method |
title_sort | predt4se-stack: prediction of bacterial type iv secreted effectors from protein sequences using a stacked ensemble method |
topic | Microbiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6212463/ https://www.ncbi.nlm.nih.gov/pubmed/30416498 http://dx.doi.org/10.3389/fmicb.2018.02571 |
work_keys_str_mv | AT xiongyi predt4sestackpredictionofbacterialtypeivsecretedeffectorsfromproteinsequencesusingastackedensemblemethod AT wangqiankun predt4sestackpredictionofbacterialtypeivsecretedeffectorsfromproteinsequencesusingastackedensemblemethod AT yangjunchen predt4sestackpredictionofbacterialtypeivsecretedeffectorsfromproteinsequencesusingastackedensemblemethod AT zhuxiaolei predt4sestackpredictionofbacterialtypeivsecretedeffectorsfromproteinsequencesusingastackedensemblemethod AT weidongqing predt4sestackpredictionofbacterialtypeivsecretedeffectorsfromproteinsequencesusingastackedensemblemethod |