Cargando…

PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method

Gram-negative bacteria use various secretion systems to deliver their secreted effectors. Among them, type IV secretion system exists widely in a variety of bacterial species, and secretes type IV secreted effectors (T4SEs), which play vital roles in host-pathogen interactions. However, experimental...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiong, Yi, Wang, Qiankun, Yang, Junchen, Zhu, Xiaolei, Wei, Dong-Qing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6212463/
https://www.ncbi.nlm.nih.gov/pubmed/30416498
http://dx.doi.org/10.3389/fmicb.2018.02571
_version_ 1783367542871425024
author Xiong, Yi
Wang, Qiankun
Yang, Junchen
Zhu, Xiaolei
Wei, Dong-Qing
author_facet Xiong, Yi
Wang, Qiankun
Yang, Junchen
Zhu, Xiaolei
Wei, Dong-Qing
author_sort Xiong, Yi
collection PubMed
description Gram-negative bacteria use various secretion systems to deliver their secreted effectors. Among them, type IV secretion system exists widely in a variety of bacterial species, and secretes type IV secreted effectors (T4SEs), which play vital roles in host-pathogen interactions. However, experimental approaches to identify T4SEs are time- and resource-consuming. In the present study, we aim to develop an in silico stacked ensemble method to predict whether a protein is an effector of type IV secretion system or not based on its sequence information. The protein sequences were encoded by the feature of position specific scoring matrix (PSSM)-composition by summing rows that correspond to the same amino acid residues in PSSM profiles. Based on the PSSM-composition features, we develop a stacked ensemble model PredT4SE-Stack to predict T4SEs, which utilized an ensemble of base-classifiers implemented by various machine learning algorithms, such as support vector machine, gradient boosting machine, and extremely randomized trees, to generate outputs for the meta-classifier in the classification system. Our results demonstrated that the framework of PredT4SE-Stack was a feasible and effective way to accurately identify T4SEs based on protein sequence information. The datasets and source code of PredT4SE-Stack are freely available at http://xbioinfo.sjtu.edu.cn/PredT4SE_Stack/index.php.
format Online
Article
Text
id pubmed-6212463
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-62124632018-11-09 PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method Xiong, Yi Wang, Qiankun Yang, Junchen Zhu, Xiaolei Wei, Dong-Qing Front Microbiol Microbiology Gram-negative bacteria use various secretion systems to deliver their secreted effectors. Among them, type IV secretion system exists widely in a variety of bacterial species, and secretes type IV secreted effectors (T4SEs), which play vital roles in host-pathogen interactions. However, experimental approaches to identify T4SEs are time- and resource-consuming. In the present study, we aim to develop an in silico stacked ensemble method to predict whether a protein is an effector of type IV secretion system or not based on its sequence information. The protein sequences were encoded by the feature of position specific scoring matrix (PSSM)-composition by summing rows that correspond to the same amino acid residues in PSSM profiles. Based on the PSSM-composition features, we develop a stacked ensemble model PredT4SE-Stack to predict T4SEs, which utilized an ensemble of base-classifiers implemented by various machine learning algorithms, such as support vector machine, gradient boosting machine, and extremely randomized trees, to generate outputs for the meta-classifier in the classification system. Our results demonstrated that the framework of PredT4SE-Stack was a feasible and effective way to accurately identify T4SEs based on protein sequence information. The datasets and source code of PredT4SE-Stack are freely available at http://xbioinfo.sjtu.edu.cn/PredT4SE_Stack/index.php. Frontiers Media S.A. 2018-10-26 /pmc/articles/PMC6212463/ /pubmed/30416498 http://dx.doi.org/10.3389/fmicb.2018.02571 Text en Copyright © 2018 Xiong, Wang, Yang, Zhu and Wei. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Xiong, Yi
Wang, Qiankun
Yang, Junchen
Zhu, Xiaolei
Wei, Dong-Qing
PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method
title PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method
title_full PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method
title_fullStr PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method
title_full_unstemmed PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method
title_short PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method
title_sort predt4se-stack: prediction of bacterial type iv secreted effectors from protein sequences using a stacked ensemble method
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6212463/
https://www.ncbi.nlm.nih.gov/pubmed/30416498
http://dx.doi.org/10.3389/fmicb.2018.02571
work_keys_str_mv AT xiongyi predt4sestackpredictionofbacterialtypeivsecretedeffectorsfromproteinsequencesusingastackedensemblemethod
AT wangqiankun predt4sestackpredictionofbacterialtypeivsecretedeffectorsfromproteinsequencesusingastackedensemblemethod
AT yangjunchen predt4sestackpredictionofbacterialtypeivsecretedeffectorsfromproteinsequencesusingastackedensemblemethod
AT zhuxiaolei predt4sestackpredictionofbacterialtypeivsecretedeffectorsfromproteinsequencesusingastackedensemblemethod
AT weidongqing predt4sestackpredictionofbacterialtypeivsecretedeffectorsfromproteinsequencesusingastackedensemblemethod