Cargando…

iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles

Many gram-negative bacteria use type IV secretion systems to deliver effector molecules to a wide range of target cells. These substrate proteins, which are called type IV secreted effectors (T4SE), manipulate host cell processes during infection, often resulting in severe diseases or even death of...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Haitao, Ding, Chenchen, Cheng, Xin, Sang, Xiuzhi, Liu, Taigang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8123216/
https://www.ncbi.nlm.nih.gov/pubmed/33923273
http://dx.doi.org/10.3390/molecules26092487
_version_ 1783692837931450368
author Han, Haitao
Ding, Chenchen
Cheng, Xin
Sang, Xiuzhi
Liu, Taigang
author_facet Han, Haitao
Ding, Chenchen
Cheng, Xin
Sang, Xiuzhi
Liu, Taigang
author_sort Han, Haitao
collection PubMed
description Many gram-negative bacteria use type IV secretion systems to deliver effector molecules to a wide range of target cells. These substrate proteins, which are called type IV secreted effectors (T4SE), manipulate host cell processes during infection, often resulting in severe diseases or even death of the host. Therefore, identification of putative T4SEs has become a very active research topic in bioinformatics due to its vital roles in understanding host-pathogen interactions. PSI-BLAST profiles have been experimentally validated to provide important and discriminatory evolutionary information for various protein classification tasks. In the present study, an accurate computational predictor termed iT4SE-EP was developed for identifying T4SEs by extracting evolutionary features from the position-specific scoring matrix and the position-specific frequency matrix profiles. First, four types of encoding strategies were designed to transform protein sequences into fixed-length feature vectors based on the two profiles. Then, the feature selection technique based on the random forest algorithm was utilized to reduce redundant or irrelevant features without much loss of information. Finally, the optimal features were input into a support vector machine classifier to carry out the prediction of T4SEs. Our experimental results demonstrated that iT4SE-EP outperformed most of existing methods based on the independent dataset test.
format Online
Article
Text
id pubmed-8123216
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-81232162021-05-16 iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles Han, Haitao Ding, Chenchen Cheng, Xin Sang, Xiuzhi Liu, Taigang Molecules Article Many gram-negative bacteria use type IV secretion systems to deliver effector molecules to a wide range of target cells. These substrate proteins, which are called type IV secreted effectors (T4SE), manipulate host cell processes during infection, often resulting in severe diseases or even death of the host. Therefore, identification of putative T4SEs has become a very active research topic in bioinformatics due to its vital roles in understanding host-pathogen interactions. PSI-BLAST profiles have been experimentally validated to provide important and discriminatory evolutionary information for various protein classification tasks. In the present study, an accurate computational predictor termed iT4SE-EP was developed for identifying T4SEs by extracting evolutionary features from the position-specific scoring matrix and the position-specific frequency matrix profiles. First, four types of encoding strategies were designed to transform protein sequences into fixed-length feature vectors based on the two profiles. Then, the feature selection technique based on the random forest algorithm was utilized to reduce redundant or irrelevant features without much loss of information. Finally, the optimal features were input into a support vector machine classifier to carry out the prediction of T4SEs. Our experimental results demonstrated that iT4SE-EP outperformed most of existing methods based on the independent dataset test. MDPI 2021-04-24 /pmc/articles/PMC8123216/ /pubmed/33923273 http://dx.doi.org/10.3390/molecules26092487 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Han, Haitao
Ding, Chenchen
Cheng, Xin
Sang, Xiuzhi
Liu, Taigang
iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles
title iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles
title_full iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles
title_fullStr iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles
title_full_unstemmed iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles
title_short iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles
title_sort it4se-ep: accurate identification of bacterial type iv secreted effectors by exploring evolutionary features from two psi-blast profiles
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8123216/
https://www.ncbi.nlm.nih.gov/pubmed/33923273
http://dx.doi.org/10.3390/molecules26092487
work_keys_str_mv AT hanhaitao it4seepaccurateidentificationofbacterialtypeivsecretedeffectorsbyexploringevolutionaryfeaturesfromtwopsiblastprofiles
AT dingchenchen it4seepaccurateidentificationofbacterialtypeivsecretedeffectorsbyexploringevolutionaryfeaturesfromtwopsiblastprofiles
AT chengxin it4seepaccurateidentificationofbacterialtypeivsecretedeffectorsbyexploringevolutionaryfeaturesfromtwopsiblastprofiles
AT sangxiuzhi it4seepaccurateidentificationofbacterialtypeivsecretedeffectorsbyexploringevolutionaryfeaturesfromtwopsiblastprofiles
AT liutaigang it4seepaccurateidentificationofbacterialtypeivsecretedeffectorsbyexploringevolutionaryfeaturesfromtwopsiblastprofiles