Cargando…
iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles
Many gram-negative bacteria use type IV secretion systems to deliver effector molecules to a wide range of target cells. These substrate proteins, which are called type IV secreted effectors (T4SE), manipulate host cell processes during infection, often resulting in severe diseases or even death of...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8123216/ https://www.ncbi.nlm.nih.gov/pubmed/33923273 http://dx.doi.org/10.3390/molecules26092487 |
_version_ | 1783692837931450368 |
---|---|
author | Han, Haitao Ding, Chenchen Cheng, Xin Sang, Xiuzhi Liu, Taigang |
author_facet | Han, Haitao Ding, Chenchen Cheng, Xin Sang, Xiuzhi Liu, Taigang |
author_sort | Han, Haitao |
collection | PubMed |
description | Many gram-negative bacteria use type IV secretion systems to deliver effector molecules to a wide range of target cells. These substrate proteins, which are called type IV secreted effectors (T4SE), manipulate host cell processes during infection, often resulting in severe diseases or even death of the host. Therefore, identification of putative T4SEs has become a very active research topic in bioinformatics due to its vital roles in understanding host-pathogen interactions. PSI-BLAST profiles have been experimentally validated to provide important and discriminatory evolutionary information for various protein classification tasks. In the present study, an accurate computational predictor termed iT4SE-EP was developed for identifying T4SEs by extracting evolutionary features from the position-specific scoring matrix and the position-specific frequency matrix profiles. First, four types of encoding strategies were designed to transform protein sequences into fixed-length feature vectors based on the two profiles. Then, the feature selection technique based on the random forest algorithm was utilized to reduce redundant or irrelevant features without much loss of information. Finally, the optimal features were input into a support vector machine classifier to carry out the prediction of T4SEs. Our experimental results demonstrated that iT4SE-EP outperformed most of existing methods based on the independent dataset test. |
format | Online Article Text |
id | pubmed-8123216 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-81232162021-05-16 iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles Han, Haitao Ding, Chenchen Cheng, Xin Sang, Xiuzhi Liu, Taigang Molecules Article Many gram-negative bacteria use type IV secretion systems to deliver effector molecules to a wide range of target cells. These substrate proteins, which are called type IV secreted effectors (T4SE), manipulate host cell processes during infection, often resulting in severe diseases or even death of the host. Therefore, identification of putative T4SEs has become a very active research topic in bioinformatics due to its vital roles in understanding host-pathogen interactions. PSI-BLAST profiles have been experimentally validated to provide important and discriminatory evolutionary information for various protein classification tasks. In the present study, an accurate computational predictor termed iT4SE-EP was developed for identifying T4SEs by extracting evolutionary features from the position-specific scoring matrix and the position-specific frequency matrix profiles. First, four types of encoding strategies were designed to transform protein sequences into fixed-length feature vectors based on the two profiles. Then, the feature selection technique based on the random forest algorithm was utilized to reduce redundant or irrelevant features without much loss of information. Finally, the optimal features were input into a support vector machine classifier to carry out the prediction of T4SEs. Our experimental results demonstrated that iT4SE-EP outperformed most of existing methods based on the independent dataset test. MDPI 2021-04-24 /pmc/articles/PMC8123216/ /pubmed/33923273 http://dx.doi.org/10.3390/molecules26092487 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Han, Haitao Ding, Chenchen Cheng, Xin Sang, Xiuzhi Liu, Taigang iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles |
title | iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles |
title_full | iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles |
title_fullStr | iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles |
title_full_unstemmed | iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles |
title_short | iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles |
title_sort | it4se-ep: accurate identification of bacterial type iv secreted effectors by exploring evolutionary features from two psi-blast profiles |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8123216/ https://www.ncbi.nlm.nih.gov/pubmed/33923273 http://dx.doi.org/10.3390/molecules26092487 |
work_keys_str_mv | AT hanhaitao it4seepaccurateidentificationofbacterialtypeivsecretedeffectorsbyexploringevolutionaryfeaturesfromtwopsiblastprofiles AT dingchenchen it4seepaccurateidentificationofbacterialtypeivsecretedeffectorsbyexploringevolutionaryfeaturesfromtwopsiblastprofiles AT chengxin it4seepaccurateidentificationofbacterialtypeivsecretedeffectorsbyexploringevolutionaryfeaturesfromtwopsiblastprofiles AT sangxiuzhi it4seepaccurateidentificationofbacterialtypeivsecretedeffectorsbyexploringevolutionaryfeaturesfromtwopsiblastprofiles AT liutaigang it4seepaccurateidentificationofbacterialtypeivsecretedeffectorsbyexploringevolutionaryfeaturesfromtwopsiblastprofiles |