Cargando…

iT3SE-PX: Identification of Bacterial Type III Secreted Effectors Using PSSM Profiles and XGBoost Feature Selection

Identification of bacterial type III secreted effectors (T3SEs) has become a popular research topic in the field of bioinformatics due to its crucial role in understanding host-pathogen interaction and developing better therapeutic targets against the pathogens. However, the recognition of all effec...

Descripción completa

Detalles Bibliográficos
Autores principales: Ding, Chenchen, Han, Haitao, Li, Qianyue, Yang, Xiaoxia, Liu, Taigang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7806399/
https://www.ncbi.nlm.nih.gov/pubmed/33505516
http://dx.doi.org/10.1155/2021/6690299
_version_ 1783636515608330240
author Ding, Chenchen
Han, Haitao
Li, Qianyue
Yang, Xiaoxia
Liu, Taigang
author_facet Ding, Chenchen
Han, Haitao
Li, Qianyue
Yang, Xiaoxia
Liu, Taigang
author_sort Ding, Chenchen
collection PubMed
description Identification of bacterial type III secreted effectors (T3SEs) has become a popular research topic in the field of bioinformatics due to its crucial role in understanding host-pathogen interaction and developing better therapeutic targets against the pathogens. However, the recognition of all effector proteins by using traditional experimental approaches is often time-consuming and laborious. Therefore, development of computational methods to accurately predict putative novel effectors is important in reducing the number of biological experiments for validation. In this study, we proposed a method, called iT3SE-PX, to identify T3SEs solely based on protein sequences. First, three kinds of features were extracted from the position-specific scoring matrix (PSSM) profiles to help train a machine learning (ML) model. Then, the extreme gradient boosting (XGBoost) algorithm was performed to rank these features based on their classification ability. Finally, the optimal features were selected as inputs to a support vector machine (SVM) classifier to predict T3SEs. Based on the two benchmark datasets, we conducted a 100-time randomized 5-fold cross validation (CV) and an independent test, respectively. The experimental results demonstrated that the proposed method achieved superior performance compared to most of the existing methods and could serve as a useful tool for identifying putative T3SEs, given only the sequence information.
format Online
Article
Text
id pubmed-7806399
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-78063992021-01-26 iT3SE-PX: Identification of Bacterial Type III Secreted Effectors Using PSSM Profiles and XGBoost Feature Selection Ding, Chenchen Han, Haitao Li, Qianyue Yang, Xiaoxia Liu, Taigang Comput Math Methods Med Research Article Identification of bacterial type III secreted effectors (T3SEs) has become a popular research topic in the field of bioinformatics due to its crucial role in understanding host-pathogen interaction and developing better therapeutic targets against the pathogens. However, the recognition of all effector proteins by using traditional experimental approaches is often time-consuming and laborious. Therefore, development of computational methods to accurately predict putative novel effectors is important in reducing the number of biological experiments for validation. In this study, we proposed a method, called iT3SE-PX, to identify T3SEs solely based on protein sequences. First, three kinds of features were extracted from the position-specific scoring matrix (PSSM) profiles to help train a machine learning (ML) model. Then, the extreme gradient boosting (XGBoost) algorithm was performed to rank these features based on their classification ability. Finally, the optimal features were selected as inputs to a support vector machine (SVM) classifier to predict T3SEs. Based on the two benchmark datasets, we conducted a 100-time randomized 5-fold cross validation (CV) and an independent test, respectively. The experimental results demonstrated that the proposed method achieved superior performance compared to most of the existing methods and could serve as a useful tool for identifying putative T3SEs, given only the sequence information. Hindawi 2021-01-06 /pmc/articles/PMC7806399/ /pubmed/33505516 http://dx.doi.org/10.1155/2021/6690299 Text en Copyright © 2021 Chenchen Ding et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Ding, Chenchen
Han, Haitao
Li, Qianyue
Yang, Xiaoxia
Liu, Taigang
iT3SE-PX: Identification of Bacterial Type III Secreted Effectors Using PSSM Profiles and XGBoost Feature Selection
title iT3SE-PX: Identification of Bacterial Type III Secreted Effectors Using PSSM Profiles and XGBoost Feature Selection
title_full iT3SE-PX: Identification of Bacterial Type III Secreted Effectors Using PSSM Profiles and XGBoost Feature Selection
title_fullStr iT3SE-PX: Identification of Bacterial Type III Secreted Effectors Using PSSM Profiles and XGBoost Feature Selection
title_full_unstemmed iT3SE-PX: Identification of Bacterial Type III Secreted Effectors Using PSSM Profiles and XGBoost Feature Selection
title_short iT3SE-PX: Identification of Bacterial Type III Secreted Effectors Using PSSM Profiles and XGBoost Feature Selection
title_sort it3se-px: identification of bacterial type iii secreted effectors using pssm profiles and xgboost feature selection
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7806399/
https://www.ncbi.nlm.nih.gov/pubmed/33505516
http://dx.doi.org/10.1155/2021/6690299
work_keys_str_mv AT dingchenchen it3sepxidentificationofbacterialtypeiiisecretedeffectorsusingpssmprofilesandxgboostfeatureselection
AT hanhaitao it3sepxidentificationofbacterialtypeiiisecretedeffectorsusingpssmprofilesandxgboostfeatureselection
AT liqianyue it3sepxidentificationofbacterialtypeiiisecretedeffectorsusingpssmprofilesandxgboostfeatureselection
AT yangxiaoxia it3sepxidentificationofbacterialtypeiiisecretedeffectorsusingpssmprofilesandxgboostfeatureselection
AT liutaigang it3sepxidentificationofbacterialtypeiiisecretedeffectorsusingpssmprofilesandxgboostfeatureselection