Cargando…

AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees

Mycobacterium tuberculosis is one of the most dangerous pathogens in humans. It acts as an etiological agent of tuberculosis (TB), infecting almost one-third of the world's population. Owing to the high incidence of multidrug-resistant TB and extensively drug-resistant TB, there is an urgent ne...

Descripción completa

Detalles Bibliográficos
Autores principales: Manavalan, Balachandran, Basith, Shaherin, Shin, Tae Hwan, Wei, Leyi, Lee, Gwang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6658830/
https://www.ncbi.nlm.nih.gov/pubmed/31372196
http://dx.doi.org/10.1016/j.csbj.2019.06.024
_version_ 1783439022856601600
author Manavalan, Balachandran
Basith, Shaherin
Shin, Tae Hwan
Wei, Leyi
Lee, Gwang
author_facet Manavalan, Balachandran
Basith, Shaherin
Shin, Tae Hwan
Wei, Leyi
Lee, Gwang
author_sort Manavalan, Balachandran
collection PubMed
description Mycobacterium tuberculosis is one of the most dangerous pathogens in humans. It acts as an etiological agent of tuberculosis (TB), infecting almost one-third of the world's population. Owing to the high incidence of multidrug-resistant TB and extensively drug-resistant TB, there is an urgent need for novel and effective alternative therapies. Peptide-based therapy has several advantages, such as diverse mechanisms of action, low immunogenicity, and selective affinity to bacterial cell envelopes. However, the identification of anti-tubercular peptides (AtbPs) via experimentation is laborious and expensive; hence, the development of an efficient computational method is necessary for the prediction of AtbPs prior to both in vitro and in vivo experiments. To this end, we developed a two-layer machine learning (ML)-based predictor called AtbPpred for the identification of AtbPs. In the first layer, we applied a two-step feature selection procedure and identified the optimal feature set individually for nine different feature encodings, whose corresponding models were developed using extremely randomized tree (ERT). In the second-layer, the predicted probability of AtbPs from the above nine models were considered as input features to ERT and developed the final predictor. AtbPpred respectively achieved average accuracies of 88.3% and 87.3% during cross-validation and an independent evaluation, which were ~8.7% and 10.0% higher than the state-of-the-art method. Furthermore, we established a user-friendly webserver which is currently available at http://thegleelab.org/AtbPpred. We anticipate that this predictor could be useful in the high-throughput prediction of AtbPs and also provide mechanistic insights into its functions.
format Online
Article
Text
id pubmed-6658830
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-66588302019-08-01 AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees Manavalan, Balachandran Basith, Shaherin Shin, Tae Hwan Wei, Leyi Lee, Gwang Comput Struct Biotechnol J Research Article Mycobacterium tuberculosis is one of the most dangerous pathogens in humans. It acts as an etiological agent of tuberculosis (TB), infecting almost one-third of the world's population. Owing to the high incidence of multidrug-resistant TB and extensively drug-resistant TB, there is an urgent need for novel and effective alternative therapies. Peptide-based therapy has several advantages, such as diverse mechanisms of action, low immunogenicity, and selective affinity to bacterial cell envelopes. However, the identification of anti-tubercular peptides (AtbPs) via experimentation is laborious and expensive; hence, the development of an efficient computational method is necessary for the prediction of AtbPs prior to both in vitro and in vivo experiments. To this end, we developed a two-layer machine learning (ML)-based predictor called AtbPpred for the identification of AtbPs. In the first layer, we applied a two-step feature selection procedure and identified the optimal feature set individually for nine different feature encodings, whose corresponding models were developed using extremely randomized tree (ERT). In the second-layer, the predicted probability of AtbPs from the above nine models were considered as input features to ERT and developed the final predictor. AtbPpred respectively achieved average accuracies of 88.3% and 87.3% during cross-validation and an independent evaluation, which were ~8.7% and 10.0% higher than the state-of-the-art method. Furthermore, we established a user-friendly webserver which is currently available at http://thegleelab.org/AtbPpred. We anticipate that this predictor could be useful in the high-throughput prediction of AtbPs and also provide mechanistic insights into its functions. Research Network of Computational and Structural Biotechnology 2019-07-03 /pmc/articles/PMC6658830/ /pubmed/31372196 http://dx.doi.org/10.1016/j.csbj.2019.06.024 Text en © 2019 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Manavalan, Balachandran
Basith, Shaherin
Shin, Tae Hwan
Wei, Leyi
Lee, Gwang
AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees
title AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees
title_full AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees
title_fullStr AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees
title_full_unstemmed AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees
title_short AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees
title_sort atbppred: a robust sequence-based prediction of anti-tubercular peptides using extremely randomized trees
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6658830/
https://www.ncbi.nlm.nih.gov/pubmed/31372196
http://dx.doi.org/10.1016/j.csbj.2019.06.024
work_keys_str_mv AT manavalanbalachandran atbppredarobustsequencebasedpredictionofantitubercularpeptidesusingextremelyrandomizedtrees
AT basithshaherin atbppredarobustsequencebasedpredictionofantitubercularpeptidesusingextremelyrandomizedtrees
AT shintaehwan atbppredarobustsequencebasedpredictionofantitubercularpeptidesusingextremelyrandomizedtrees
AT weileyi atbppredarobustsequencebasedpredictionofantitubercularpeptidesusingextremelyrandomizedtrees
AT leegwang atbppredarobustsequencebasedpredictionofantitubercularpeptidesusingextremelyrandomizedtrees