Cargando…
AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees
Mycobacterium tuberculosis is one of the most dangerous pathogens in humans. It acts as an etiological agent of tuberculosis (TB), infecting almost one-third of the world's population. Owing to the high incidence of multidrug-resistant TB and extensively drug-resistant TB, there is an urgent ne...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6658830/ https://www.ncbi.nlm.nih.gov/pubmed/31372196 http://dx.doi.org/10.1016/j.csbj.2019.06.024 |
_version_ | 1783439022856601600 |
---|---|
author | Manavalan, Balachandran Basith, Shaherin Shin, Tae Hwan Wei, Leyi Lee, Gwang |
author_facet | Manavalan, Balachandran Basith, Shaherin Shin, Tae Hwan Wei, Leyi Lee, Gwang |
author_sort | Manavalan, Balachandran |
collection | PubMed |
description | Mycobacterium tuberculosis is one of the most dangerous pathogens in humans. It acts as an etiological agent of tuberculosis (TB), infecting almost one-third of the world's population. Owing to the high incidence of multidrug-resistant TB and extensively drug-resistant TB, there is an urgent need for novel and effective alternative therapies. Peptide-based therapy has several advantages, such as diverse mechanisms of action, low immunogenicity, and selective affinity to bacterial cell envelopes. However, the identification of anti-tubercular peptides (AtbPs) via experimentation is laborious and expensive; hence, the development of an efficient computational method is necessary for the prediction of AtbPs prior to both in vitro and in vivo experiments. To this end, we developed a two-layer machine learning (ML)-based predictor called AtbPpred for the identification of AtbPs. In the first layer, we applied a two-step feature selection procedure and identified the optimal feature set individually for nine different feature encodings, whose corresponding models were developed using extremely randomized tree (ERT). In the second-layer, the predicted probability of AtbPs from the above nine models were considered as input features to ERT and developed the final predictor. AtbPpred respectively achieved average accuracies of 88.3% and 87.3% during cross-validation and an independent evaluation, which were ~8.7% and 10.0% higher than the state-of-the-art method. Furthermore, we established a user-friendly webserver which is currently available at http://thegleelab.org/AtbPpred. We anticipate that this predictor could be useful in the high-throughput prediction of AtbPs and also provide mechanistic insights into its functions. |
format | Online Article Text |
id | pubmed-6658830 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-66588302019-08-01 AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees Manavalan, Balachandran Basith, Shaherin Shin, Tae Hwan Wei, Leyi Lee, Gwang Comput Struct Biotechnol J Research Article Mycobacterium tuberculosis is one of the most dangerous pathogens in humans. It acts as an etiological agent of tuberculosis (TB), infecting almost one-third of the world's population. Owing to the high incidence of multidrug-resistant TB and extensively drug-resistant TB, there is an urgent need for novel and effective alternative therapies. Peptide-based therapy has several advantages, such as diverse mechanisms of action, low immunogenicity, and selective affinity to bacterial cell envelopes. However, the identification of anti-tubercular peptides (AtbPs) via experimentation is laborious and expensive; hence, the development of an efficient computational method is necessary for the prediction of AtbPs prior to both in vitro and in vivo experiments. To this end, we developed a two-layer machine learning (ML)-based predictor called AtbPpred for the identification of AtbPs. In the first layer, we applied a two-step feature selection procedure and identified the optimal feature set individually for nine different feature encodings, whose corresponding models were developed using extremely randomized tree (ERT). In the second-layer, the predicted probability of AtbPs from the above nine models were considered as input features to ERT and developed the final predictor. AtbPpred respectively achieved average accuracies of 88.3% and 87.3% during cross-validation and an independent evaluation, which were ~8.7% and 10.0% higher than the state-of-the-art method. Furthermore, we established a user-friendly webserver which is currently available at http://thegleelab.org/AtbPpred. We anticipate that this predictor could be useful in the high-throughput prediction of AtbPs and also provide mechanistic insights into its functions. Research Network of Computational and Structural Biotechnology 2019-07-03 /pmc/articles/PMC6658830/ /pubmed/31372196 http://dx.doi.org/10.1016/j.csbj.2019.06.024 Text en © 2019 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research Article Manavalan, Balachandran Basith, Shaherin Shin, Tae Hwan Wei, Leyi Lee, Gwang AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees |
title | AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees |
title_full | AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees |
title_fullStr | AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees |
title_full_unstemmed | AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees |
title_short | AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees |
title_sort | atbppred: a robust sequence-based prediction of anti-tubercular peptides using extremely randomized trees |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6658830/ https://www.ncbi.nlm.nih.gov/pubmed/31372196 http://dx.doi.org/10.1016/j.csbj.2019.06.024 |
work_keys_str_mv | AT manavalanbalachandran atbppredarobustsequencebasedpredictionofantitubercularpeptidesusingextremelyrandomizedtrees AT basithshaherin atbppredarobustsequencebasedpredictionofantitubercularpeptidesusingextremelyrandomizedtrees AT shintaehwan atbppredarobustsequencebasedpredictionofantitubercularpeptidesusingextremelyrandomizedtrees AT weileyi atbppredarobustsequencebasedpredictionofantitubercularpeptidesusingextremelyrandomizedtrees AT leegwang atbppredarobustsequencebasedpredictionofantitubercularpeptidesusingextremelyrandomizedtrees |