Cargando…
StackTHPred: Identifying Tumor-Homing Peptides through GBDT-Based Feature Selection with Stacking Ensemble Architecture
One of the major challenges in cancer therapy lies in the limited targeting specificity exhibited by existing anti-cancer drugs. Tumor-homing peptides (THPs) have emerged as a promising solution to this issue, due to their capability to specifically bind to and accumulate in tumor tissues while mini...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10298859/ https://www.ncbi.nlm.nih.gov/pubmed/37373494 http://dx.doi.org/10.3390/ijms241210348 |
_version_ | 1785064220516155392 |
---|---|
author | Guan, Jiahui Yao, Lantian Chung, Chia-Ru Chiang, Ying-Chih Lee, Tzong-Yi |
author_facet | Guan, Jiahui Yao, Lantian Chung, Chia-Ru Chiang, Ying-Chih Lee, Tzong-Yi |
author_sort | Guan, Jiahui |
collection | PubMed |
description | One of the major challenges in cancer therapy lies in the limited targeting specificity exhibited by existing anti-cancer drugs. Tumor-homing peptides (THPs) have emerged as a promising solution to this issue, due to their capability to specifically bind to and accumulate in tumor tissues while minimally impacting healthy tissues. THPs are short oligopeptides that offer a superior biological safety profile, with minimal antigenicity, and faster incorporation rates into target cells/tissues. However, identifying THPs experimentally, using methods such as phage display or in vivo screening, is a complex, time-consuming task, hence the need for computational methods. In this study, we proposed StackTHPred, a novel machine learning-based framework that predicts THPs using optimal features and a stacking architecture. With an effective feature selection algorithm and three tree-based machine learning algorithms, StackTHPred has demonstrated advanced performance, surpassing existing THP prediction methods. It achieved an accuracy of 0.915 and a 0.831 Matthews Correlation Coefficient (MCC) score on the main dataset, and an accuracy of 0.883 and a 0.767 MCC score on the small dataset. StackTHPred also offers favorable interpretability, enabling researchers to better understand the intrinsic characteristics of THPs. Overall, StackTHPred is beneficial for both the exploration and identification of THPs and facilitates the development of innovative cancer therapies. |
format | Online Article Text |
id | pubmed-10298859 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-102988592023-06-28 StackTHPred: Identifying Tumor-Homing Peptides through GBDT-Based Feature Selection with Stacking Ensemble Architecture Guan, Jiahui Yao, Lantian Chung, Chia-Ru Chiang, Ying-Chih Lee, Tzong-Yi Int J Mol Sci Article One of the major challenges in cancer therapy lies in the limited targeting specificity exhibited by existing anti-cancer drugs. Tumor-homing peptides (THPs) have emerged as a promising solution to this issue, due to their capability to specifically bind to and accumulate in tumor tissues while minimally impacting healthy tissues. THPs are short oligopeptides that offer a superior biological safety profile, with minimal antigenicity, and faster incorporation rates into target cells/tissues. However, identifying THPs experimentally, using methods such as phage display or in vivo screening, is a complex, time-consuming task, hence the need for computational methods. In this study, we proposed StackTHPred, a novel machine learning-based framework that predicts THPs using optimal features and a stacking architecture. With an effective feature selection algorithm and three tree-based machine learning algorithms, StackTHPred has demonstrated advanced performance, surpassing existing THP prediction methods. It achieved an accuracy of 0.915 and a 0.831 Matthews Correlation Coefficient (MCC) score on the main dataset, and an accuracy of 0.883 and a 0.767 MCC score on the small dataset. StackTHPred also offers favorable interpretability, enabling researchers to better understand the intrinsic characteristics of THPs. Overall, StackTHPred is beneficial for both the exploration and identification of THPs and facilitates the development of innovative cancer therapies. MDPI 2023-06-19 /pmc/articles/PMC10298859/ /pubmed/37373494 http://dx.doi.org/10.3390/ijms241210348 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Guan, Jiahui Yao, Lantian Chung, Chia-Ru Chiang, Ying-Chih Lee, Tzong-Yi StackTHPred: Identifying Tumor-Homing Peptides through GBDT-Based Feature Selection with Stacking Ensemble Architecture |
title | StackTHPred: Identifying Tumor-Homing Peptides through GBDT-Based Feature Selection with Stacking Ensemble Architecture |
title_full | StackTHPred: Identifying Tumor-Homing Peptides through GBDT-Based Feature Selection with Stacking Ensemble Architecture |
title_fullStr | StackTHPred: Identifying Tumor-Homing Peptides through GBDT-Based Feature Selection with Stacking Ensemble Architecture |
title_full_unstemmed | StackTHPred: Identifying Tumor-Homing Peptides through GBDT-Based Feature Selection with Stacking Ensemble Architecture |
title_short | StackTHPred: Identifying Tumor-Homing Peptides through GBDT-Based Feature Selection with Stacking Ensemble Architecture |
title_sort | stackthpred: identifying tumor-homing peptides through gbdt-based feature selection with stacking ensemble architecture |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10298859/ https://www.ncbi.nlm.nih.gov/pubmed/37373494 http://dx.doi.org/10.3390/ijms241210348 |
work_keys_str_mv | AT guanjiahui stackthpredidentifyingtumorhomingpeptidesthroughgbdtbasedfeatureselectionwithstackingensemblearchitecture AT yaolantian stackthpredidentifyingtumorhomingpeptidesthroughgbdtbasedfeatureselectionwithstackingensemblearchitecture AT chungchiaru stackthpredidentifyingtumorhomingpeptidesthroughgbdtbasedfeatureselectionwithstackingensemblearchitecture AT chiangyingchih stackthpredidentifyingtumorhomingpeptidesthroughgbdtbasedfeatureselectionwithstackingensemblearchitecture AT leetzongyi stackthpredidentifyingtumorhomingpeptidesthroughgbdtbasedfeatureselectionwithstackingensemblearchitecture |