Cargando…

StackTHPred: Identifying Tumor-Homing Peptides through GBDT-Based Feature Selection with Stacking Ensemble Architecture

One of the major challenges in cancer therapy lies in the limited targeting specificity exhibited by existing anti-cancer drugs. Tumor-homing peptides (THPs) have emerged as a promising solution to this issue, due to their capability to specifically bind to and accumulate in tumor tissues while mini...

Descripción completa

Detalles Bibliográficos
Autores principales: Guan, Jiahui, Yao, Lantian, Chung, Chia-Ru, Chiang, Ying-Chih, Lee, Tzong-Yi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10298859/
https://www.ncbi.nlm.nih.gov/pubmed/37373494
http://dx.doi.org/10.3390/ijms241210348
_version_ 1785064220516155392
author Guan, Jiahui
Yao, Lantian
Chung, Chia-Ru
Chiang, Ying-Chih
Lee, Tzong-Yi
author_facet Guan, Jiahui
Yao, Lantian
Chung, Chia-Ru
Chiang, Ying-Chih
Lee, Tzong-Yi
author_sort Guan, Jiahui
collection PubMed
description One of the major challenges in cancer therapy lies in the limited targeting specificity exhibited by existing anti-cancer drugs. Tumor-homing peptides (THPs) have emerged as a promising solution to this issue, due to their capability to specifically bind to and accumulate in tumor tissues while minimally impacting healthy tissues. THPs are short oligopeptides that offer a superior biological safety profile, with minimal antigenicity, and faster incorporation rates into target cells/tissues. However, identifying THPs experimentally, using methods such as phage display or in vivo screening, is a complex, time-consuming task, hence the need for computational methods. In this study, we proposed StackTHPred, a novel machine learning-based framework that predicts THPs using optimal features and a stacking architecture. With an effective feature selection algorithm and three tree-based machine learning algorithms, StackTHPred has demonstrated advanced performance, surpassing existing THP prediction methods. It achieved an accuracy of 0.915 and a 0.831 Matthews Correlation Coefficient (MCC) score on the main dataset, and an accuracy of 0.883 and a 0.767 MCC score on the small dataset. StackTHPred also offers favorable interpretability, enabling researchers to better understand the intrinsic characteristics of THPs. Overall, StackTHPred is beneficial for both the exploration and identification of THPs and facilitates the development of innovative cancer therapies.
format Online
Article
Text
id pubmed-10298859
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-102988592023-06-28 StackTHPred: Identifying Tumor-Homing Peptides through GBDT-Based Feature Selection with Stacking Ensemble Architecture Guan, Jiahui Yao, Lantian Chung, Chia-Ru Chiang, Ying-Chih Lee, Tzong-Yi Int J Mol Sci Article One of the major challenges in cancer therapy lies in the limited targeting specificity exhibited by existing anti-cancer drugs. Tumor-homing peptides (THPs) have emerged as a promising solution to this issue, due to their capability to specifically bind to and accumulate in tumor tissues while minimally impacting healthy tissues. THPs are short oligopeptides that offer a superior biological safety profile, with minimal antigenicity, and faster incorporation rates into target cells/tissues. However, identifying THPs experimentally, using methods such as phage display or in vivo screening, is a complex, time-consuming task, hence the need for computational methods. In this study, we proposed StackTHPred, a novel machine learning-based framework that predicts THPs using optimal features and a stacking architecture. With an effective feature selection algorithm and three tree-based machine learning algorithms, StackTHPred has demonstrated advanced performance, surpassing existing THP prediction methods. It achieved an accuracy of 0.915 and a 0.831 Matthews Correlation Coefficient (MCC) score on the main dataset, and an accuracy of 0.883 and a 0.767 MCC score on the small dataset. StackTHPred also offers favorable interpretability, enabling researchers to better understand the intrinsic characteristics of THPs. Overall, StackTHPred is beneficial for both the exploration and identification of THPs and facilitates the development of innovative cancer therapies. MDPI 2023-06-19 /pmc/articles/PMC10298859/ /pubmed/37373494 http://dx.doi.org/10.3390/ijms241210348 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Guan, Jiahui
Yao, Lantian
Chung, Chia-Ru
Chiang, Ying-Chih
Lee, Tzong-Yi
StackTHPred: Identifying Tumor-Homing Peptides through GBDT-Based Feature Selection with Stacking Ensemble Architecture
title StackTHPred: Identifying Tumor-Homing Peptides through GBDT-Based Feature Selection with Stacking Ensemble Architecture
title_full StackTHPred: Identifying Tumor-Homing Peptides through GBDT-Based Feature Selection with Stacking Ensemble Architecture
title_fullStr StackTHPred: Identifying Tumor-Homing Peptides through GBDT-Based Feature Selection with Stacking Ensemble Architecture
title_full_unstemmed StackTHPred: Identifying Tumor-Homing Peptides through GBDT-Based Feature Selection with Stacking Ensemble Architecture
title_short StackTHPred: Identifying Tumor-Homing Peptides through GBDT-Based Feature Selection with Stacking Ensemble Architecture
title_sort stackthpred: identifying tumor-homing peptides through gbdt-based feature selection with stacking ensemble architecture
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10298859/
https://www.ncbi.nlm.nih.gov/pubmed/37373494
http://dx.doi.org/10.3390/ijms241210348
work_keys_str_mv AT guanjiahui stackthpredidentifyingtumorhomingpeptidesthroughgbdtbasedfeatureselectionwithstackingensemblearchitecture
AT yaolantian stackthpredidentifyingtumorhomingpeptidesthroughgbdtbasedfeatureselectionwithstackingensemblearchitecture
AT chungchiaru stackthpredidentifyingtumorhomingpeptidesthroughgbdtbasedfeatureselectionwithstackingensemblearchitecture
AT chiangyingchih stackthpredidentifyingtumorhomingpeptidesthroughgbdtbasedfeatureselectionwithstackingensemblearchitecture
AT leetzongyi stackthpredidentifyingtumorhomingpeptidesthroughgbdtbasedfeatureselectionwithstackingensemblearchitecture