Cargando…

Accelerating the Discovery of Anticancer Peptides through Deep Forest Architecture with Deep Graphical Representation

Cancer is one of the leading diseases threatening human life and health worldwide. Peptide-based therapies have attracted much attention in recent years. Therefore, the precise prediction of anticancer peptides (ACPs) is crucial for discovering and designing novel cancer treatments. In this study, w...

Descripción completa

Detalles Bibliográficos
Autores principales: Yao, Lantian, Li, Wenshuo, Zhang, Yuntian, Deng, Junyang, Pang, Yuxuan, Huang, Yixian, Chung, Chia-Ru, Yu, Jinhan, Chiang, Ying-Chih, Lee, Tzong-Yi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10001941/
https://www.ncbi.nlm.nih.gov/pubmed/36901759
http://dx.doi.org/10.3390/ijms24054328
_version_ 1784904267823316992
author Yao, Lantian
Li, Wenshuo
Zhang, Yuntian
Deng, Junyang
Pang, Yuxuan
Huang, Yixian
Chung, Chia-Ru
Yu, Jinhan
Chiang, Ying-Chih
Lee, Tzong-Yi
author_facet Yao, Lantian
Li, Wenshuo
Zhang, Yuntian
Deng, Junyang
Pang, Yuxuan
Huang, Yixian
Chung, Chia-Ru
Yu, Jinhan
Chiang, Ying-Chih
Lee, Tzong-Yi
author_sort Yao, Lantian
collection PubMed
description Cancer is one of the leading diseases threatening human life and health worldwide. Peptide-based therapies have attracted much attention in recent years. Therefore, the precise prediction of anticancer peptides (ACPs) is crucial for discovering and designing novel cancer treatments. In this study, we proposed a novel machine learning framework (GRDF) that incorporates deep graphical representation and deep forest architecture for identifying ACPs. Specifically, GRDF extracts graphical features based on the physicochemical properties of peptides and integrates their evolutionary information along with binary profiles for constructing models. Moreover, we employ the deep forest algorithm, which adopts a layer-by-layer cascade architecture similar to deep neural networks, enabling excellent performance on small datasets but without complicated tuning of hyperparameters. The experiment shows GRDF exhibits state-of-the-art performance on two elaborate datasets (Set 1 and Set 2), achieving 77.12% accuracy and 77.54% F1-score on Set 1, as well as 94.10% accuracy and 94.15% F1-score on Set 2, exceeding existing ACP prediction methods. Our models exhibit greater robustness than the baseline algorithms commonly used for other sequence analysis tasks. In addition, GRDF is well-interpretable, enabling researchers to better understand the features of peptide sequences. The promising results demonstrate that GRDF is remarkably effective in identifying ACPs. Therefore, the framework presented in this study could assist researchers in facilitating the discovery of anticancer peptides and contribute to developing novel cancer treatments.
format Online
Article
Text
id pubmed-10001941
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100019412023-03-11 Accelerating the Discovery of Anticancer Peptides through Deep Forest Architecture with Deep Graphical Representation Yao, Lantian Li, Wenshuo Zhang, Yuntian Deng, Junyang Pang, Yuxuan Huang, Yixian Chung, Chia-Ru Yu, Jinhan Chiang, Ying-Chih Lee, Tzong-Yi Int J Mol Sci Article Cancer is one of the leading diseases threatening human life and health worldwide. Peptide-based therapies have attracted much attention in recent years. Therefore, the precise prediction of anticancer peptides (ACPs) is crucial for discovering and designing novel cancer treatments. In this study, we proposed a novel machine learning framework (GRDF) that incorporates deep graphical representation and deep forest architecture for identifying ACPs. Specifically, GRDF extracts graphical features based on the physicochemical properties of peptides and integrates their evolutionary information along with binary profiles for constructing models. Moreover, we employ the deep forest algorithm, which adopts a layer-by-layer cascade architecture similar to deep neural networks, enabling excellent performance on small datasets but without complicated tuning of hyperparameters. The experiment shows GRDF exhibits state-of-the-art performance on two elaborate datasets (Set 1 and Set 2), achieving 77.12% accuracy and 77.54% F1-score on Set 1, as well as 94.10% accuracy and 94.15% F1-score on Set 2, exceeding existing ACP prediction methods. Our models exhibit greater robustness than the baseline algorithms commonly used for other sequence analysis tasks. In addition, GRDF is well-interpretable, enabling researchers to better understand the features of peptide sequences. The promising results demonstrate that GRDF is remarkably effective in identifying ACPs. Therefore, the framework presented in this study could assist researchers in facilitating the discovery of anticancer peptides and contribute to developing novel cancer treatments. MDPI 2023-02-21 /pmc/articles/PMC10001941/ /pubmed/36901759 http://dx.doi.org/10.3390/ijms24054328 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Yao, Lantian
Li, Wenshuo
Zhang, Yuntian
Deng, Junyang
Pang, Yuxuan
Huang, Yixian
Chung, Chia-Ru
Yu, Jinhan
Chiang, Ying-Chih
Lee, Tzong-Yi
Accelerating the Discovery of Anticancer Peptides through Deep Forest Architecture with Deep Graphical Representation
title Accelerating the Discovery of Anticancer Peptides through Deep Forest Architecture with Deep Graphical Representation
title_full Accelerating the Discovery of Anticancer Peptides through Deep Forest Architecture with Deep Graphical Representation
title_fullStr Accelerating the Discovery of Anticancer Peptides through Deep Forest Architecture with Deep Graphical Representation
title_full_unstemmed Accelerating the Discovery of Anticancer Peptides through Deep Forest Architecture with Deep Graphical Representation
title_short Accelerating the Discovery of Anticancer Peptides through Deep Forest Architecture with Deep Graphical Representation
title_sort accelerating the discovery of anticancer peptides through deep forest architecture with deep graphical representation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10001941/
https://www.ncbi.nlm.nih.gov/pubmed/36901759
http://dx.doi.org/10.3390/ijms24054328
work_keys_str_mv AT yaolantian acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation
AT liwenshuo acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation
AT zhangyuntian acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation
AT dengjunyang acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation
AT pangyuxuan acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation
AT huangyixian acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation
AT chungchiaru acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation
AT yujinhan acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation
AT chiangyingchih acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation
AT leetzongyi acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation