Cargando…
Accelerating the Discovery of Anticancer Peptides through Deep Forest Architecture with Deep Graphical Representation
Cancer is one of the leading diseases threatening human life and health worldwide. Peptide-based therapies have attracted much attention in recent years. Therefore, the precise prediction of anticancer peptides (ACPs) is crucial for discovering and designing novel cancer treatments. In this study, w...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10001941/ https://www.ncbi.nlm.nih.gov/pubmed/36901759 http://dx.doi.org/10.3390/ijms24054328 |
_version_ | 1784904267823316992 |
---|---|
author | Yao, Lantian Li, Wenshuo Zhang, Yuntian Deng, Junyang Pang, Yuxuan Huang, Yixian Chung, Chia-Ru Yu, Jinhan Chiang, Ying-Chih Lee, Tzong-Yi |
author_facet | Yao, Lantian Li, Wenshuo Zhang, Yuntian Deng, Junyang Pang, Yuxuan Huang, Yixian Chung, Chia-Ru Yu, Jinhan Chiang, Ying-Chih Lee, Tzong-Yi |
author_sort | Yao, Lantian |
collection | PubMed |
description | Cancer is one of the leading diseases threatening human life and health worldwide. Peptide-based therapies have attracted much attention in recent years. Therefore, the precise prediction of anticancer peptides (ACPs) is crucial for discovering and designing novel cancer treatments. In this study, we proposed a novel machine learning framework (GRDF) that incorporates deep graphical representation and deep forest architecture for identifying ACPs. Specifically, GRDF extracts graphical features based on the physicochemical properties of peptides and integrates their evolutionary information along with binary profiles for constructing models. Moreover, we employ the deep forest algorithm, which adopts a layer-by-layer cascade architecture similar to deep neural networks, enabling excellent performance on small datasets but without complicated tuning of hyperparameters. The experiment shows GRDF exhibits state-of-the-art performance on two elaborate datasets (Set 1 and Set 2), achieving 77.12% accuracy and 77.54% F1-score on Set 1, as well as 94.10% accuracy and 94.15% F1-score on Set 2, exceeding existing ACP prediction methods. Our models exhibit greater robustness than the baseline algorithms commonly used for other sequence analysis tasks. In addition, GRDF is well-interpretable, enabling researchers to better understand the features of peptide sequences. The promising results demonstrate that GRDF is remarkably effective in identifying ACPs. Therefore, the framework presented in this study could assist researchers in facilitating the discovery of anticancer peptides and contribute to developing novel cancer treatments. |
format | Online Article Text |
id | pubmed-10001941 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-100019412023-03-11 Accelerating the Discovery of Anticancer Peptides through Deep Forest Architecture with Deep Graphical Representation Yao, Lantian Li, Wenshuo Zhang, Yuntian Deng, Junyang Pang, Yuxuan Huang, Yixian Chung, Chia-Ru Yu, Jinhan Chiang, Ying-Chih Lee, Tzong-Yi Int J Mol Sci Article Cancer is one of the leading diseases threatening human life and health worldwide. Peptide-based therapies have attracted much attention in recent years. Therefore, the precise prediction of anticancer peptides (ACPs) is crucial for discovering and designing novel cancer treatments. In this study, we proposed a novel machine learning framework (GRDF) that incorporates deep graphical representation and deep forest architecture for identifying ACPs. Specifically, GRDF extracts graphical features based on the physicochemical properties of peptides and integrates their evolutionary information along with binary profiles for constructing models. Moreover, we employ the deep forest algorithm, which adopts a layer-by-layer cascade architecture similar to deep neural networks, enabling excellent performance on small datasets but without complicated tuning of hyperparameters. The experiment shows GRDF exhibits state-of-the-art performance on two elaborate datasets (Set 1 and Set 2), achieving 77.12% accuracy and 77.54% F1-score on Set 1, as well as 94.10% accuracy and 94.15% F1-score on Set 2, exceeding existing ACP prediction methods. Our models exhibit greater robustness than the baseline algorithms commonly used for other sequence analysis tasks. In addition, GRDF is well-interpretable, enabling researchers to better understand the features of peptide sequences. The promising results demonstrate that GRDF is remarkably effective in identifying ACPs. Therefore, the framework presented in this study could assist researchers in facilitating the discovery of anticancer peptides and contribute to developing novel cancer treatments. MDPI 2023-02-21 /pmc/articles/PMC10001941/ /pubmed/36901759 http://dx.doi.org/10.3390/ijms24054328 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Yao, Lantian Li, Wenshuo Zhang, Yuntian Deng, Junyang Pang, Yuxuan Huang, Yixian Chung, Chia-Ru Yu, Jinhan Chiang, Ying-Chih Lee, Tzong-Yi Accelerating the Discovery of Anticancer Peptides through Deep Forest Architecture with Deep Graphical Representation |
title | Accelerating the Discovery of Anticancer Peptides through Deep Forest Architecture with Deep Graphical Representation |
title_full | Accelerating the Discovery of Anticancer Peptides through Deep Forest Architecture with Deep Graphical Representation |
title_fullStr | Accelerating the Discovery of Anticancer Peptides through Deep Forest Architecture with Deep Graphical Representation |
title_full_unstemmed | Accelerating the Discovery of Anticancer Peptides through Deep Forest Architecture with Deep Graphical Representation |
title_short | Accelerating the Discovery of Anticancer Peptides through Deep Forest Architecture with Deep Graphical Representation |
title_sort | accelerating the discovery of anticancer peptides through deep forest architecture with deep graphical representation |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10001941/ https://www.ncbi.nlm.nih.gov/pubmed/36901759 http://dx.doi.org/10.3390/ijms24054328 |
work_keys_str_mv | AT yaolantian acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation AT liwenshuo acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation AT zhangyuntian acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation AT dengjunyang acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation AT pangyuxuan acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation AT huangyixian acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation AT chungchiaru acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation AT yujinhan acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation AT chiangyingchih acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation AT leetzongyi acceleratingthediscoveryofanticancerpeptidesthroughdeepforestarchitecturewithdeepgraphicalrepresentation |