Cargando…

ACP-BC: A Model for Accurate Identification of Anticancer Peptides Based on Fusion Features of Bidirectional Long Short-Term Memory and Chemically Derived Information

Anticancer peptides (ACPs) have been proven to possess potent anticancer activities. Although computational methods have emerged for rapid ACPs identification, their accuracy still needs improvement. In this study, we propose a model called ACP-BC, a three-channel end-to-end model that utilizes vari...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Mingwei, Hu, Haoyuan, Pang, Wei, Zhou, You
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10607064/
https://www.ncbi.nlm.nih.gov/pubmed/37895128
http://dx.doi.org/10.3390/ijms242015447
_version_ 1785127458186461184
author Sun, Mingwei
Hu, Haoyuan
Pang, Wei
Zhou, You
author_facet Sun, Mingwei
Hu, Haoyuan
Pang, Wei
Zhou, You
author_sort Sun, Mingwei
collection PubMed
description Anticancer peptides (ACPs) have been proven to possess potent anticancer activities. Although computational methods have emerged for rapid ACPs identification, their accuracy still needs improvement. In this study, we propose a model called ACP-BC, a three-channel end-to-end model that utilizes various combinations of data augmentation techniques. In the first channel, features are extracted from the raw sequence using a bidirectional long short-term memory network. In the second channel, the entire sequence is converted into a chemical molecular formula, which is further simplified using Simplified Molecular Input Line Entry System notation to obtain deep abstract features through a bidirectional encoder representation transformer (BERT). In the third channel, we manually selected four effective features according to dipeptide composition, binary profile feature, k-mer sparse matrix, and pseudo amino acid composition. Notably, the application of chemical BERT in predicting ACPs is novel and successfully integrated into our model. To validate the performance of our model, we selected two benchmark datasets, ACPs740 and ACPs240. ACP-BC achieved prediction accuracy with 87% and 90% on these two datasets, respectively, representing improvements of 1.3% and 7% compared to existing state-of-the-art methods on these datasets. Therefore, systematic comparative experiments have shown that the ACP-BC can effectively identify anticancer peptides.
format Online
Article
Text
id pubmed-10607064
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106070642023-10-28 ACP-BC: A Model for Accurate Identification of Anticancer Peptides Based on Fusion Features of Bidirectional Long Short-Term Memory and Chemically Derived Information Sun, Mingwei Hu, Haoyuan Pang, Wei Zhou, You Int J Mol Sci Article Anticancer peptides (ACPs) have been proven to possess potent anticancer activities. Although computational methods have emerged for rapid ACPs identification, their accuracy still needs improvement. In this study, we propose a model called ACP-BC, a three-channel end-to-end model that utilizes various combinations of data augmentation techniques. In the first channel, features are extracted from the raw sequence using a bidirectional long short-term memory network. In the second channel, the entire sequence is converted into a chemical molecular formula, which is further simplified using Simplified Molecular Input Line Entry System notation to obtain deep abstract features through a bidirectional encoder representation transformer (BERT). In the third channel, we manually selected four effective features according to dipeptide composition, binary profile feature, k-mer sparse matrix, and pseudo amino acid composition. Notably, the application of chemical BERT in predicting ACPs is novel and successfully integrated into our model. To validate the performance of our model, we selected two benchmark datasets, ACPs740 and ACPs240. ACP-BC achieved prediction accuracy with 87% and 90% on these two datasets, respectively, representing improvements of 1.3% and 7% compared to existing state-of-the-art methods on these datasets. Therefore, systematic comparative experiments have shown that the ACP-BC can effectively identify anticancer peptides. MDPI 2023-10-22 /pmc/articles/PMC10607064/ /pubmed/37895128 http://dx.doi.org/10.3390/ijms242015447 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Sun, Mingwei
Hu, Haoyuan
Pang, Wei
Zhou, You
ACP-BC: A Model for Accurate Identification of Anticancer Peptides Based on Fusion Features of Bidirectional Long Short-Term Memory and Chemically Derived Information
title ACP-BC: A Model for Accurate Identification of Anticancer Peptides Based on Fusion Features of Bidirectional Long Short-Term Memory and Chemically Derived Information
title_full ACP-BC: A Model for Accurate Identification of Anticancer Peptides Based on Fusion Features of Bidirectional Long Short-Term Memory and Chemically Derived Information
title_fullStr ACP-BC: A Model for Accurate Identification of Anticancer Peptides Based on Fusion Features of Bidirectional Long Short-Term Memory and Chemically Derived Information
title_full_unstemmed ACP-BC: A Model for Accurate Identification of Anticancer Peptides Based on Fusion Features of Bidirectional Long Short-Term Memory and Chemically Derived Information
title_short ACP-BC: A Model for Accurate Identification of Anticancer Peptides Based on Fusion Features of Bidirectional Long Short-Term Memory and Chemically Derived Information
title_sort acp-bc: a model for accurate identification of anticancer peptides based on fusion features of bidirectional long short-term memory and chemically derived information
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10607064/
https://www.ncbi.nlm.nih.gov/pubmed/37895128
http://dx.doi.org/10.3390/ijms242015447
work_keys_str_mv AT sunmingwei acpbcamodelforaccurateidentificationofanticancerpeptidesbasedonfusionfeaturesofbidirectionallongshorttermmemoryandchemicallyderivedinformation
AT huhaoyuan acpbcamodelforaccurateidentificationofanticancerpeptidesbasedonfusionfeaturesofbidirectionallongshorttermmemoryandchemicallyderivedinformation
AT pangwei acpbcamodelforaccurateidentificationofanticancerpeptidesbasedonfusionfeaturesofbidirectionallongshorttermmemoryandchemicallyderivedinformation
AT zhouyou acpbcamodelforaccurateidentificationofanticancerpeptidesbasedonfusionfeaturesofbidirectionallongshorttermmemoryandchemicallyderivedinformation