Cargando…
Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides
Anticancer peptides (ACPs) represent a promising new therapeutic approach in cancer treatment. They can target cancer cells without affecting healthy tissues or altering normal physiological functions. Machine learning algorithms have increasingly been utilized for predicting peptide sequences with...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10341712/ https://www.ncbi.nlm.nih.gov/pubmed/37446031 http://dx.doi.org/10.3390/ijms241310854 |
_version_ | 1785072327116980224 |
---|---|
author | Deng, Yiting Ma, Shuhan Li, Jiayu Zheng, Bowen Lv, Zhibin |
author_facet | Deng, Yiting Ma, Shuhan Li, Jiayu Zheng, Bowen Lv, Zhibin |
author_sort | Deng, Yiting |
collection | PubMed |
description | Anticancer peptides (ACPs) represent a promising new therapeutic approach in cancer treatment. They can target cancer cells without affecting healthy tissues or altering normal physiological functions. Machine learning algorithms have increasingly been utilized for predicting peptide sequences with potential ACP effects. This study analyzed four benchmark datasets based on a well-established random forest (RF) algorithm. The peptide sequences were converted into 566 physicochemical features extracted from the amino acid index (AAindex) library, which were then subjected to feature selection using four methods: light gradient-boosting machine (LGBM), analysis of variance (ANOVA), chi-squared test (Chi(2)), and mutual information (MI). Presenting and merging the identified features using Venn diagrams, 19 key amino acid physicochemical properties were identified that can be used to predict the likelihood of a peptide sequence functioning as an ACP. The results were quantified by performance evaluation metrics to determine the accuracy of predictions. This study aims to enhance the efficiency of designing peptide sequences for cancer treatment. |
format | Online Article Text |
id | pubmed-10341712 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-103417122023-07-14 Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides Deng, Yiting Ma, Shuhan Li, Jiayu Zheng, Bowen Lv, Zhibin Int J Mol Sci Article Anticancer peptides (ACPs) represent a promising new therapeutic approach in cancer treatment. They can target cancer cells without affecting healthy tissues or altering normal physiological functions. Machine learning algorithms have increasingly been utilized for predicting peptide sequences with potential ACP effects. This study analyzed four benchmark datasets based on a well-established random forest (RF) algorithm. The peptide sequences were converted into 566 physicochemical features extracted from the amino acid index (AAindex) library, which were then subjected to feature selection using four methods: light gradient-boosting machine (LGBM), analysis of variance (ANOVA), chi-squared test (Chi(2)), and mutual information (MI). Presenting and merging the identified features using Venn diagrams, 19 key amino acid physicochemical properties were identified that can be used to predict the likelihood of a peptide sequence functioning as an ACP. The results were quantified by performance evaluation metrics to determine the accuracy of predictions. This study aims to enhance the efficiency of designing peptide sequences for cancer treatment. MDPI 2023-06-29 /pmc/articles/PMC10341712/ /pubmed/37446031 http://dx.doi.org/10.3390/ijms241310854 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Deng, Yiting Ma, Shuhan Li, Jiayu Zheng, Bowen Lv, Zhibin Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides |
title | Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides |
title_full | Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides |
title_fullStr | Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides |
title_full_unstemmed | Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides |
title_short | Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides |
title_sort | using the random forest for identifying key physicochemical properties of amino acids to discriminate anticancer and non-anticancer peptides |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10341712/ https://www.ncbi.nlm.nih.gov/pubmed/37446031 http://dx.doi.org/10.3390/ijms241310854 |
work_keys_str_mv | AT dengyiting usingtherandomforestforidentifyingkeyphysicochemicalpropertiesofaminoacidstodiscriminateanticancerandnonanticancerpeptides AT mashuhan usingtherandomforestforidentifyingkeyphysicochemicalpropertiesofaminoacidstodiscriminateanticancerandnonanticancerpeptides AT lijiayu usingtherandomforestforidentifyingkeyphysicochemicalpropertiesofaminoacidstodiscriminateanticancerandnonanticancerpeptides AT zhengbowen usingtherandomforestforidentifyingkeyphysicochemicalpropertiesofaminoacidstodiscriminateanticancerandnonanticancerpeptides AT lvzhibin usingtherandomforestforidentifyingkeyphysicochemicalpropertiesofaminoacidstodiscriminateanticancerandnonanticancerpeptides |