Cargando…

Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides

Anticancer peptides (ACPs) represent a promising new therapeutic approach in cancer treatment. They can target cancer cells without affecting healthy tissues or altering normal physiological functions. Machine learning algorithms have increasingly been utilized for predicting peptide sequences with...

Descripción completa

Detalles Bibliográficos
Autores principales: Deng, Yiting, Ma, Shuhan, Li, Jiayu, Zheng, Bowen, Lv, Zhibin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10341712/
https://www.ncbi.nlm.nih.gov/pubmed/37446031
http://dx.doi.org/10.3390/ijms241310854
_version_ 1785072327116980224
author Deng, Yiting
Ma, Shuhan
Li, Jiayu
Zheng, Bowen
Lv, Zhibin
author_facet Deng, Yiting
Ma, Shuhan
Li, Jiayu
Zheng, Bowen
Lv, Zhibin
author_sort Deng, Yiting
collection PubMed
description Anticancer peptides (ACPs) represent a promising new therapeutic approach in cancer treatment. They can target cancer cells without affecting healthy tissues or altering normal physiological functions. Machine learning algorithms have increasingly been utilized for predicting peptide sequences with potential ACP effects. This study analyzed four benchmark datasets based on a well-established random forest (RF) algorithm. The peptide sequences were converted into 566 physicochemical features extracted from the amino acid index (AAindex) library, which were then subjected to feature selection using four methods: light gradient-boosting machine (LGBM), analysis of variance (ANOVA), chi-squared test (Chi(2)), and mutual information (MI). Presenting and merging the identified features using Venn diagrams, 19 key amino acid physicochemical properties were identified that can be used to predict the likelihood of a peptide sequence functioning as an ACP. The results were quantified by performance evaluation metrics to determine the accuracy of predictions. This study aims to enhance the efficiency of designing peptide sequences for cancer treatment.
format Online
Article
Text
id pubmed-10341712
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-103417122023-07-14 Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides Deng, Yiting Ma, Shuhan Li, Jiayu Zheng, Bowen Lv, Zhibin Int J Mol Sci Article Anticancer peptides (ACPs) represent a promising new therapeutic approach in cancer treatment. They can target cancer cells without affecting healthy tissues or altering normal physiological functions. Machine learning algorithms have increasingly been utilized for predicting peptide sequences with potential ACP effects. This study analyzed four benchmark datasets based on a well-established random forest (RF) algorithm. The peptide sequences were converted into 566 physicochemical features extracted from the amino acid index (AAindex) library, which were then subjected to feature selection using four methods: light gradient-boosting machine (LGBM), analysis of variance (ANOVA), chi-squared test (Chi(2)), and mutual information (MI). Presenting and merging the identified features using Venn diagrams, 19 key amino acid physicochemical properties were identified that can be used to predict the likelihood of a peptide sequence functioning as an ACP. The results were quantified by performance evaluation metrics to determine the accuracy of predictions. This study aims to enhance the efficiency of designing peptide sequences for cancer treatment. MDPI 2023-06-29 /pmc/articles/PMC10341712/ /pubmed/37446031 http://dx.doi.org/10.3390/ijms241310854 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Deng, Yiting
Ma, Shuhan
Li, Jiayu
Zheng, Bowen
Lv, Zhibin
Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides
title Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides
title_full Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides
title_fullStr Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides
title_full_unstemmed Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides
title_short Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides
title_sort using the random forest for identifying key physicochemical properties of amino acids to discriminate anticancer and non-anticancer peptides
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10341712/
https://www.ncbi.nlm.nih.gov/pubmed/37446031
http://dx.doi.org/10.3390/ijms241310854
work_keys_str_mv AT dengyiting usingtherandomforestforidentifyingkeyphysicochemicalpropertiesofaminoacidstodiscriminateanticancerandnonanticancerpeptides
AT mashuhan usingtherandomforestforidentifyingkeyphysicochemicalpropertiesofaminoacidstodiscriminateanticancerandnonanticancerpeptides
AT lijiayu usingtherandomforestforidentifyingkeyphysicochemicalpropertiesofaminoacidstodiscriminateanticancerandnonanticancerpeptides
AT zhengbowen usingtherandomforestforidentifyingkeyphysicochemicalpropertiesofaminoacidstodiscriminateanticancerandnonanticancerpeptides
AT lvzhibin usingtherandomforestforidentifyingkeyphysicochemicalpropertiesofaminoacidstodiscriminateanticancerandnonanticancerpeptides