Cargando…
A Random Forest Model for Peptide Classification Based on Virtual Docking Data
The affinity of peptides is a crucial factor in studying peptide–protein interactions. Despite the development of various techniques to evaluate peptide–receptor affinity, the results may not always reflect the actual affinity of the peptides accurately. The current study provides a free tool to ass...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10380188/ https://www.ncbi.nlm.nih.gov/pubmed/37511165 http://dx.doi.org/10.3390/ijms241411409 |
_version_ | 1785080124221161472 |
---|---|
author | Feng, Hua Wang, Fangyu Li, Ning Xu, Qian Zheng, Guanming Sun, Xuefeng Hu, Man Xing, Guangxu Zhang, Gaiping |
author_facet | Feng, Hua Wang, Fangyu Li, Ning Xu, Qian Zheng, Guanming Sun, Xuefeng Hu, Man Xing, Guangxu Zhang, Gaiping |
author_sort | Feng, Hua |
collection | PubMed |
description | The affinity of peptides is a crucial factor in studying peptide–protein interactions. Despite the development of various techniques to evaluate peptide–receptor affinity, the results may not always reflect the actual affinity of the peptides accurately. The current study provides a free tool to assess the actual peptide affinity based on virtual docking data. This study employed a dataset that combined actual peptide affinity information (active and inactive) and virtual peptide–receptor docking data, and different machine learning algorithms were utilized. Compared with the other algorithms, the random forest (RF) algorithm showed the best performance and was used in building three RF models using different numbers of significant features (four, three, and two). Further analysis revealed that the four-feature RF model achieved the highest Accuracy of 0.714 in classifying an independent unknown peptide dataset designed with the PEDV spike protein, and it also revealed overfitting problems in the other models. This four-feature RF model was used to evaluate peptide affinity by constructing the relationship between the actual affinity and the virtual docking scores of peptides to their receptors. |
format | Online Article Text |
id | pubmed-10380188 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-103801882023-07-29 A Random Forest Model for Peptide Classification Based on Virtual Docking Data Feng, Hua Wang, Fangyu Li, Ning Xu, Qian Zheng, Guanming Sun, Xuefeng Hu, Man Xing, Guangxu Zhang, Gaiping Int J Mol Sci Article The affinity of peptides is a crucial factor in studying peptide–protein interactions. Despite the development of various techniques to evaluate peptide–receptor affinity, the results may not always reflect the actual affinity of the peptides accurately. The current study provides a free tool to assess the actual peptide affinity based on virtual docking data. This study employed a dataset that combined actual peptide affinity information (active and inactive) and virtual peptide–receptor docking data, and different machine learning algorithms were utilized. Compared with the other algorithms, the random forest (RF) algorithm showed the best performance and was used in building three RF models using different numbers of significant features (four, three, and two). Further analysis revealed that the four-feature RF model achieved the highest Accuracy of 0.714 in classifying an independent unknown peptide dataset designed with the PEDV spike protein, and it also revealed overfitting problems in the other models. This four-feature RF model was used to evaluate peptide affinity by constructing the relationship between the actual affinity and the virtual docking scores of peptides to their receptors. MDPI 2023-07-13 /pmc/articles/PMC10380188/ /pubmed/37511165 http://dx.doi.org/10.3390/ijms241411409 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Feng, Hua Wang, Fangyu Li, Ning Xu, Qian Zheng, Guanming Sun, Xuefeng Hu, Man Xing, Guangxu Zhang, Gaiping A Random Forest Model for Peptide Classification Based on Virtual Docking Data |
title | A Random Forest Model for Peptide Classification Based on Virtual Docking Data |
title_full | A Random Forest Model for Peptide Classification Based on Virtual Docking Data |
title_fullStr | A Random Forest Model for Peptide Classification Based on Virtual Docking Data |
title_full_unstemmed | A Random Forest Model for Peptide Classification Based on Virtual Docking Data |
title_short | A Random Forest Model for Peptide Classification Based on Virtual Docking Data |
title_sort | random forest model for peptide classification based on virtual docking data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10380188/ https://www.ncbi.nlm.nih.gov/pubmed/37511165 http://dx.doi.org/10.3390/ijms241411409 |
work_keys_str_mv | AT fenghua arandomforestmodelforpeptideclassificationbasedonvirtualdockingdata AT wangfangyu arandomforestmodelforpeptideclassificationbasedonvirtualdockingdata AT lining arandomforestmodelforpeptideclassificationbasedonvirtualdockingdata AT xuqian arandomforestmodelforpeptideclassificationbasedonvirtualdockingdata AT zhengguanming arandomforestmodelforpeptideclassificationbasedonvirtualdockingdata AT sunxuefeng arandomforestmodelforpeptideclassificationbasedonvirtualdockingdata AT human arandomforestmodelforpeptideclassificationbasedonvirtualdockingdata AT xingguangxu arandomforestmodelforpeptideclassificationbasedonvirtualdockingdata AT zhanggaiping arandomforestmodelforpeptideclassificationbasedonvirtualdockingdata AT fenghua randomforestmodelforpeptideclassificationbasedonvirtualdockingdata AT wangfangyu randomforestmodelforpeptideclassificationbasedonvirtualdockingdata AT lining randomforestmodelforpeptideclassificationbasedonvirtualdockingdata AT xuqian randomforestmodelforpeptideclassificationbasedonvirtualdockingdata AT zhengguanming randomforestmodelforpeptideclassificationbasedonvirtualdockingdata AT sunxuefeng randomforestmodelforpeptideclassificationbasedonvirtualdockingdata AT human randomforestmodelforpeptideclassificationbasedonvirtualdockingdata AT xingguangxu randomforestmodelforpeptideclassificationbasedonvirtualdockingdata AT zhanggaiping randomforestmodelforpeptideclassificationbasedonvirtualdockingdata |