Cargando…

A Random Forest Model for Peptide Classification Based on Virtual Docking Data

The affinity of peptides is a crucial factor in studying peptide–protein interactions. Despite the development of various techniques to evaluate peptide–receptor affinity, the results may not always reflect the actual affinity of the peptides accurately. The current study provides a free tool to ass...

Descripción completa

Detalles Bibliográficos
Autores principales: Feng, Hua, Wang, Fangyu, Li, Ning, Xu, Qian, Zheng, Guanming, Sun, Xuefeng, Hu, Man, Xing, Guangxu, Zhang, Gaiping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10380188/
https://www.ncbi.nlm.nih.gov/pubmed/37511165
http://dx.doi.org/10.3390/ijms241411409
_version_ 1785080124221161472
author Feng, Hua
Wang, Fangyu
Li, Ning
Xu, Qian
Zheng, Guanming
Sun, Xuefeng
Hu, Man
Xing, Guangxu
Zhang, Gaiping
author_facet Feng, Hua
Wang, Fangyu
Li, Ning
Xu, Qian
Zheng, Guanming
Sun, Xuefeng
Hu, Man
Xing, Guangxu
Zhang, Gaiping
author_sort Feng, Hua
collection PubMed
description The affinity of peptides is a crucial factor in studying peptide–protein interactions. Despite the development of various techniques to evaluate peptide–receptor affinity, the results may not always reflect the actual affinity of the peptides accurately. The current study provides a free tool to assess the actual peptide affinity based on virtual docking data. This study employed a dataset that combined actual peptide affinity information (active and inactive) and virtual peptide–receptor docking data, and different machine learning algorithms were utilized. Compared with the other algorithms, the random forest (RF) algorithm showed the best performance and was used in building three RF models using different numbers of significant features (four, three, and two). Further analysis revealed that the four-feature RF model achieved the highest Accuracy of 0.714 in classifying an independent unknown peptide dataset designed with the PEDV spike protein, and it also revealed overfitting problems in the other models. This four-feature RF model was used to evaluate peptide affinity by constructing the relationship between the actual affinity and the virtual docking scores of peptides to their receptors.
format Online
Article
Text
id pubmed-10380188
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-103801882023-07-29 A Random Forest Model for Peptide Classification Based on Virtual Docking Data Feng, Hua Wang, Fangyu Li, Ning Xu, Qian Zheng, Guanming Sun, Xuefeng Hu, Man Xing, Guangxu Zhang, Gaiping Int J Mol Sci Article The affinity of peptides is a crucial factor in studying peptide–protein interactions. Despite the development of various techniques to evaluate peptide–receptor affinity, the results may not always reflect the actual affinity of the peptides accurately. The current study provides a free tool to assess the actual peptide affinity based on virtual docking data. This study employed a dataset that combined actual peptide affinity information (active and inactive) and virtual peptide–receptor docking data, and different machine learning algorithms were utilized. Compared with the other algorithms, the random forest (RF) algorithm showed the best performance and was used in building three RF models using different numbers of significant features (four, three, and two). Further analysis revealed that the four-feature RF model achieved the highest Accuracy of 0.714 in classifying an independent unknown peptide dataset designed with the PEDV spike protein, and it also revealed overfitting problems in the other models. This four-feature RF model was used to evaluate peptide affinity by constructing the relationship between the actual affinity and the virtual docking scores of peptides to their receptors. MDPI 2023-07-13 /pmc/articles/PMC10380188/ /pubmed/37511165 http://dx.doi.org/10.3390/ijms241411409 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Feng, Hua
Wang, Fangyu
Li, Ning
Xu, Qian
Zheng, Guanming
Sun, Xuefeng
Hu, Man
Xing, Guangxu
Zhang, Gaiping
A Random Forest Model for Peptide Classification Based on Virtual Docking Data
title A Random Forest Model for Peptide Classification Based on Virtual Docking Data
title_full A Random Forest Model for Peptide Classification Based on Virtual Docking Data
title_fullStr A Random Forest Model for Peptide Classification Based on Virtual Docking Data
title_full_unstemmed A Random Forest Model for Peptide Classification Based on Virtual Docking Data
title_short A Random Forest Model for Peptide Classification Based on Virtual Docking Data
title_sort random forest model for peptide classification based on virtual docking data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10380188/
https://www.ncbi.nlm.nih.gov/pubmed/37511165
http://dx.doi.org/10.3390/ijms241411409
work_keys_str_mv AT fenghua arandomforestmodelforpeptideclassificationbasedonvirtualdockingdata
AT wangfangyu arandomforestmodelforpeptideclassificationbasedonvirtualdockingdata
AT lining arandomforestmodelforpeptideclassificationbasedonvirtualdockingdata
AT xuqian arandomforestmodelforpeptideclassificationbasedonvirtualdockingdata
AT zhengguanming arandomforestmodelforpeptideclassificationbasedonvirtualdockingdata
AT sunxuefeng arandomforestmodelforpeptideclassificationbasedonvirtualdockingdata
AT human arandomforestmodelforpeptideclassificationbasedonvirtualdockingdata
AT xingguangxu arandomforestmodelforpeptideclassificationbasedonvirtualdockingdata
AT zhanggaiping arandomforestmodelforpeptideclassificationbasedonvirtualdockingdata
AT fenghua randomforestmodelforpeptideclassificationbasedonvirtualdockingdata
AT wangfangyu randomforestmodelforpeptideclassificationbasedonvirtualdockingdata
AT lining randomforestmodelforpeptideclassificationbasedonvirtualdockingdata
AT xuqian randomforestmodelforpeptideclassificationbasedonvirtualdockingdata
AT zhengguanming randomforestmodelforpeptideclassificationbasedonvirtualdockingdata
AT sunxuefeng randomforestmodelforpeptideclassificationbasedonvirtualdockingdata
AT human randomforestmodelforpeptideclassificationbasedonvirtualdockingdata
AT xingguangxu randomforestmodelforpeptideclassificationbasedonvirtualdockingdata
AT zhanggaiping randomforestmodelforpeptideclassificationbasedonvirtualdockingdata