Cargando…

Application of random forest based on semi-automatic parameter adjustment for optimization of anti-breast cancer drugs

The optimization of drug properties in the process of cancer drug development is very important to save research and development time and cost. In order to make the anti-breast cancer drug candidates with good biological activity, this paper collected 1974 compounds, firstly, the top 20 molecular de...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Jiajia, Zhou, Zhihui, Kong, Shanshan, Ma, Zezhong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9353770/
https://www.ncbi.nlm.nih.gov/pubmed/35936743
http://dx.doi.org/10.3389/fonc.2022.956705
_version_ 1784762926422294528
author Liu, Jiajia
Zhou, Zhihui
Kong, Shanshan
Ma, Zezhong
author_facet Liu, Jiajia
Zhou, Zhihui
Kong, Shanshan
Ma, Zezhong
author_sort Liu, Jiajia
collection PubMed
description The optimization of drug properties in the process of cancer drug development is very important to save research and development time and cost. In order to make the anti-breast cancer drug candidates with good biological activity, this paper collected 1974 compounds, firstly, the top 20 molecular descriptors that have the most influence on biological activity were screened by using XGBoost-based data feature selection; secondly, on this basis, take pIC50 values as feature data and use a variety of machine learning algorithms to compare, soas to select a most suitable algorithm to predict the IC50 and pIC50 values. It is preliminarily found that the effects of Random Forest, XGBoost and Gradient-enhanced algorithms are good and have little difference, and the Support vector machine is the worst. Then, using the Semi-automatic parameter adjustment method to adjust the parameters of Random Forest, XGBoost and Gradient-enhanced algorithms to find the optimal parameters. It is found that the Random Forest algorithm has high accuracy and excellent anti over fitting, and the algorithm is stable. Its prediction accuracy is 0.745. Finally, the accuracy of the results is verified by training the model with the preliminarily selected data, which provides an innovative solution for the optimization of the properties of anti- breast cancer drugs, and can provide better support for the early research and development of anti-breast cancer drugs.
format Online
Article
Text
id pubmed-9353770
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-93537702022-08-06 Application of random forest based on semi-automatic parameter adjustment for optimization of anti-breast cancer drugs Liu, Jiajia Zhou, Zhihui Kong, Shanshan Ma, Zezhong Front Oncol Oncology The optimization of drug properties in the process of cancer drug development is very important to save research and development time and cost. In order to make the anti-breast cancer drug candidates with good biological activity, this paper collected 1974 compounds, firstly, the top 20 molecular descriptors that have the most influence on biological activity were screened by using XGBoost-based data feature selection; secondly, on this basis, take pIC50 values as feature data and use a variety of machine learning algorithms to compare, soas to select a most suitable algorithm to predict the IC50 and pIC50 values. It is preliminarily found that the effects of Random Forest, XGBoost and Gradient-enhanced algorithms are good and have little difference, and the Support vector machine is the worst. Then, using the Semi-automatic parameter adjustment method to adjust the parameters of Random Forest, XGBoost and Gradient-enhanced algorithms to find the optimal parameters. It is found that the Random Forest algorithm has high accuracy and excellent anti over fitting, and the algorithm is stable. Its prediction accuracy is 0.745. Finally, the accuracy of the results is verified by training the model with the preliminarily selected data, which provides an innovative solution for the optimization of the properties of anti- breast cancer drugs, and can provide better support for the early research and development of anti-breast cancer drugs. Frontiers Media S.A. 2022-07-22 /pmc/articles/PMC9353770/ /pubmed/35936743 http://dx.doi.org/10.3389/fonc.2022.956705 Text en Copyright © 2022 Liu, Zhou, Kong and Ma https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Oncology
Liu, Jiajia
Zhou, Zhihui
Kong, Shanshan
Ma, Zezhong
Application of random forest based on semi-automatic parameter adjustment for optimization of anti-breast cancer drugs
title Application of random forest based on semi-automatic parameter adjustment for optimization of anti-breast cancer drugs
title_full Application of random forest based on semi-automatic parameter adjustment for optimization of anti-breast cancer drugs
title_fullStr Application of random forest based on semi-automatic parameter adjustment for optimization of anti-breast cancer drugs
title_full_unstemmed Application of random forest based on semi-automatic parameter adjustment for optimization of anti-breast cancer drugs
title_short Application of random forest based on semi-automatic parameter adjustment for optimization of anti-breast cancer drugs
title_sort application of random forest based on semi-automatic parameter adjustment for optimization of anti-breast cancer drugs
topic Oncology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9353770/
https://www.ncbi.nlm.nih.gov/pubmed/35936743
http://dx.doi.org/10.3389/fonc.2022.956705
work_keys_str_mv AT liujiajia applicationofrandomforestbasedonsemiautomaticparameteradjustmentforoptimizationofantibreastcancerdrugs
AT zhouzhihui applicationofrandomforestbasedonsemiautomaticparameteradjustmentforoptimizationofantibreastcancerdrugs
AT kongshanshan applicationofrandomforestbasedonsemiautomaticparameteradjustmentforoptimizationofantibreastcancerdrugs
AT mazezhong applicationofrandomforestbasedonsemiautomaticparameteradjustmentforoptimizationofantibreastcancerdrugs