Cargando…
Application of random forest based on semi-automatic parameter adjustment for optimization of anti-breast cancer drugs
The optimization of drug properties in the process of cancer drug development is very important to save research and development time and cost. In order to make the anti-breast cancer drug candidates with good biological activity, this paper collected 1974 compounds, firstly, the top 20 molecular de...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9353770/ https://www.ncbi.nlm.nih.gov/pubmed/35936743 http://dx.doi.org/10.3389/fonc.2022.956705 |
_version_ | 1784762926422294528 |
---|---|
author | Liu, Jiajia Zhou, Zhihui Kong, Shanshan Ma, Zezhong |
author_facet | Liu, Jiajia Zhou, Zhihui Kong, Shanshan Ma, Zezhong |
author_sort | Liu, Jiajia |
collection | PubMed |
description | The optimization of drug properties in the process of cancer drug development is very important to save research and development time and cost. In order to make the anti-breast cancer drug candidates with good biological activity, this paper collected 1974 compounds, firstly, the top 20 molecular descriptors that have the most influence on biological activity were screened by using XGBoost-based data feature selection; secondly, on this basis, take pIC50 values as feature data and use a variety of machine learning algorithms to compare, soas to select a most suitable algorithm to predict the IC50 and pIC50 values. It is preliminarily found that the effects of Random Forest, XGBoost and Gradient-enhanced algorithms are good and have little difference, and the Support vector machine is the worst. Then, using the Semi-automatic parameter adjustment method to adjust the parameters of Random Forest, XGBoost and Gradient-enhanced algorithms to find the optimal parameters. It is found that the Random Forest algorithm has high accuracy and excellent anti over fitting, and the algorithm is stable. Its prediction accuracy is 0.745. Finally, the accuracy of the results is verified by training the model with the preliminarily selected data, which provides an innovative solution for the optimization of the properties of anti- breast cancer drugs, and can provide better support for the early research and development of anti-breast cancer drugs. |
format | Online Article Text |
id | pubmed-9353770 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-93537702022-08-06 Application of random forest based on semi-automatic parameter adjustment for optimization of anti-breast cancer drugs Liu, Jiajia Zhou, Zhihui Kong, Shanshan Ma, Zezhong Front Oncol Oncology The optimization of drug properties in the process of cancer drug development is very important to save research and development time and cost. In order to make the anti-breast cancer drug candidates with good biological activity, this paper collected 1974 compounds, firstly, the top 20 molecular descriptors that have the most influence on biological activity were screened by using XGBoost-based data feature selection; secondly, on this basis, take pIC50 values as feature data and use a variety of machine learning algorithms to compare, soas to select a most suitable algorithm to predict the IC50 and pIC50 values. It is preliminarily found that the effects of Random Forest, XGBoost and Gradient-enhanced algorithms are good and have little difference, and the Support vector machine is the worst. Then, using the Semi-automatic parameter adjustment method to adjust the parameters of Random Forest, XGBoost and Gradient-enhanced algorithms to find the optimal parameters. It is found that the Random Forest algorithm has high accuracy and excellent anti over fitting, and the algorithm is stable. Its prediction accuracy is 0.745. Finally, the accuracy of the results is verified by training the model with the preliminarily selected data, which provides an innovative solution for the optimization of the properties of anti- breast cancer drugs, and can provide better support for the early research and development of anti-breast cancer drugs. Frontiers Media S.A. 2022-07-22 /pmc/articles/PMC9353770/ /pubmed/35936743 http://dx.doi.org/10.3389/fonc.2022.956705 Text en Copyright © 2022 Liu, Zhou, Kong and Ma https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Oncology Liu, Jiajia Zhou, Zhihui Kong, Shanshan Ma, Zezhong Application of random forest based on semi-automatic parameter adjustment for optimization of anti-breast cancer drugs |
title | Application of random forest based on semi-automatic parameter adjustment for optimization of anti-breast cancer drugs |
title_full | Application of random forest based on semi-automatic parameter adjustment for optimization of anti-breast cancer drugs |
title_fullStr | Application of random forest based on semi-automatic parameter adjustment for optimization of anti-breast cancer drugs |
title_full_unstemmed | Application of random forest based on semi-automatic parameter adjustment for optimization of anti-breast cancer drugs |
title_short | Application of random forest based on semi-automatic parameter adjustment for optimization of anti-breast cancer drugs |
title_sort | application of random forest based on semi-automatic parameter adjustment for optimization of anti-breast cancer drugs |
topic | Oncology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9353770/ https://www.ncbi.nlm.nih.gov/pubmed/35936743 http://dx.doi.org/10.3389/fonc.2022.956705 |
work_keys_str_mv | AT liujiajia applicationofrandomforestbasedonsemiautomaticparameteradjustmentforoptimizationofantibreastcancerdrugs AT zhouzhihui applicationofrandomforestbasedonsemiautomaticparameteradjustmentforoptimizationofantibreastcancerdrugs AT kongshanshan applicationofrandomforestbasedonsemiautomaticparameteradjustmentforoptimizationofantibreastcancerdrugs AT mazezhong applicationofrandomforestbasedonsemiautomaticparameteradjustmentforoptimizationofantibreastcancerdrugs |