Cargando…
Biological Activity Predictions of Ligands Based on Hybrid Molecular Fingerprinting and Ensemble Learning
[Image: see text] The biological activity predictions of ligands are an important research direction, which can improve the efficiency and success probability of drug screening. However, the traditional prediction method has the disadvantages of complex modeling and low screening efficiency. Machine...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2023
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9933080/ https://www.ncbi.nlm.nih.gov/pubmed/36816680 http://dx.doi.org/10.1021/acsomega.2c06944 |
_version_ | 1784889596321988608 |
---|---|
author | Li, Mengshan Zeng, Ming Zhang, Hang Chen, Huijie Guan, Lixin |
author_facet | Li, Mengshan Zeng, Ming Zhang, Hang Chen, Huijie Guan, Lixin |
author_sort | Li, Mengshan |
collection | PubMed |
description | [Image: see text] The biological activity predictions of ligands are an important research direction, which can improve the efficiency and success probability of drug screening. However, the traditional prediction method has the disadvantages of complex modeling and low screening efficiency. Machine learning is considered an important research direction to solve these traditional method problems in the near future. This paper proposes a machine learning model with high predictive accuracy and stable prediction ability, namely, the back propagation neural network cross-support vector regression model (BPCSVR). By comparing multiple molecular descriptors, MACCS fingerprint and ECFP6 fingerprint were selected as inputs, and the stable prediction ability of the model was improved by integrating multiple models and correcting similar samples. We used leave-one-out cross-validation on 3038 samples from six data sets. The coefficient of determination, root mean square error, and absolute error were used as the evaluation parameters. After comparing the multiclass models, the results show that the BPCSVR model has stable prediction ability in different data sets, and the prediction accuracy is higher than other comparison models. |
format | Online Article Text |
id | pubmed-9933080 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-99330802023-02-17 Biological Activity Predictions of Ligands Based on Hybrid Molecular Fingerprinting and Ensemble Learning Li, Mengshan Zeng, Ming Zhang, Hang Chen, Huijie Guan, Lixin ACS Omega [Image: see text] The biological activity predictions of ligands are an important research direction, which can improve the efficiency and success probability of drug screening. However, the traditional prediction method has the disadvantages of complex modeling and low screening efficiency. Machine learning is considered an important research direction to solve these traditional method problems in the near future. This paper proposes a machine learning model with high predictive accuracy and stable prediction ability, namely, the back propagation neural network cross-support vector regression model (BPCSVR). By comparing multiple molecular descriptors, MACCS fingerprint and ECFP6 fingerprint were selected as inputs, and the stable prediction ability of the model was improved by integrating multiple models and correcting similar samples. We used leave-one-out cross-validation on 3038 samples from six data sets. The coefficient of determination, root mean square error, and absolute error were used as the evaluation parameters. After comparing the multiclass models, the results show that the BPCSVR model has stable prediction ability in different data sets, and the prediction accuracy is higher than other comparison models. American Chemical Society 2023-02-01 /pmc/articles/PMC9933080/ /pubmed/36816680 http://dx.doi.org/10.1021/acsomega.2c06944 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Li, Mengshan Zeng, Ming Zhang, Hang Chen, Huijie Guan, Lixin Biological Activity Predictions of Ligands Based on Hybrid Molecular Fingerprinting and Ensemble Learning |
title | Biological Activity
Predictions of Ligands Based on
Hybrid Molecular Fingerprinting and Ensemble Learning |
title_full | Biological Activity
Predictions of Ligands Based on
Hybrid Molecular Fingerprinting and Ensemble Learning |
title_fullStr | Biological Activity
Predictions of Ligands Based on
Hybrid Molecular Fingerprinting and Ensemble Learning |
title_full_unstemmed | Biological Activity
Predictions of Ligands Based on
Hybrid Molecular Fingerprinting and Ensemble Learning |
title_short | Biological Activity
Predictions of Ligands Based on
Hybrid Molecular Fingerprinting and Ensemble Learning |
title_sort | biological activity
predictions of ligands based on
hybrid molecular fingerprinting and ensemble learning |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9933080/ https://www.ncbi.nlm.nih.gov/pubmed/36816680 http://dx.doi.org/10.1021/acsomega.2c06944 |
work_keys_str_mv | AT limengshan biologicalactivitypredictionsofligandsbasedonhybridmolecularfingerprintingandensemblelearning AT zengming biologicalactivitypredictionsofligandsbasedonhybridmolecularfingerprintingandensemblelearning AT zhanghang biologicalactivitypredictionsofligandsbasedonhybridmolecularfingerprintingandensemblelearning AT chenhuijie biologicalactivitypredictionsofligandsbasedonhybridmolecularfingerprintingandensemblelearning AT guanlixin biologicalactivitypredictionsofligandsbasedonhybridmolecularfingerprintingandensemblelearning |