Cargando…

Biological Activity Predictions of Ligands Based on Hybrid Molecular Fingerprinting and Ensemble Learning

[Image: see text] The biological activity predictions of ligands are an important research direction, which can improve the efficiency and success probability of drug screening. However, the traditional prediction method has the disadvantages of complex modeling and low screening efficiency. Machine...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Mengshan, Zeng, Ming, Zhang, Hang, Chen, Huijie, Guan, Lixin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9933080/
https://www.ncbi.nlm.nih.gov/pubmed/36816680
http://dx.doi.org/10.1021/acsomega.2c06944
_version_ 1784889596321988608
author Li, Mengshan
Zeng, Ming
Zhang, Hang
Chen, Huijie
Guan, Lixin
author_facet Li, Mengshan
Zeng, Ming
Zhang, Hang
Chen, Huijie
Guan, Lixin
author_sort Li, Mengshan
collection PubMed
description [Image: see text] The biological activity predictions of ligands are an important research direction, which can improve the efficiency and success probability of drug screening. However, the traditional prediction method has the disadvantages of complex modeling and low screening efficiency. Machine learning is considered an important research direction to solve these traditional method problems in the near future. This paper proposes a machine learning model with high predictive accuracy and stable prediction ability, namely, the back propagation neural network cross-support vector regression model (BPCSVR). By comparing multiple molecular descriptors, MACCS fingerprint and ECFP6 fingerprint were selected as inputs, and the stable prediction ability of the model was improved by integrating multiple models and correcting similar samples. We used leave-one-out cross-validation on 3038 samples from six data sets. The coefficient of determination, root mean square error, and absolute error were used as the evaluation parameters. After comparing the multiclass models, the results show that the BPCSVR model has stable prediction ability in different data sets, and the prediction accuracy is higher than other comparison models.
format Online
Article
Text
id pubmed-9933080
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-99330802023-02-17 Biological Activity Predictions of Ligands Based on Hybrid Molecular Fingerprinting and Ensemble Learning Li, Mengshan Zeng, Ming Zhang, Hang Chen, Huijie Guan, Lixin ACS Omega [Image: see text] The biological activity predictions of ligands are an important research direction, which can improve the efficiency and success probability of drug screening. However, the traditional prediction method has the disadvantages of complex modeling and low screening efficiency. Machine learning is considered an important research direction to solve these traditional method problems in the near future. This paper proposes a machine learning model with high predictive accuracy and stable prediction ability, namely, the back propagation neural network cross-support vector regression model (BPCSVR). By comparing multiple molecular descriptors, MACCS fingerprint and ECFP6 fingerprint were selected as inputs, and the stable prediction ability of the model was improved by integrating multiple models and correcting similar samples. We used leave-one-out cross-validation on 3038 samples from six data sets. The coefficient of determination, root mean square error, and absolute error were used as the evaluation parameters. After comparing the multiclass models, the results show that the BPCSVR model has stable prediction ability in different data sets, and the prediction accuracy is higher than other comparison models. American Chemical Society 2023-02-01 /pmc/articles/PMC9933080/ /pubmed/36816680 http://dx.doi.org/10.1021/acsomega.2c06944 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Li, Mengshan
Zeng, Ming
Zhang, Hang
Chen, Huijie
Guan, Lixin
Biological Activity Predictions of Ligands Based on Hybrid Molecular Fingerprinting and Ensemble Learning
title Biological Activity Predictions of Ligands Based on Hybrid Molecular Fingerprinting and Ensemble Learning
title_full Biological Activity Predictions of Ligands Based on Hybrid Molecular Fingerprinting and Ensemble Learning
title_fullStr Biological Activity Predictions of Ligands Based on Hybrid Molecular Fingerprinting and Ensemble Learning
title_full_unstemmed Biological Activity Predictions of Ligands Based on Hybrid Molecular Fingerprinting and Ensemble Learning
title_short Biological Activity Predictions of Ligands Based on Hybrid Molecular Fingerprinting and Ensemble Learning
title_sort biological activity predictions of ligands based on hybrid molecular fingerprinting and ensemble learning
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9933080/
https://www.ncbi.nlm.nih.gov/pubmed/36816680
http://dx.doi.org/10.1021/acsomega.2c06944
work_keys_str_mv AT limengshan biologicalactivitypredictionsofligandsbasedonhybridmolecularfingerprintingandensemblelearning
AT zengming biologicalactivitypredictionsofligandsbasedonhybridmolecularfingerprintingandensemblelearning
AT zhanghang biologicalactivitypredictionsofligandsbasedonhybridmolecularfingerprintingandensemblelearning
AT chenhuijie biologicalactivitypredictionsofligandsbasedonhybridmolecularfingerprintingandensemblelearning
AT guanlixin biologicalactivitypredictionsofligandsbasedonhybridmolecularfingerprintingandensemblelearning