Cargando…

Machine Learning Models Combined with Virtual Screening and Molecular Docking to Predict Human Topoisomerase I Inhibitors

In this work, random forest (RF), support vector machine, k-nearest neighbor and C4.5 decision tree, were used to establish classification models for predicting whether an unknown molecule is an inhibitor of human topoisomerase I (Top1) protein. All these models have achieved satisfactory results, w...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Bingke, Kang, Xiaokang, Zhao, Dan, Zou, Yurong, Huang, Xudong, Wang, Jiexue, Zhang, Chenghua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6601036/
https://www.ncbi.nlm.nih.gov/pubmed/31167344
http://dx.doi.org/10.3390/molecules24112107
_version_ 1783431228734570496
author Li, Bingke
Kang, Xiaokang
Zhao, Dan
Zou, Yurong
Huang, Xudong
Wang, Jiexue
Zhang, Chenghua
author_facet Li, Bingke
Kang, Xiaokang
Zhao, Dan
Zou, Yurong
Huang, Xudong
Wang, Jiexue
Zhang, Chenghua
author_sort Li, Bingke
collection PubMed
description In this work, random forest (RF), support vector machine, k-nearest neighbor and C4.5 decision tree, were used to establish classification models for predicting whether an unknown molecule is an inhibitor of human topoisomerase I (Top1) protein. All these models have achieved satisfactory results, with total prediction accuracies from 89.70% to 97.12%. Through comparative analysis, it can be found that the RF model has the best forecasting effect. The parameters were further optimized to generate the best-performing RF model. At the same time, features selection was implemented to choose properties most relevant to the inhibition of Top1 from 189 molecular descriptors through a special RF procedure. Subsequently, a ligand-based virtual screening was performed from the Maybridge database by the optimal RF model and 596 hits were picked out. Then, 67 molecules with relative probability scores over 0.7 were selected based on the screening results. Next, the 67 molecules above were docked to Top1 using AutoDock Vina. Finally, six top-ranked molecules with binding energies less than −10.0 kcal/mol were screened out and a common backbone, which is entirely different from that of existing Top1 inhibitors reported in the literature, was found.
format Online
Article
Text
id pubmed-6601036
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-66010362019-07-18 Machine Learning Models Combined with Virtual Screening and Molecular Docking to Predict Human Topoisomerase I Inhibitors Li, Bingke Kang, Xiaokang Zhao, Dan Zou, Yurong Huang, Xudong Wang, Jiexue Zhang, Chenghua Molecules Article In this work, random forest (RF), support vector machine, k-nearest neighbor and C4.5 decision tree, were used to establish classification models for predicting whether an unknown molecule is an inhibitor of human topoisomerase I (Top1) protein. All these models have achieved satisfactory results, with total prediction accuracies from 89.70% to 97.12%. Through comparative analysis, it can be found that the RF model has the best forecasting effect. The parameters were further optimized to generate the best-performing RF model. At the same time, features selection was implemented to choose properties most relevant to the inhibition of Top1 from 189 molecular descriptors through a special RF procedure. Subsequently, a ligand-based virtual screening was performed from the Maybridge database by the optimal RF model and 596 hits were picked out. Then, 67 molecules with relative probability scores over 0.7 were selected based on the screening results. Next, the 67 molecules above were docked to Top1 using AutoDock Vina. Finally, six top-ranked molecules with binding energies less than −10.0 kcal/mol were screened out and a common backbone, which is entirely different from that of existing Top1 inhibitors reported in the literature, was found. MDPI 2019-06-04 /pmc/articles/PMC6601036/ /pubmed/31167344 http://dx.doi.org/10.3390/molecules24112107 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Li, Bingke
Kang, Xiaokang
Zhao, Dan
Zou, Yurong
Huang, Xudong
Wang, Jiexue
Zhang, Chenghua
Machine Learning Models Combined with Virtual Screening and Molecular Docking to Predict Human Topoisomerase I Inhibitors
title Machine Learning Models Combined with Virtual Screening and Molecular Docking to Predict Human Topoisomerase I Inhibitors
title_full Machine Learning Models Combined with Virtual Screening and Molecular Docking to Predict Human Topoisomerase I Inhibitors
title_fullStr Machine Learning Models Combined with Virtual Screening and Molecular Docking to Predict Human Topoisomerase I Inhibitors
title_full_unstemmed Machine Learning Models Combined with Virtual Screening and Molecular Docking to Predict Human Topoisomerase I Inhibitors
title_short Machine Learning Models Combined with Virtual Screening and Molecular Docking to Predict Human Topoisomerase I Inhibitors
title_sort machine learning models combined with virtual screening and molecular docking to predict human topoisomerase i inhibitors
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6601036/
https://www.ncbi.nlm.nih.gov/pubmed/31167344
http://dx.doi.org/10.3390/molecules24112107
work_keys_str_mv AT libingke machinelearningmodelscombinedwithvirtualscreeningandmoleculardockingtopredicthumantopoisomeraseiinhibitors
AT kangxiaokang machinelearningmodelscombinedwithvirtualscreeningandmoleculardockingtopredicthumantopoisomeraseiinhibitors
AT zhaodan machinelearningmodelscombinedwithvirtualscreeningandmoleculardockingtopredicthumantopoisomeraseiinhibitors
AT zouyurong machinelearningmodelscombinedwithvirtualscreeningandmoleculardockingtopredicthumantopoisomeraseiinhibitors
AT huangxudong machinelearningmodelscombinedwithvirtualscreeningandmoleculardockingtopredicthumantopoisomeraseiinhibitors
AT wangjiexue machinelearningmodelscombinedwithvirtualscreeningandmoleculardockingtopredicthumantopoisomeraseiinhibitors
AT zhangchenghua machinelearningmodelscombinedwithvirtualscreeningandmoleculardockingtopredicthumantopoisomeraseiinhibitors