Cargando…

GNINA 1.0: molecular docking with deep learning

Molecular docking computationally predicts the conformation of a small molecule when binding to a receptor. Scoring functions are a vital piece of any molecular docking pipeline as they determine the fitness of sampled poses. Here we describe and evaluate the 1.0 release of the Gnina docking softwar...

Descripción completa

Detalles Bibliográficos
Autores principales: McNutt, Andrew T., Francoeur, Paul, Aggarwal, Rishal, Masuda, Tomohide, Meli, Rocco, Ragoza, Matthew, Sunseri, Jocelyn, Koes, David Ryan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8191141/
https://www.ncbi.nlm.nih.gov/pubmed/34108002
http://dx.doi.org/10.1186/s13321-021-00522-2
_version_ 1783705818691010560
author McNutt, Andrew T.
Francoeur, Paul
Aggarwal, Rishal
Masuda, Tomohide
Meli, Rocco
Ragoza, Matthew
Sunseri, Jocelyn
Koes, David Ryan
author_facet McNutt, Andrew T.
Francoeur, Paul
Aggarwal, Rishal
Masuda, Tomohide
Meli, Rocco
Ragoza, Matthew
Sunseri, Jocelyn
Koes, David Ryan
author_sort McNutt, Andrew T.
collection PubMed
description Molecular docking computationally predicts the conformation of a small molecule when binding to a receptor. Scoring functions are a vital piece of any molecular docking pipeline as they determine the fitness of sampled poses. Here we describe and evaluate the 1.0 release of the Gnina docking software, which utilizes an ensemble of convolutional neural networks (CNNs) as a scoring function. We also explore an array of parameter values for Gnina 1.0 to optimize docking performance and computational cost. Docking performance, as evaluated by the percentage of targets where the top pose is better than 2Å root mean square deviation (Top1), is compared to AutoDock Vina scoring when utilizing explicitly defined binding pockets or whole protein docking. Gnina, utilizing a CNN scoring function to rescore the output poses, outperforms AutoDock Vina scoring on redocking and cross-docking tasks when the binding pocket is defined (Top1 increases from 58% to 73% and from 27% to 37%, respectively) and when the whole protein defines the binding pocket (Top1 increases from 31% to 38% and from 12% to 16%, respectively). The derived ensemble of CNNs generalizes to unseen proteins and ligands and produces scores that correlate well with the root mean square deviation to the known binding pose. We provide the 1.0 version of Gnina under an open source license for use as a molecular docking tool at https://github.com/gnina/gnina. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-021-00522-2.
format Online
Article
Text
id pubmed-8191141
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-81911412021-06-10 GNINA 1.0: molecular docking with deep learning McNutt, Andrew T. Francoeur, Paul Aggarwal, Rishal Masuda, Tomohide Meli, Rocco Ragoza, Matthew Sunseri, Jocelyn Koes, David Ryan J Cheminform Software Molecular docking computationally predicts the conformation of a small molecule when binding to a receptor. Scoring functions are a vital piece of any molecular docking pipeline as they determine the fitness of sampled poses. Here we describe and evaluate the 1.0 release of the Gnina docking software, which utilizes an ensemble of convolutional neural networks (CNNs) as a scoring function. We also explore an array of parameter values for Gnina 1.0 to optimize docking performance and computational cost. Docking performance, as evaluated by the percentage of targets where the top pose is better than 2Å root mean square deviation (Top1), is compared to AutoDock Vina scoring when utilizing explicitly defined binding pockets or whole protein docking. Gnina, utilizing a CNN scoring function to rescore the output poses, outperforms AutoDock Vina scoring on redocking and cross-docking tasks when the binding pocket is defined (Top1 increases from 58% to 73% and from 27% to 37%, respectively) and when the whole protein defines the binding pocket (Top1 increases from 31% to 38% and from 12% to 16%, respectively). The derived ensemble of CNNs generalizes to unseen proteins and ligands and produces scores that correlate well with the root mean square deviation to the known binding pose. We provide the 1.0 version of Gnina under an open source license for use as a molecular docking tool at https://github.com/gnina/gnina. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-021-00522-2. Springer International Publishing 2021-06-09 /pmc/articles/PMC8191141/ /pubmed/34108002 http://dx.doi.org/10.1186/s13321-021-00522-2 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
McNutt, Andrew T.
Francoeur, Paul
Aggarwal, Rishal
Masuda, Tomohide
Meli, Rocco
Ragoza, Matthew
Sunseri, Jocelyn
Koes, David Ryan
GNINA 1.0: molecular docking with deep learning
title GNINA 1.0: molecular docking with deep learning
title_full GNINA 1.0: molecular docking with deep learning
title_fullStr GNINA 1.0: molecular docking with deep learning
title_full_unstemmed GNINA 1.0: molecular docking with deep learning
title_short GNINA 1.0: molecular docking with deep learning
title_sort gnina 1.0: molecular docking with deep learning
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8191141/
https://www.ncbi.nlm.nih.gov/pubmed/34108002
http://dx.doi.org/10.1186/s13321-021-00522-2
work_keys_str_mv AT mcnuttandrewt gnina10moleculardockingwithdeeplearning
AT francoeurpaul gnina10moleculardockingwithdeeplearning
AT aggarwalrishal gnina10moleculardockingwithdeeplearning
AT masudatomohide gnina10moleculardockingwithdeeplearning
AT melirocco gnina10moleculardockingwithdeeplearning
AT ragozamatthew gnina10moleculardockingwithdeeplearning
AT sunserijocelyn gnina10moleculardockingwithdeeplearning
AT koesdavidryan gnina10moleculardockingwithdeeplearning