Cargando…

Simplified, interpretable graph convolutional neural networks for small molecule activity prediction

We here present a streamlined, explainable graph convolutional neural network (gCNN) architecture for small molecule activity prediction. We first conduct a hyperparameter optimization across nearly 800 protein targets that produces a simplified gCNN QSAR architecture, and we observe that such a mod...

Descripción completa

Detalles Bibliográficos
Autores principales: Weber, Jeffrey K., Morrone, Joseph A., Bagchi, Sugato, Pabon, Jan D. Estrada, Kang, Seung-gu, Zhang, Leili, Cornell, Wendy D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9325818/
https://www.ncbi.nlm.nih.gov/pubmed/34817762
http://dx.doi.org/10.1007/s10822-021-00421-6
_version_ 1784757141037383680
author Weber, Jeffrey K.
Morrone, Joseph A.
Bagchi, Sugato
Pabon, Jan D. Estrada
Kang, Seung-gu
Zhang, Leili
Cornell, Wendy D.
author_facet Weber, Jeffrey K.
Morrone, Joseph A.
Bagchi, Sugato
Pabon, Jan D. Estrada
Kang, Seung-gu
Zhang, Leili
Cornell, Wendy D.
author_sort Weber, Jeffrey K.
collection PubMed
description We here present a streamlined, explainable graph convolutional neural network (gCNN) architecture for small molecule activity prediction. We first conduct a hyperparameter optimization across nearly 800 protein targets that produces a simplified gCNN QSAR architecture, and we observe that such a model can yield performance improvements over both standard gCNN and RF methods on difficult-to-classify test sets. Additionally, we discuss how reductions in convolutional layer dimensions potentially speak to the “anatomical” needs of gCNNs with respect to radial coarse graining of molecular substructure. We augment this simplified architecture with saliency map technology that highlights molecular substructures relevant to activity, and we perform saliency analysis on nearly 100 data-rich protein targets. We show that resultant substructural clusters are useful visualization tools for understanding substructure-activity relationships. We go on to highlight connections between our models’ saliency predictions and observations made in the medicinal chemistry literature, focusing on four case studies of past lead finding and lead optimization campaigns. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s10822-021-00421-6.
format Online
Article
Text
id pubmed-9325818
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-93258182022-07-28 Simplified, interpretable graph convolutional neural networks for small molecule activity prediction Weber, Jeffrey K. Morrone, Joseph A. Bagchi, Sugato Pabon, Jan D. Estrada Kang, Seung-gu Zhang, Leili Cornell, Wendy D. J Comput Aided Mol Des Article We here present a streamlined, explainable graph convolutional neural network (gCNN) architecture for small molecule activity prediction. We first conduct a hyperparameter optimization across nearly 800 protein targets that produces a simplified gCNN QSAR architecture, and we observe that such a model can yield performance improvements over both standard gCNN and RF methods on difficult-to-classify test sets. Additionally, we discuss how reductions in convolutional layer dimensions potentially speak to the “anatomical” needs of gCNNs with respect to radial coarse graining of molecular substructure. We augment this simplified architecture with saliency map technology that highlights molecular substructures relevant to activity, and we perform saliency analysis on nearly 100 data-rich protein targets. We show that resultant substructural clusters are useful visualization tools for understanding substructure-activity relationships. We go on to highlight connections between our models’ saliency predictions and observations made in the medicinal chemistry literature, focusing on four case studies of past lead finding and lead optimization campaigns. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s10822-021-00421-6. Springer International Publishing 2021-11-24 2022 /pmc/articles/PMC9325818/ /pubmed/34817762 http://dx.doi.org/10.1007/s10822-021-00421-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Weber, Jeffrey K.
Morrone, Joseph A.
Bagchi, Sugato
Pabon, Jan D. Estrada
Kang, Seung-gu
Zhang, Leili
Cornell, Wendy D.
Simplified, interpretable graph convolutional neural networks for small molecule activity prediction
title Simplified, interpretable graph convolutional neural networks for small molecule activity prediction
title_full Simplified, interpretable graph convolutional neural networks for small molecule activity prediction
title_fullStr Simplified, interpretable graph convolutional neural networks for small molecule activity prediction
title_full_unstemmed Simplified, interpretable graph convolutional neural networks for small molecule activity prediction
title_short Simplified, interpretable graph convolutional neural networks for small molecule activity prediction
title_sort simplified, interpretable graph convolutional neural networks for small molecule activity prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9325818/
https://www.ncbi.nlm.nih.gov/pubmed/34817762
http://dx.doi.org/10.1007/s10822-021-00421-6
work_keys_str_mv AT weberjeffreyk simplifiedinterpretablegraphconvolutionalneuralnetworksforsmallmoleculeactivityprediction
AT morronejosepha simplifiedinterpretablegraphconvolutionalneuralnetworksforsmallmoleculeactivityprediction
AT bagchisugato simplifiedinterpretablegraphconvolutionalneuralnetworksforsmallmoleculeactivityprediction
AT pabonjandestrada simplifiedinterpretablegraphconvolutionalneuralnetworksforsmallmoleculeactivityprediction
AT kangseunggu simplifiedinterpretablegraphconvolutionalneuralnetworksforsmallmoleculeactivityprediction
AT zhangleili simplifiedinterpretablegraphconvolutionalneuralnetworksforsmallmoleculeactivityprediction
AT cornellwendyd simplifiedinterpretablegraphconvolutionalneuralnetworksforsmallmoleculeactivityprediction