Cargando…

Comprehensive Exploration of Target‐specific Ligands Using a Graph Convolution Neural Network

Machine learning approaches are widely used to evaluate ligand activities of chemical compounds toward potential target proteins. Especially, exploration of highly selective ligands is important for the development of new drugs with higher safety. One difficulty in constructing well‐performing model...

Descripción completa

Detalles Bibliográficos
Autores principales: Miyazaki, Yu, Ono, Naoaki, Huang, Ming, Altaf‐Ul‐Amin, Md., Kanaya, Shigehiko
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7050504/
https://www.ncbi.nlm.nih.gov/pubmed/31815371
http://dx.doi.org/10.1002/minf.201900095
_version_ 1783502622113660928
author Miyazaki, Yu
Ono, Naoaki
Huang, Ming
Altaf‐Ul‐Amin, Md.
Kanaya, Shigehiko
author_facet Miyazaki, Yu
Ono, Naoaki
Huang, Ming
Altaf‐Ul‐Amin, Md.
Kanaya, Shigehiko
author_sort Miyazaki, Yu
collection PubMed
description Machine learning approaches are widely used to evaluate ligand activities of chemical compounds toward potential target proteins. Especially, exploration of highly selective ligands is important for the development of new drugs with higher safety. One difficulty in constructing well‐performing model predicting such a ligand activity is the absence of data on true negative ligand‐protein interactions. In other words, in many cases we can access to plenty of information on ligands that bind to specific protein, but less or almost no information showing that compounds don't bind to proteins of interest. In this paper, we suggested an approach to comprehensively explore candidates for ligands specifically targeting toward proteins without using information on the true negative interaction. The approach consists of 4 steps: 1) constructing a model that distinguishes ligands for the target proteins of interest from those targeting proteins that cause off‐target effects, by using graph convolution neural network (GCNN); 2) extracting feature vectors after convolution/pooling processes and mapping their principal components in two dimensions; 3) specifying regions with higher density for two ligand groups through kernel density estimation; and 4) investigating the distribution of compounds for exploration on the density map using the same classifier and decomposer. If compounds for exploration are located in higher‐density regions of ligand compounds, these compounds can be regarded as having relatively high binding affinity to the major target or off‐target proteins compared with other compounds. We applied the approach to the exploration of ligands for β‐site amyloid precursor protein [APP]‐cleaving enzyme 1 (BACE1), a major target for Alzheimer Disease (AD), with less off‐target effect toward cathepsin D. We demonstrated that the density region of BACE1 and cathepsin D ligands are well‐divided, and a group of natural compounds as a target for exploration of new drug candidates also has significantly different distribution on the density map.
format Online
Article
Text
id pubmed-7050504
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-70505042020-03-09 Comprehensive Exploration of Target‐specific Ligands Using a Graph Convolution Neural Network Miyazaki, Yu Ono, Naoaki Huang, Ming Altaf‐Ul‐Amin, Md. Kanaya, Shigehiko Mol Inform Full Papers Machine learning approaches are widely used to evaluate ligand activities of chemical compounds toward potential target proteins. Especially, exploration of highly selective ligands is important for the development of new drugs with higher safety. One difficulty in constructing well‐performing model predicting such a ligand activity is the absence of data on true negative ligand‐protein interactions. In other words, in many cases we can access to plenty of information on ligands that bind to specific protein, but less or almost no information showing that compounds don't bind to proteins of interest. In this paper, we suggested an approach to comprehensively explore candidates for ligands specifically targeting toward proteins without using information on the true negative interaction. The approach consists of 4 steps: 1) constructing a model that distinguishes ligands for the target proteins of interest from those targeting proteins that cause off‐target effects, by using graph convolution neural network (GCNN); 2) extracting feature vectors after convolution/pooling processes and mapping their principal components in two dimensions; 3) specifying regions with higher density for two ligand groups through kernel density estimation; and 4) investigating the distribution of compounds for exploration on the density map using the same classifier and decomposer. If compounds for exploration are located in higher‐density regions of ligand compounds, these compounds can be regarded as having relatively high binding affinity to the major target or off‐target proteins compared with other compounds. We applied the approach to the exploration of ligands for β‐site amyloid precursor protein [APP]‐cleaving enzyme 1 (BACE1), a major target for Alzheimer Disease (AD), with less off‐target effect toward cathepsin D. We demonstrated that the density region of BACE1 and cathepsin D ligands are well‐divided, and a group of natural compounds as a target for exploration of new drug candidates also has significantly different distribution on the density map. John Wiley and Sons Inc. 2019-12-09 2020-01 /pmc/articles/PMC7050504/ /pubmed/31815371 http://dx.doi.org/10.1002/minf.201900095 Text en © 2019 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA. This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.
spellingShingle Full Papers
Miyazaki, Yu
Ono, Naoaki
Huang, Ming
Altaf‐Ul‐Amin, Md.
Kanaya, Shigehiko
Comprehensive Exploration of Target‐specific Ligands Using a Graph Convolution Neural Network
title Comprehensive Exploration of Target‐specific Ligands Using a Graph Convolution Neural Network
title_full Comprehensive Exploration of Target‐specific Ligands Using a Graph Convolution Neural Network
title_fullStr Comprehensive Exploration of Target‐specific Ligands Using a Graph Convolution Neural Network
title_full_unstemmed Comprehensive Exploration of Target‐specific Ligands Using a Graph Convolution Neural Network
title_short Comprehensive Exploration of Target‐specific Ligands Using a Graph Convolution Neural Network
title_sort comprehensive exploration of target‐specific ligands using a graph convolution neural network
topic Full Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7050504/
https://www.ncbi.nlm.nih.gov/pubmed/31815371
http://dx.doi.org/10.1002/minf.201900095
work_keys_str_mv AT miyazakiyu comprehensiveexplorationoftargetspecificligandsusingagraphconvolutionneuralnetwork
AT ononaoaki comprehensiveexplorationoftargetspecificligandsusingagraphconvolutionneuralnetwork
AT huangming comprehensiveexplorationoftargetspecificligandsusingagraphconvolutionneuralnetwork
AT altafulaminmd comprehensiveexplorationoftargetspecificligandsusingagraphconvolutionneuralnetwork
AT kanayashigehiko comprehensiveexplorationoftargetspecificligandsusingagraphconvolutionneuralnetwork