Cargando…
Comprehensive Exploration of Target‐specific Ligands Using a Graph Convolution Neural Network
Machine learning approaches are widely used to evaluate ligand activities of chemical compounds toward potential target proteins. Especially, exploration of highly selective ligands is important for the development of new drugs with higher safety. One difficulty in constructing well‐performing model...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7050504/ https://www.ncbi.nlm.nih.gov/pubmed/31815371 http://dx.doi.org/10.1002/minf.201900095 |
_version_ | 1783502622113660928 |
---|---|
author | Miyazaki, Yu Ono, Naoaki Huang, Ming Altaf‐Ul‐Amin, Md. Kanaya, Shigehiko |
author_facet | Miyazaki, Yu Ono, Naoaki Huang, Ming Altaf‐Ul‐Amin, Md. Kanaya, Shigehiko |
author_sort | Miyazaki, Yu |
collection | PubMed |
description | Machine learning approaches are widely used to evaluate ligand activities of chemical compounds toward potential target proteins. Especially, exploration of highly selective ligands is important for the development of new drugs with higher safety. One difficulty in constructing well‐performing model predicting such a ligand activity is the absence of data on true negative ligand‐protein interactions. In other words, in many cases we can access to plenty of information on ligands that bind to specific protein, but less or almost no information showing that compounds don't bind to proteins of interest. In this paper, we suggested an approach to comprehensively explore candidates for ligands specifically targeting toward proteins without using information on the true negative interaction. The approach consists of 4 steps: 1) constructing a model that distinguishes ligands for the target proteins of interest from those targeting proteins that cause off‐target effects, by using graph convolution neural network (GCNN); 2) extracting feature vectors after convolution/pooling processes and mapping their principal components in two dimensions; 3) specifying regions with higher density for two ligand groups through kernel density estimation; and 4) investigating the distribution of compounds for exploration on the density map using the same classifier and decomposer. If compounds for exploration are located in higher‐density regions of ligand compounds, these compounds can be regarded as having relatively high binding affinity to the major target or off‐target proteins compared with other compounds. We applied the approach to the exploration of ligands for β‐site amyloid precursor protein [APP]‐cleaving enzyme 1 (BACE1), a major target for Alzheimer Disease (AD), with less off‐target effect toward cathepsin D. We demonstrated that the density region of BACE1 and cathepsin D ligands are well‐divided, and a group of natural compounds as a target for exploration of new drug candidates also has significantly different distribution on the density map. |
format | Online Article Text |
id | pubmed-7050504 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-70505042020-03-09 Comprehensive Exploration of Target‐specific Ligands Using a Graph Convolution Neural Network Miyazaki, Yu Ono, Naoaki Huang, Ming Altaf‐Ul‐Amin, Md. Kanaya, Shigehiko Mol Inform Full Papers Machine learning approaches are widely used to evaluate ligand activities of chemical compounds toward potential target proteins. Especially, exploration of highly selective ligands is important for the development of new drugs with higher safety. One difficulty in constructing well‐performing model predicting such a ligand activity is the absence of data on true negative ligand‐protein interactions. In other words, in many cases we can access to plenty of information on ligands that bind to specific protein, but less or almost no information showing that compounds don't bind to proteins of interest. In this paper, we suggested an approach to comprehensively explore candidates for ligands specifically targeting toward proteins without using information on the true negative interaction. The approach consists of 4 steps: 1) constructing a model that distinguishes ligands for the target proteins of interest from those targeting proteins that cause off‐target effects, by using graph convolution neural network (GCNN); 2) extracting feature vectors after convolution/pooling processes and mapping their principal components in two dimensions; 3) specifying regions with higher density for two ligand groups through kernel density estimation; and 4) investigating the distribution of compounds for exploration on the density map using the same classifier and decomposer. If compounds for exploration are located in higher‐density regions of ligand compounds, these compounds can be regarded as having relatively high binding affinity to the major target or off‐target proteins compared with other compounds. We applied the approach to the exploration of ligands for β‐site amyloid precursor protein [APP]‐cleaving enzyme 1 (BACE1), a major target for Alzheimer Disease (AD), with less off‐target effect toward cathepsin D. We demonstrated that the density region of BACE1 and cathepsin D ligands are well‐divided, and a group of natural compounds as a target for exploration of new drug candidates also has significantly different distribution on the density map. John Wiley and Sons Inc. 2019-12-09 2020-01 /pmc/articles/PMC7050504/ /pubmed/31815371 http://dx.doi.org/10.1002/minf.201900095 Text en © 2019 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA. This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made. |
spellingShingle | Full Papers Miyazaki, Yu Ono, Naoaki Huang, Ming Altaf‐Ul‐Amin, Md. Kanaya, Shigehiko Comprehensive Exploration of Target‐specific Ligands Using a Graph Convolution Neural Network |
title | Comprehensive Exploration of Target‐specific Ligands Using a Graph Convolution Neural Network |
title_full | Comprehensive Exploration of Target‐specific Ligands Using a Graph Convolution Neural Network |
title_fullStr | Comprehensive Exploration of Target‐specific Ligands Using a Graph Convolution Neural Network |
title_full_unstemmed | Comprehensive Exploration of Target‐specific Ligands Using a Graph Convolution Neural Network |
title_short | Comprehensive Exploration of Target‐specific Ligands Using a Graph Convolution Neural Network |
title_sort | comprehensive exploration of target‐specific ligands using a graph convolution neural network |
topic | Full Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7050504/ https://www.ncbi.nlm.nih.gov/pubmed/31815371 http://dx.doi.org/10.1002/minf.201900095 |
work_keys_str_mv | AT miyazakiyu comprehensiveexplorationoftargetspecificligandsusingagraphconvolutionneuralnetwork AT ononaoaki comprehensiveexplorationoftargetspecificligandsusingagraphconvolutionneuralnetwork AT huangming comprehensiveexplorationoftargetspecificligandsusingagraphconvolutionneuralnetwork AT altafulaminmd comprehensiveexplorationoftargetspecificligandsusingagraphconvolutionneuralnetwork AT kanayashigehiko comprehensiveexplorationoftargetspecificligandsusingagraphconvolutionneuralnetwork |