Cargando…

Multi-channel PINN: investigating scalable and transferable neural networks for drug discovery

Analysis of compound–protein interactions (CPIs) has become a crucial prerequisite for drug discovery and drug repositioning. In vitro experiments are commonly used in identifying CPIs, but it is not feasible to discover the molecular and proteomic space only through experimental approaches. Machine...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lee, Munhwan, Kim, Hyeyeon, Joe, Hyunwhan, Kim, Hong-Gee
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer International Publishing 2019
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6617572/ https://www.ncbi.nlm.nih.gov/pubmed/31289963 http://dx.doi.org/10.1186/s13321-019-0368-1

_version_	1783433720870469632
author	Lee, Munhwan Kim, Hyeyeon Joe, Hyunwhan Kim, Hong-Gee
author_facet	Lee, Munhwan Kim, Hyeyeon Joe, Hyunwhan Kim, Hong-Gee
author_sort	Lee, Munhwan
collection	PubMed
description	Analysis of compound–protein interactions (CPIs) has become a crucial prerequisite for drug discovery and drug repositioning. In vitro experiments are commonly used in identifying CPIs, but it is not feasible to discover the molecular and proteomic space only through experimental approaches. Machine learning’s advances in predicting CPIs have made significant contributions to drug discovery. Deep neural networks (DNNs), which have recently been applied to predict CPIs, performed better than other shallow classifiers. However, such techniques commonly require a considerable volume of dense data for each training target. Although the number of publicly available CPI data has grown rapidly, public data is still sparse and has a large number of measurement errors. In this paper, we propose a novel method, Multi-channel PINN, to fully utilize sparse data in terms of representation learning. With representation learning, Multi-channel PINN can utilize three approaches of DNNs which are a classifier, a feature extractor, and an end-to-end learner. Multi-channel PINN can be fed with both low and high levels of representations and incorporates each of them by utilizing all approaches within a single model. To fully utilize sparse public data, we additionally explore the potential of transferring representations from training tasks to test tasks. As a proof of concept, Multi-channel PINN was evaluated on fifteen combinations of feature pairs to investigate how they affect the performance in terms of highest performance, initial performance, and convergence speed. The experimental results obtained indicate that the multi-channel models using protein features performed better than single-channel models or multi-channel models using compound features. Therefore, Multi-channel PINN can be advantageous when used with appropriate representations. Additionally, we pretrained models on a training task then finetuned them on a test task to figure out whether Multi-channel PINN can capture general representations for compounds and proteins. We found that there were significant differences in performance between pretrained models and non-pretrained models. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13321-019-0368-1) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-6617572
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	Springer International Publishing
record_format	MEDLINE/PubMed
spelling	pubmed-66175722019-07-22 Multi-channel PINN: investigating scalable and transferable neural networks for drug discovery Lee, Munhwan Kim, Hyeyeon Joe, Hyunwhan Kim, Hong-Gee J Cheminform Research Article Analysis of compound–protein interactions (CPIs) has become a crucial prerequisite for drug discovery and drug repositioning. In vitro experiments are commonly used in identifying CPIs, but it is not feasible to discover the molecular and proteomic space only through experimental approaches. Machine learning’s advances in predicting CPIs have made significant contributions to drug discovery. Deep neural networks (DNNs), which have recently been applied to predict CPIs, performed better than other shallow classifiers. However, such techniques commonly require a considerable volume of dense data for each training target. Although the number of publicly available CPI data has grown rapidly, public data is still sparse and has a large number of measurement errors. In this paper, we propose a novel method, Multi-channel PINN, to fully utilize sparse data in terms of representation learning. With representation learning, Multi-channel PINN can utilize three approaches of DNNs which are a classifier, a feature extractor, and an end-to-end learner. Multi-channel PINN can be fed with both low and high levels of representations and incorporates each of them by utilizing all approaches within a single model. To fully utilize sparse public data, we additionally explore the potential of transferring representations from training tasks to test tasks. As a proof of concept, Multi-channel PINN was evaluated on fifteen combinations of feature pairs to investigate how they affect the performance in terms of highest performance, initial performance, and convergence speed. The experimental results obtained indicate that the multi-channel models using protein features performed better than single-channel models or multi-channel models using compound features. Therefore, Multi-channel PINN can be advantageous when used with appropriate representations. Additionally, we pretrained models on a training task then finetuned them on a test task to figure out whether Multi-channel PINN can capture general representations for compounds and proteins. We found that there were significant differences in performance between pretrained models and non-pretrained models. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13321-019-0368-1) contains supplementary material, which is available to authorized users. Springer International Publishing 2019-07-09 /pmc/articles/PMC6617572/ /pubmed/31289963 http://dx.doi.org/10.1186/s13321-019-0368-1 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Article Lee, Munhwan Kim, Hyeyeon Joe, Hyunwhan Kim, Hong-Gee Multi-channel PINN: investigating scalable and transferable neural networks for drug discovery
title	Multi-channel PINN: investigating scalable and transferable neural networks for drug discovery
title_full	Multi-channel PINN: investigating scalable and transferable neural networks for drug discovery
title_fullStr	Multi-channel PINN: investigating scalable and transferable neural networks for drug discovery
title_full_unstemmed	Multi-channel PINN: investigating scalable and transferable neural networks for drug discovery
title_short	Multi-channel PINN: investigating scalable and transferable neural networks for drug discovery
title_sort	multi-channel pinn: investigating scalable and transferable neural networks for drug discovery
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6617572/ https://www.ncbi.nlm.nih.gov/pubmed/31289963 http://dx.doi.org/10.1186/s13321-019-0368-1
work_keys_str_mv	AT leemunhwan multichannelpinninvestigatingscalableandtransferableneuralnetworksfordrugdiscovery AT kimhyeyeon multichannelpinninvestigatingscalableandtransferableneuralnetworksfordrugdiscovery AT joehyunwhan multichannelpinninvestigatingscalableandtransferableneuralnetworksfordrugdiscovery AT kimhonggee multichannelpinninvestigatingscalableandtransferableneuralnetworksfordrugdiscovery

Multi-channel PINN: investigating scalable and transferable neural networks for drug discovery

Ejemplares similares