Cargando…

Deep learning integration of molecular and interactome data for protein–compound interaction prediction

MOTIVATION: Virtual screening, which can computationally predict the presence or absence of protein–compound interactions, has attracted attention as a large-scale, low-cost, and short-term search method for seed compounds. Existing machine learning methods for predicting protein–compound interactio...

Descripción completa

Detalles Bibliográficos
Autores principales:	Watanabe, Narumi, Ohnuki, Yuuto, Sakakibara, Yasubumi
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer International Publishing 2021
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8088618/ https://www.ncbi.nlm.nih.gov/pubmed/33933121 http://dx.doi.org/10.1186/s13321-021-00513-3

_version_	1783686878385405952
author	Watanabe, Narumi Ohnuki, Yuuto Sakakibara, Yasubumi
author_facet	Watanabe, Narumi Ohnuki, Yuuto Sakakibara, Yasubumi
author_sort	Watanabe, Narumi
collection	PubMed
description	MOTIVATION: Virtual screening, which can computationally predict the presence or absence of protein–compound interactions, has attracted attention as a large-scale, low-cost, and short-term search method for seed compounds. Existing machine learning methods for predicting protein–compound interactions are largely divided into those based on molecular structure data and those based on network data. The former utilize information on proteins and compounds, such as amino acid sequences and chemical structures; the latter rely on interaction network data, such as protein–protein interactions and compound–compound interactions. However, there have been few attempts to combine both types of data in molecular information and interaction networks. RESULTS: We developed a deep learning-based method that integrates protein features, compound features, and multiple types of interactome data to predict protein–compound interactions. We designed three benchmark datasets with different difficulties and applied them to evaluate the prediction method. The performance evaluations show that our deep learning framework for integrating molecular structure data and interactome data outperforms state-of-the-art machine learning methods for protein–compound interaction prediction tasks. The performance improvement is statistically significant according to the Wilcoxon signed-rank test. This finding reveals that the multi-interactome data captures perspectives other than amino acid sequence homology and chemical structure similarity and that both types of data synergistically improve the prediction accuracy. Furthermore, experiments on the three benchmark datasets show that our method is more robust than existing methods in accurately predicting interactions between proteins and compounds that are unseen in training samples.
format	Online Article Text
id	pubmed-8088618
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Springer International Publishing
record_format	MEDLINE/PubMed
spelling	pubmed-80886182021-05-03 Deep learning integration of molecular and interactome data for protein–compound interaction prediction Watanabe, Narumi Ohnuki, Yuuto Sakakibara, Yasubumi J Cheminform Research Article MOTIVATION: Virtual screening, which can computationally predict the presence or absence of protein–compound interactions, has attracted attention as a large-scale, low-cost, and short-term search method for seed compounds. Existing machine learning methods for predicting protein–compound interactions are largely divided into those based on molecular structure data and those based on network data. The former utilize information on proteins and compounds, such as amino acid sequences and chemical structures; the latter rely on interaction network data, such as protein–protein interactions and compound–compound interactions. However, there have been few attempts to combine both types of data in molecular information and interaction networks. RESULTS: We developed a deep learning-based method that integrates protein features, compound features, and multiple types of interactome data to predict protein–compound interactions. We designed three benchmark datasets with different difficulties and applied them to evaluate the prediction method. The performance evaluations show that our deep learning framework for integrating molecular structure data and interactome data outperforms state-of-the-art machine learning methods for protein–compound interaction prediction tasks. The performance improvement is statistically significant according to the Wilcoxon signed-rank test. This finding reveals that the multi-interactome data captures perspectives other than amino acid sequence homology and chemical structure similarity and that both types of data synergistically improve the prediction accuracy. Furthermore, experiments on the three benchmark datasets show that our method is more robust than existing methods in accurately predicting interactions between proteins and compounds that are unseen in training samples. Springer International Publishing 2021-05-01 /pmc/articles/PMC8088618/ /pubmed/33933121 http://dx.doi.org/10.1186/s13321-021-00513-3 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Article Watanabe, Narumi Ohnuki, Yuuto Sakakibara, Yasubumi Deep learning integration of molecular and interactome data for protein–compound interaction prediction
title	Deep learning integration of molecular and interactome data for protein–compound interaction prediction
title_full	Deep learning integration of molecular and interactome data for protein–compound interaction prediction
title_fullStr	Deep learning integration of molecular and interactome data for protein–compound interaction prediction
title_full_unstemmed	Deep learning integration of molecular and interactome data for protein–compound interaction prediction
title_short	Deep learning integration of molecular and interactome data for protein–compound interaction prediction
title_sort	deep learning integration of molecular and interactome data for protein–compound interaction prediction
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8088618/ https://www.ncbi.nlm.nih.gov/pubmed/33933121 http://dx.doi.org/10.1186/s13321-021-00513-3
work_keys_str_mv	AT watanabenarumi deeplearningintegrationofmolecularandinteractomedataforproteincompoundinteractionprediction AT ohnukiyuuto deeplearningintegrationofmolecularandinteractomedataforproteincompoundinteractionprediction AT sakakibarayasubumi deeplearningintegrationofmolecularandinteractomedataforproteincompoundinteractionprediction

Deep learning integration of molecular and interactome data for protein–compound interaction prediction

Ejemplares similares