Cargando…

Predicting protein network topology clusters from chemical structure using deep learning

Comparing chemical structures to infer protein targets and functions is a common approach, but basing comparisons on chemical similarity alone can be misleading. Here we present a methodology for predicting target protein clusters using deep neural networks. The model is trained on clusters of compo...

Descripción completa

Detalles Bibliográficos
Autores principales: Sreenivasan, Akshai P., Harrison, Philip J, Schaal, Wesley, Matuszewski, Damian J., Kultima, Kim, Spjuth, Ola
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9284831/
https://www.ncbi.nlm.nih.gov/pubmed/35841114
http://dx.doi.org/10.1186/s13321-022-00622-7
_version_ 1784747650304704512
author Sreenivasan, Akshai P.
Harrison, Philip J
Schaal, Wesley
Matuszewski, Damian J.
Kultima, Kim
Spjuth, Ola
author_facet Sreenivasan, Akshai P.
Harrison, Philip J
Schaal, Wesley
Matuszewski, Damian J.
Kultima, Kim
Spjuth, Ola
author_sort Sreenivasan, Akshai P.
collection PubMed
description Comparing chemical structures to infer protein targets and functions is a common approach, but basing comparisons on chemical similarity alone can be misleading. Here we present a methodology for predicting target protein clusters using deep neural networks. The model is trained on clusters of compounds based on similarities calculated from combined compound-protein and protein-protein interaction data using a network topology approach. We compare several deep learning architectures including both convolutional and recurrent neural networks. The best performing method, the recurrent neural network architecture MolPMoFiT, achieved an F1 score approaching 0.9 on a held-out test set of 8907 compounds. In addition, in-depth analysis on a set of eleven well-studied chemical compounds with known functions showed that predictions were justifiable for all but one of the chemicals. Four of the compounds, similar in their molecular structure but with dissimilarities in their function, revealed advantages of our method compared to using chemical similarity. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-022-00622-7.
format Online
Article
Text
id pubmed-9284831
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-92848312022-07-16 Predicting protein network topology clusters from chemical structure using deep learning Sreenivasan, Akshai P. Harrison, Philip J Schaal, Wesley Matuszewski, Damian J. Kultima, Kim Spjuth, Ola J Cheminform Research Article Comparing chemical structures to infer protein targets and functions is a common approach, but basing comparisons on chemical similarity alone can be misleading. Here we present a methodology for predicting target protein clusters using deep neural networks. The model is trained on clusters of compounds based on similarities calculated from combined compound-protein and protein-protein interaction data using a network topology approach. We compare several deep learning architectures including both convolutional and recurrent neural networks. The best performing method, the recurrent neural network architecture MolPMoFiT, achieved an F1 score approaching 0.9 on a held-out test set of 8907 compounds. In addition, in-depth analysis on a set of eleven well-studied chemical compounds with known functions showed that predictions were justifiable for all but one of the chemicals. Four of the compounds, similar in their molecular structure but with dissimilarities in their function, revealed advantages of our method compared to using chemical similarity. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-022-00622-7. Springer International Publishing 2022-07-15 /pmc/articles/PMC9284831/ /pubmed/35841114 http://dx.doi.org/10.1186/s13321-022-00622-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Sreenivasan, Akshai P.
Harrison, Philip J
Schaal, Wesley
Matuszewski, Damian J.
Kultima, Kim
Spjuth, Ola
Predicting protein network topology clusters from chemical structure using deep learning
title Predicting protein network topology clusters from chemical structure using deep learning
title_full Predicting protein network topology clusters from chemical structure using deep learning
title_fullStr Predicting protein network topology clusters from chemical structure using deep learning
title_full_unstemmed Predicting protein network topology clusters from chemical structure using deep learning
title_short Predicting protein network topology clusters from chemical structure using deep learning
title_sort predicting protein network topology clusters from chemical structure using deep learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9284831/
https://www.ncbi.nlm.nih.gov/pubmed/35841114
http://dx.doi.org/10.1186/s13321-022-00622-7
work_keys_str_mv AT sreenivasanakshaip predictingproteinnetworktopologyclustersfromchemicalstructureusingdeeplearning
AT harrisonphilipj predictingproteinnetworktopologyclustersfromchemicalstructureusingdeeplearning
AT schaalwesley predictingproteinnetworktopologyclustersfromchemicalstructureusingdeeplearning
AT matuszewskidamianj predictingproteinnetworktopologyclustersfromchemicalstructureusingdeeplearning
AT kultimakim predictingproteinnetworktopologyclustersfromchemicalstructureusingdeeplearning
AT spjuthola predictingproteinnetworktopologyclustersfromchemicalstructureusingdeeplearning